Duet3D Logo Duet3D
    • Tags
    • Documentation
    • Order
    • Register
    • Login

    DCS Crash with 3.01-R10 / DWC 2.1.5 / DSF 2.1.1

    Scheduled Pinned Locked Moved
    Beta Firmware
    12
    132
    7.4k
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • Garfieldundefined
      Garfield
      last edited by

      At the time you couldn't ssh - the RPi wasn't responsive at all, if you had an SSH session open it just stopped responding.

      I will try the screen though - what does that offer? - a non DWC web gui ?

      A Former User? 1 Reply Last reply Reply Quote 0
      • A Former User?
        A Former User @jay_s_uk
        last edited by

        @jay_s_uk said in DCS Crash with 3.01-R10 / DWC 2.1.5 / DSF 2.1.1:

        Spoke too soon.

        if you've got access to console/ssh could you also run something like top and see if it spots something to suggest DCS get stuck in a loop?

        jay_s_ukundefined 1 Reply Last reply Reply Quote 0
        • A Former User?
          A Former User @Garfield
          last edited by A Former User

          @Garfield said in DCS Crash with 3.01-R10 / DWC 2.1.5 / DSF 2.1.1:

          I will try the screen though - what does that offer?

          its a terminal multiplexer / window manager or sometihng like so. it achieves that dcs will keep running if you have a network glitch. if you run dcs in the foreground and ssh stops all the processes in that shell are terminated - with screen they can keep running.

          ChrisPundefined 1 Reply Last reply Reply Quote 0
          • Garfieldundefined
            Garfield
            last edited by Garfield

            First message

            [warn] RepRapFirmware got a bad header checksum

            and then [screen is terminating]

            Danalundefined 1 Reply Last reply Reply Quote 0
            • Danalundefined
              Danal @Garfield
              last edited by

              @Garfield said in DCS Crash with 3.01-R10 / DWC 2.1.5 / DSF 2.1.1:

              [warn] RepRapFirmware got a bad header checksum

              I was getting those when I very first setup my D3 & Pi. Firmly re-seating the ribbon cable on both ends cleared them up.

              Delta / Kossel printer fanatic

              1 Reply Last reply Reply Quote 0
              • jay_s_ukundefined
                jay_s_uk @A Former User
                last edited by

                @bearer said in DCS Crash with 3.01-R10 / DWC 2.1.5 / DSF 2.1.1:

                @jay_s_uk said in DCS Crash with 3.01-R10 / DWC 2.1.5 / DSF 2.1.1:

                Spoke too soon.

                if you've got access to console/ssh could you also run something like top and see if it spots something to suggest DCS get stuck in a loop?

                Terminal dies as soon as the web connection does.
                I'll try and run DCS through the session and see what it spits out. It'll be later on though as I'm on bedtime duty now.

                Owns various duet boards and is the main wiki maintainer for the Teamgloomy LPC/STM32 port of RRF. Assume I'm running whatever the latest beta/stable build is

                1 Reply Last reply Reply Quote 0
                • A Former User?
                  A Former User
                  last edited by

                  the only thing I could think of that with respect to DCS to complaining about the RDY pin is basically an interrupt storm which can grind the Pi to a halt. not sure if relevant though.

                  1 Reply Last reply Reply Quote 0
                  • Garfieldundefined
                    Garfield
                    last edited by Garfield

                    Well this sucks - it's just done the same thing with RC9 ..... I had screen running at the time and it reported nothing ....

                    [debug] Assigning filament Prusament PETG to extruder drive 0
                    [debug] Requesting update of key boards, seq 0 -> 0
                    [debug] Requesting update of key directories, seq 0 -> 0
                    [debug] Requesting update of key fans, seq 0 -> 7
                    [debug] Requesting update of key heat, seq 0 -> 7
                    [debug] Requesting update of key inputs, seq 0 -> 0
                    [debug] Requesting update of key job, seq 0 -> 2
                    [debug] Requesting update of key move, seq 0 -> 30
                    [debug] Requesting update of key network, seq 0 -> 3
                    [debug] Requesting update of key sensors, seq 0 -> 4
                    [debug] Requesting update of key spindles, seq 0 -> 0
                    [debug] Requesting update of key state, seq 0 -> 1
                    [debug] Requesting update of key tools, seq 0 -> 5
                    [debug] Requesting update of key volumes, seq 0 -> 0
                    [debug] IPC#2: Got new UNIX connection, checking mode...
                    [debug] IPC#2: Subscription processor registered in Patch mode
                    [debug] IPC#3: Got new UNIX connection, checking mode...
                    [debug] Updated key boards
                    [debug] IPC#3: Command processor added
                    [debug] IPC#3: Received command AddUserSession
                    [debug] Updated key directories
                    [debug] Updated key fans
                    [debug] Updated key heat
                    [debug] Updated key inputs
                    [debug] Updated key job
                    [debug] Updated key move
                    [debug] Updated key network
                    [debug] IPC#4: Got new UNIX connection, checking mode...
                    [debug] IPC#4: Command processor added
                    [debug] IPC#4: Received command ResolvePath
                    [debug] IPC#5: Got new UNIX connection, checking mode...
                    [debug] IPC#5: Command processor added
                    [debug] IPC#4: Connection closed
                    [debug] IPC#5: Received command ResolvePath
                    [debug] IPC#6: Got new UNIX connection, checking mode...
                    [debug] IPC#5: Connection closed
                    [debug] IPC#7: Got new UNIX connection, checking mode...
                    [debug] IPC#6: Command processor added
                    [debug] IPC#6: Received command ResolvePath
                    [debug] IPC#7: Command processor added
                    [debug] IPC#7: Received command ResolvePath
                    [debug] IPC#6: Connection closed
                    [debug] IPC#7: Connection closed
                    [debug] Updated key sensors
                    [debug] IPC#8: Got new UNIX connection, checking mode...
                    [debug] IPC#8: Command processor added
                    [debug] IPC#8: Received command ResolvePath
                    [debug] IPC#8: Connection closed
                    [debug] Updated key state
                    [debug] Updated key tools
                    [debug] Updated key volumes
                    [debug] IPC#9: Got new UNIX connection, checking mode...
                    [debug] IPC#10: Got new UNIX connection, checking mode...
                    [debug] IPC#11: Got new UNIX connection, checking mode...
                    [debug] IPC#10: Command processor added
                    [debug] IPC#9: Command processor added
                    [debug] IPC#11: Command processor added
                    [debug] IPC#9: Received command ResolvePath
                    [debug] IPC#10: Received command ResolvePath
                    [debug] IPC#11: Received command ResolvePath
                    [debug] Requesting update of key job, seq 2 -> 3
                    [debug] Requesting update of key move, seq 30 -> 31
                    [debug] IPC#9: Connection closed
                    [debug] IPC#11: Connection closed
                    [debug] IPC#10: Connection closed
                    [debug] Updated key job
                    [debug] Updated key move
                    [debug] IPC#12: Got new UNIX connection, checking mode...
                    [debug] IPC#12: Subscription processor registered in Patch mode
                    [debug] IPC#13: Got new UNIX connection, checking mode...
                    [debug] IPC#13: Command processor added
                    [debug] IPC#13: Received command ResolvePath
                    
                    

                    I feel the need for a compatibility matrix for the 3 main components - which versions of RRF work wich versions of DWC.

                    I'd like to help the team figure this out but I haven't a clue where to start and I need the printer at least 'functional' even if there are known issues. Right now it isn't even functional.

                    I'm using a 4 Gig RPi 4 if it is worth anything (yes I know its OTT) ...

                    A Former User? 1 Reply Last reply Reply Quote 0
                    • A Former User?
                      A Former User @Garfield
                      last edited by

                      @Garfield said in DCS Crash with 3.01-R10 / DWC 2.1.5 / DSF 2.1.1:

                      Well this sucks - it's just done the same thing with RC9 ..... I had screen running at the time and it reported nothing ....

                      unfortunate, but still probably still helpfull for chrishamm. what happened to the ssh session timed out after a while or was it terminated immediately?

                      1 Reply Last reply Reply Quote 0
                      • Garfieldundefined
                        Garfield
                        last edited by

                        Terminated immediately ...

                        A Former User? 1 Reply Last reply Reply Quote 0
                        • ChrisPundefined
                          ChrisP @A Former User
                          last edited by

                          @bearer said in DCS Crash with 3.01-R10 / DWC 2.1.5 / DSF 2.1.1:

                          @Garfield said in DCS Crash with 3.01-R10 / DWC 2.1.5 / DSF 2.1.1:

                          I will try the screen though - what does that offer?

                          its a terminal multiplexer / window manager or sometihng like so. it achieves that dcs will keep running if you have a network glitch. if you run dcs in the foreground and ssh stops all the processes in that shell are terminated - with screen they can keep running.

                          It's not a network glitch. When it happens it tends to take out the entire Pi. Completely. So using screen isn't going to help. I've managed to get the network to stay up about 3 or 4 times out of 40 or so crashes, in which case screen isn't needed as you can still issue commands.... everything just takes an age to respond. But yeh, as soon as the SSH connection goes a power cycle is the only fix.

                          @Garfield said in DCS Crash with 3.01-R10 / DWC 2.1.5 / DSF 2.1.1:

                          Well this sucks - it's just done the same thing with RC9 ..... I had screen running at the time and it reported nothing ....

                          That sucks. It's only been RC10 that I've had this issue on. To the extent that the first time it happened I went and checked to see if my router had died.

                          A Former User? 1 Reply Last reply Reply Quote 0
                          • A Former User?
                            A Former User @Garfield
                            last edited by A Former User

                            @Garfield said in DCS Crash with 3.01-R10 / DWC 2.1.5 / DSF 2.1.1:

                            Terminated immediately ...

                            thats interesting; it means the cpu is able to relatively cleanly terminate the session as opposed to just freezing; although it doesn't help you. see below for correction.

                            Garfieldundefined 1 Reply Last reply Reply Quote 0
                            • Garfieldundefined
                              Garfield
                              last edited by Garfield

                              I wonder if something didn't uninstall or get overwritten in the 'downgrade' process. I never used RC9, I came straight from RC6

                              1 Reply Last reply Reply Quote 0
                              • A Former User?
                                A Former User @ChrisP
                                last edited by

                                @ChrisP said in DCS Crash with 3.01-R10 / DWC 2.1.5 / DSF 2.1.1:

                                So using screen isn't going to help.

                                mostly a precaution to avoid terminating the process if the session is interrupted for other reasons.

                                1 Reply Last reply Reply Quote 0
                                • Garfieldundefined
                                  Garfield @A Former User
                                  last edited by

                                  @bearer I should say that there is no disconnect - just zero repsonse - no messages, it just stops ... I use a commercial tool (Secure CRT 8.7) and it still thinks it is connected but hitting enter just causes an on screen line feed.

                                  A Former User? 1 Reply Last reply Reply Quote 0
                                  • A Former User?
                                    A Former User @Garfield
                                    last edited by

                                    @Garfield said in DCS Crash with 3.01-R10 / DWC 2.1.5 / DSF 2.1.1:

                                    @bearer I should say that there is no disconnect - just zero repsonse - no messages, it just stops ... I use a commercial tool (Secure CRT 8.7) and it still thinks it is connected but hitting enter just causes an on screen line feed.

                                    ah, that is more what i was expecting. it would terminate after 30-60 seconds or so as a timeout; in turn meaning the pi forze or was too busy to close the connection. still good info one way or the other.

                                    Garfieldundefined 1 Reply Last reply Reply Quote 0
                                    • Garfieldundefined
                                      Garfield @A Former User
                                      last edited by

                                      @bearer said in DCS Crash with 3.01-R10 / DWC 2.1.5 / DSF 2.1.1:

                                      it would terminate after 30-60 seconds or so as a timeout

                                      Yup it does ...

                                      1 Reply Last reply Reply Quote 0
                                      • A Former User?
                                        A Former User
                                        last edited by

                                        (i wonder if setting process affinity could isolate the hanging to leave a core running ssh etc, if possible in raspbian - anyways thats it for me today)

                                        1 Reply Last reply Reply Quote 0
                                        • Garfieldundefined
                                          Garfield
                                          last edited by Garfield

                                          I found this in the duet web server log

                                          Apr 26 19:19:11 duet3 DuetWebServer[1106]: info: Microsoft.AspNetCore.Mvc.Infrastructure.ControllerActionInvoker[3]
                                          Apr 26 19:19:11 duet3 DuetWebServer[1106]:       Route matched with {action = "Get", controller = "WebSocket"}. Executing controller action with signature System.Threading.Tasks.Task Get() on controller DuetWebServer.Controllers.WebSocketController (DuetWe
                                          Apr 26 19:19:11 duet3 DuetWebServer[1106]: fail: DuetWebServer.Controllers.WebSocketController[0]
                                          Apr 26 19:19:11 duet3 DuetWebServer[1106]:       [WebSocketController] DCS is not started
                                          Apr 26 19:19:11 duet3 DuetWebServer[1106]: info: Microsoft.AspNetCore.Mvc.Infrastructure.ControllerActionInvoker[2]
                                          Apr 26 19:19:11 duet3 DuetWebServer[1106]:       Executed action DuetWebServer.Controllers.WebSocketController.Get (DuetWebServer) in 339302.5643ms
                                          Apr 26 19:19:11 duet3 DuetWebServer[1106]: info: Microsoft.AspNetCore.Routing.EndpointMiddleware[1]
                                          Apr 26 19:19:11 duet3 DuetWebServer[1106]:       Executed endpoint 'DuetWebServer.Controllers.WebSocketController.Get (DuetWebServer)'
                                          Apr 26 19:19:11 duet3 DuetWebServer[1106]: info: Microsoft.AspNetCore.Hosting.Diagnostics[2]
                                          Apr 26 19:19:11 duet3 DuetWebServer[1106]:       Request finished in 339446.6255ms 101
                                          Apr 26 19:19:11 duet3 DuetWebServer[1106]: warn: DuetWebServer.Services.ModelObserver[0]
                                          Apr 26 19:19:11 duet3 DuetWebServer[1106]:       Failed to synchronize machine model
                                          Apr 26 19:19:11 duet3 DuetWebServer[1106]: System.Net.Sockets.SocketException (107): Transport endpoint is not connected
                                          Apr 26 19:19:11 duet3 DuetWebServer[1106]:    at DuetAPI.Utility.JsonHelper.ReceiveUtf8Json(Socket socket, CancellationToken cancellationToken) in /home/christian/duet/DuetSoftwareFramework/src/DuetAPI/Utility/JsonHelper.cs:line 154
                                          Apr 26 19:19:11 duet3 DuetWebServer[1106]:    at DuetAPIClient.BaseConnection.ReceiveJson(CancellationToken cancellationToken) in /home/christian/duet/DuetSoftwareFramework/src/DuetAPIClient/BaseConnection.cs:line 294
                                          Apr 26 19:19:11 duet3 DuetWebServer[1106]:    at DuetAPIClient.SubscribeConnection.GetMachineModelPatch(CancellationToken cancellationToken) in /home/christian/duet/DuetSoftwareFramework/src/DuetAPIClient/SubscribeConnection.cs:line 100
                                          Apr 26 19:19:11 duet3 DuetWebServer[1106]:    at DuetWebServer.Services.ModelObserver.Execute() in /home/christian/duet/DuetSoftwareFramework/src/DuetWebServer/Services/ModelObserver.cs:line 156
                                          Apr 26 19:19:11 duet3 DuetWebServer[1106]: info: Microsoft.AspNetCore.Mvc.Infrastructure.ControllerActionInvoker[2]
                                          
                                          Apr 26 19:59:29 duet3 DuetWebServer[1106]: warn: DuetWebServer.Services.ModelObserver[0]
                                          Apr 26 19:59:29 duet3 DuetWebServer[1106]:       Failed to synchronize machine model
                                          Apr 26 19:59:29 duet3 DuetWebServer[1106]: System.Net.Internals.SocketExceptionFactory+ExtendedSocketException (99): Cannot assign requested address /var/run/dsf/dcs.sock
                                          Apr 26 19:59:29 duet3 DuetWebServer[1106]:    at System.Net.Sockets.Socket.DoConnect(EndPoint endPointSnapshot, SocketAddress socketAddress)
                                          Apr 26 19:59:29 duet3 DuetWebServer[1106]:    at System.Net.Sockets.Socket.Connect(EndPoint remoteEP)
                                          Apr 26 19:59:29 duet3 DuetWebServer[1106]:    at DuetAPIClient.BaseConnection.Connect(ClientInitMessage initMessage, String socketPath, CancellationToken cancellationToken) in /home/christian/duet/DuetSoftwareFramework/src/DuetAPIClient/BaseConnection.cs:l
                                          Apr 26 19:59:29 duet3 DuetWebServer[1106]:    at DuetWebServer.Services.ModelObserver.Execute() in /home/christian/duet/DuetSoftwareFramework/src/DuetWebServer/Services/ModelObserver.cs:line 131
                                          Apr 26 19:59:30 duet3 DuetWebServer[1106]: info: Microsoft.AspNetCore.Hosting.Diagnostics[1]
                                          Apr 26 19:59:30 duet3 DuetWebServer[1106]:       Request starting HTTP/1.1 GET http://10.100.2.225/machine
                                          Apr 26 19:59:30 duet3 DuetWebServer[1106]: info: Microsoft.AspNetCore.Routing.EndpointMiddleware[0]
                                          Apr 26 19:59:30 duet3 DuetWebServer[1106]:       Executing endpoint 'DuetWebServer.Controllers.WebSocketController.Get (DuetWebServer)'
                                          Apr 26 19:59:30 duet3 DuetWebServer[1106]: info: Microsoft.AspNetCore.Mvc.Infrastructure.ControllerActionInvoker[3]
                                          Apr 26 19:59:30 duet3 DuetWebServer[1106]:       Route matched with {action = "Get", controller = "WebSocket"}. Executing controller action with signature System.Threading.Tasks.Task Get() on controller DuetWebServer.Controllers.WebSocketController (DuetWe
                                          Apr 26 19:59:30 duet3 DuetWebServer[1106]: fail: DuetWebServer.Controllers.WebSocketController[0]
                                          Apr 26 19:59:30 duet3 DuetWebServer[1106]:       [WebSocketController] DCS is not started
                                          Apr 26 19:59:30 duet3 DuetWebServer[1106]: info: Microsoft.AspNetCore.Mvc.Infrastructure.ControllerActionInvoker[2]
                                          Apr 26 19:59:30 duet3 DuetWebServer[1106]:       Executed action DuetWebServer.Controllers.WebSocketController.Get (DuetWebServer) in 6.6056ms
                                          Apr 26 19:59:30 duet3 DuetWebServer[1106]: info: Microsoft.AspNetCore.Routing.EndpointMiddleware[1]
                                          Apr 26 19:59:30 duet3 DuetWebServer[1106]:       Executed endpoint 'DuetWebServer.Controllers.WebSocketController.Get (DuetWebServer)'
                                          Apr 26 19:59:30 duet3 DuetWebServer[1106]: info: Microsoft.AspNetCore.Hosting.Diagnostics[2]
                                          Apr 26 19:59:30 duet3 DuetWebServer[1106]:       Request finished in 7.2117ms 101
                                          Apr 26 19:59:32 duet3 DuetWebServer[1106]: info: Microsoft.AspNetCore.Hosting.Diagnostics[1]
                                          Apr 26 19:59:32 duet3 DuetWebServer[1106]:       Request starting HTTP/1.1 GET http://10.100.2.225/machine
                                          Apr 26 19:59:32 duet3 DuetWebServer[1106]: info: Microsoft.AspNetCore.Routing.EndpointMiddleware[0]
                                          Apr 26 19:59:32 duet3 DuetWebServer[1106]:       Executing endpoint 'DuetWebServer.Controllers.WebSocketController.Get (DuetWebServer)'
                                          Apr 26 19:59:32 duet3 DuetWebServer[1106]: info: Microsoft.AspNetCore.Mvc.Infrastructure.ControllerActionInvoker[3]
                                          Apr 26 19:59:32 duet3 DuetWebServer[1106]:       Route matched with {action = "Get", controller = "WebSocket"}. Executing controller action with signature System.Threading.Tasks.Task Get() on controller DuetWebServer.Controllers.WebSocketController (DuetWe
                                          Apr 26 19:59:32 duet3 DuetWebServer[1106]: fail: DuetWebServer.Controllers.WebSocketController[0]
                                          Apr 26 19:59:32 duet3 DuetWebServer[1106]:       [WebSocketController] DCS is not started
                                          
                                          

                                          Same error even in RC9

                                          1 Reply Last reply Reply Quote 0
                                          • Danalundefined
                                            Danal
                                            last edited by Danal

                                            @Garfield said in DCS Crash with 3.01-R10 / DWC 2.1.5 / DSF 2.1.1:

                                            I feel the need for a compatibility matrix for the 3 main components - which versions of RRF work wich versions of DWC.

                                            Interesting. Duet3 + Pi 4B, 4 gig. I've been having random hangs that take a power cycle to clear. I am also on RC10, as of mid evening yesterday. I was not certain this was happening, nor certain that it started at RC whatever, so I have not reported anything, yet.

                                            Now that I think about it, it came on hard when I switched to RC10. I had to power cycle at least eight or ten times last night.

                                            I typically have a DWC, a VNC and a SSH running. They all just hang. Attempting to start a new SSH also hangs (note, not refused, connects and never gets a password prompt).

                                            I will see what data I can gather.

                                            Delta / Kossel printer fanatic

                                            1 Reply Last reply Reply Quote 0
                                            • First post
                                              Last post
                                            Unless otherwise noted, all forum content is licensed under CC-BY-SA