3.2b3: Lost connection to Linux about 15min after startup.



  • Hi,
    My Duet 3 board with a toolboard, running in SBC mode, just failed a print about 12 minutes in. The PanelDue showed the following errors:

    16 m Connected
    3m09 Lost connection to Linux
    3m09 Warning: Discarded msg src=10 typ=4510 RID=1985 exp 1988
    3m09 Cancelled printing file 0:/gcodes/VCP-Ormerod-Belt-Clip.gcode, print time was 0h 8m
    3m09: Error: Failed to switch off remote heater 1: Response timeout: CAN addr 10, req type 6013, RID=1985
    

    The CAN sync LEDs on main board and tool board still blink in sync. The 5V and VIN LEDs are on.

    Configuration files are at https://github.com/oliof/printerconfigs/tree/main/v-core_pro/system

    M115
    FIRMWARE_NAME: RepRapFirmware for Duet 3 MB6HC FIRMWARE_VERSION: 3.2-beta3 ELECTRONICS: Duet 3 MB6HC v0.6 or 1.0 FIRMWARE_DATE: 2020-11-07
    M115 B10
    Duet TOOL1LC firmware version 3.2beta3 (2020-11-07)
    
    

    M122 after power cycling the otherwise unresponsive board:

    M122
    === Diagnostics ===
    RepRapFirmware for Duet 3 MB6HC version 3.2-beta3 running on Duet 3 MB6HC v0.6 or 1.0 (SBC mode)
    Board ID: 08DGM-9T66A-G63SJ-6J1D6-3SD6R-9U0BA
    Used output buffers: 1 of 40 (10 max)
    === RTOS ===
    Static ram: 122124
    Dynamic ram: 137340 of which 0 recycled
    Never used RAM 132728, free system stack 180 words
    Tasks: Linux(ready,131) HEAT(blocked,353) CanReceiv(blocked,869) CanSender(blocked,371) CanClock(blocked,352) TMC(blocked,54) MAIN(running,1205) IDLE(ready,19)
    Owned mutexes: HTTP(MAIN)
    === Platform ===
    Last reset 00:06:54 ago, cause: power up
    Last software reset at 2020-11-08 16:33, reason: User, GCodes spinning, available RAM 131236, slot 1
    Software reset code 0x0003 HFSR 0x00000000 CFSR 0x00000000 ICSR 0x0044a000 BFAR 0x00000000 SP 0xffffffff Task MAIN
    Error status: 0x00
    MCU temperature: min 30.9, current 32.5, max 33.1
    Supply voltage: min 24.0, current 24.1, max 24.1, under voltage events: 0, over voltage events: 0, power good: yes
    12V rail voltage: min 12.2, current 12.2, max 12.3, under voltage events: 0
    Driver 0: position 0, standstill, reads 20689, writes 11 timeouts 0, SG min/max 0/0
    Driver 1: position 0, standstill, reads 20690, writes 11 timeouts 0, SG min/max 0/0
    Driver 2: position 0, standstill, reads 20691, writes 11 timeouts 0, SG min/max 0/0
    Driver 3: position 0, standstill, reads 20691, writes 11 timeouts 0, SG min/max 0/0
    Driver 4: position 0, standstill, reads 20692, writes 11 timeouts 0, SG min/max 0/0
    Driver 5: position 0, standstill, reads 20693, writes 11 timeouts 0, SG min/max 0/0
    Date/time: 2020-11-08 17:02:43
    Slowest loop: 758.65ms; fastest: 0.18ms
    === Storage ===
    Free file entries: 10
    SD card 0 not detected, interface speed: 37.5MBytes/sec
    SD card longest read time 0.0ms, write time 0.0ms, max retries 0
    === Move ===
    Hiccups: 0(0), FreeDm: 375, MinFreeDm: 375, MaxWait: 0ms
    Bed compensation in use: none, comp offset 0.000
    === MainDDARing ===
    Scheduled moves 0, completed moves 0, StepErrors 0, LaErrors 0, Underruns [0, 0, 0], CDDA state -1
    === AuxDDARing ===
    Scheduled moves 0, completed moves 0, StepErrors 0, LaErrors 0, Underruns [0, 0, 0], CDDA state -1
    === Heat ===
    Bed heaters = -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1, chamberHeaters = -1 -1 -1 -1
    === GCodes ===
    Segments left: 0
    Movement lock held by null
    HTTP* is doing "M122" in state(s) 0
    Telnet is idle in state(s) 0
    File is idle in state(s) 0
    USB is idle in state(s) 0
    Aux is idle in state(s) 0
    Trigger* is idle in state(s) 0
    Queue is idle in state(s) 0
    LCD is idle in state(s) 0
    SBC is idle in state(s) 0
    Daemon is idle in state(s) 0
    Aux2 is idle in state(s) 0
    Autopause is idle in state(s) 0
    Code queue is empty.
    === CAN ===
    Messages sent 10, send timeouts 10, longest wait 4ms for type 6024, free CAN buffers 47
    === SBC interface ===
    State: 0, failed transfers: 0
    Last transfer: 19ms ago
    RX/TX seq numbers: 12796/12797
    SPI underruns 0, overruns 0
    Number of disconnects: 0, IAP RAM available 0x20a78
    Buffer RX/TX: 0/0-0
    === Duet Control Server ===
    Duet Control Server v3.2.0-beta3
    Code buffer space: 4096
    Configured SPI speed: 8000000 Hz
    Full transfers per second: 10.85
    

    Link to the printed file

    VCP-Ormerod-Belt-Clip.gcode



  • Might be worth checking your linux syslog to see if there are any errors from DCS.
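
    A minimal sketch of one way to do that check, assuming DCS runs as a systemd unit named duetcontrolserver (an assumption here; adjust the name if your install differs). The helper names and the keyword list are illustrative only and not part of DSF:

```python
import subprocess

# Assumed unit name for Duet Control Server; change it if your system differs.
DCS_UNIT = "duetcontrolserver"

# Illustrative keywords only; extend to taste.
KEYWORDS = ("error", "warn", "fail", "timeout")


def dcs_log_command(since="1 hour ago"):
    """Build the journalctl command that dumps recent DCS log lines."""
    return ["journalctl", "-u", DCS_UNIT, "--since", since, "--no-pager"]


def find_problem_lines(lines):
    """Return the log lines that contain any of the keywords (case-insensitive)."""
    return [ln for ln in lines if any(k in ln.lower() for k in KEYWORDS)]


def read_dcs_log(since="1 hour ago"):
    """Run journalctl on the SBC and return its output as a list of lines."""
    result = subprocess.run(dcs_log_command(since), capture_output=True, text=True)
    return result.stdout.splitlines()
```

    On the Pi, find_problem_lines(read_dcs_log()) returns the suspicious lines from the last hour; an empty list means none of the keywords matched.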



  • @gloomyandy no DCS errors in the logs.



  • I am retrying the print while tailing the DCS logs, with codelogger running and Pronterface connected to the board via USB. Of course, "a watched kettle never boils", but I do see repeated undervoltage events in the logs. I will try moving the Pi to its own power supply to see whether that helps. I am a bit strapped for power sockets, but I will find a way.
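
    For anyone wanting to confirm undervoltage on the Pi itself, here is a small sketch that decodes the output of vcgencmd get_throttled. The bit meanings follow the Raspberry Pi documentation; the function names are made up for this example, and it assumes vcgencmd is installed on the SBC:

```python
import subprocess

# Bit positions reported by `vcgencmd get_throttled` (Raspberry Pi documentation).
THROTTLE_FLAGS = {
    0: "under-voltage detected now",
    1: "ARM frequency capped now",
    2: "currently throttled",
    3: "soft temperature limit active",
    16: "under-voltage has occurred since boot",
    17: "ARM frequency capping has occurred since boot",
    18: "throttling has occurred since boot",
    19: "soft temperature limit has occurred since boot",
}


def decode_throttled(line):
    """Decode a line such as 'throttled=0x50005' into a list of readable flags."""
    value = int(line.strip().split("=", 1)[1], 16)
    return [text for bit, text in sorted(THROTTLE_FLAGS.items()) if value & (1 << bit)]


def read_throttled():
    """Query the Pi firmware; only works on a Raspberry Pi with vcgencmd available."""
    result = subprocess.run(
        ["vcgencmd", "get_throttled"], capture_output=True, text=True, check=True
    )
    return decode_throttled(result.stdout)
```

    Calling read_throttled() on the Pi lists the active flags. Any "under-voltage" entry would support moving the Pi to its own supply.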



  • And here we go again

    pronterface captured:

    Lost connection to Linux
    Warning: Discarded msg src=10 typ=4510 RID=42 exp 43
    Done printing file
    Cancelled printing file 0:/gcodes/VCP-Ormerod-Belt-Clip.gcode, print time was 0h 22m
    Rx0:11a20a7f 02 00 00 00 00 00 00 00
    Rx0:11a30a00 02 00 00 00 00 00 00 00
    Rx0:11a50a00 03 00 00 00 00 00 00 00
    Rx0:11a20a7f 02 00 00 00 00 00 00 00
    Rx0:11a30a00 02 00 00 00 00 00 00 00
    Rx0:11a50a00 03 00 00 00 00 00 00 00
    Rx0:11a20a7f 02 00 00 00 00 00 00 00
    Rx0:11a30a00 02 00 00 00 00 00 00 00
    Rx0:11a50a00 03 00 00 00 00 00 00 00
    Rx0:11a20a7f 02 00 00 00 00 00 00 00
    Rx0:11a30a00 02 00 00 00 00 00 00 00
    Rx0:11a50a00 03 00 00 00 00 00 00 00
    Error: Failed to switch off remote heater 1: Response timeout: CAN addr 10, req type 6013, RID=42
    [ERROR] Error: Failed to switch off remote heater 1: Response timeout: CAN addr 10, req type 6013, RID=42
    

    M122 via serial

    === Diagnostics ===
    RepRapFirmware for Duet 3 MB6HC version 3.2-beta3 running on Duet 3 MB6HC v0.6 or 1.0 (SBC mode)
    Board ID: 08DGM-9T66A-G63SJ-6J1D6-3SD6R-9U0BA
    Used output buffers: 1 of 40 (40 max)
    === RTOS ===
    Static ram: 122124
    Dynamic ram: 138812 of which 84 recycled
    Never used RAM 131172, free system stack 120 words
    Tasks: Linux(blocked,79) HEAT(blocked,277) CanReceiv(blocked,794) CanSender(blocked,344) CanClock(blocked,352) TMC(blocked,18) MAIN(running,669) IDLE(ready,19)
    Owned mutexes: USB(MAIN)
    === Platform ===
    Last reset 00:54:23 ago, cause: power up
    Last software reset at 2020-11-08 16:33, reason: User, GCodes spinning, available RAM 131236, slot 1
    Software reset code 0x0003 HFSR 0x00000000 CFSR 0x00000000 ICSR 0x0044a000 BFAR 0x00000000 SP 0xffffffff Task MAIN
    Error status: 0x04
    [ERROR] Error status: 0x04
    
    MCU temperature: min 32.0, current 35.2, max 37.0
    Supply voltage: min 23.9, current 24.0, max 24.1, under voltage events: 0, over voltage events: 0, power good: yes
    12V rail voltage: min 12.2, current 12.2, max 12.3, under voltage events: 0
    Driver 0: position 2357, standstill, reads 47099, writes 19 timeouts 0, SG min/max 0/1023
    Driver 1: position -32, standstill, reads 47098, writes 19 timeouts 0, SG min/max 0/1023
    Driver 2: position 7680, standstill, reads 47117, writes 0 timeouts 0, SG min/max not available
    Driver 3: position 0, standstill, reads 47103, writes 14 timeouts 0, SG min/max 0/174
    Driver 4: position 0, standstill, reads 47104, writes 14 timeouts 0, SG min/max 0/145
    Driver 5: position 0, standstill, reads 47103, writes 14 timeouts 0, SG min/max 0/187
    Date/time: 2020-11-08 17:50:13
    Slowest loop: 7067.05ms; fastest: 0.11ms
    === Storage ===
    Free file entries: 10
    SD card 0 not detected, interface speed: 37.5MBytes/sec
    SD card longest read time 0.0ms, write time 0.0ms, max retries 0
    === Move ===
    Hiccups: 0(0), FreeDm: 375, MinFreeDm: 350, MaxWait: 1058464ms
    Bed compensation in use: none, comp offset 0.000
    === MainDDARing ===
    Scheduled moves 22194, completed moves 22194, StepErrors 0, LaErrors 0, Underruns [0, 501, 3], CDDA state -1
    === AuxDDARing ===
    Scheduled moves 0, completed moves 0, StepErrors 0, LaErrors 0, Underruns [0, 0, 0], CDDA state -1
    === Heat ===
    Bed heaters = 0 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1 -1, chamberHeaters = -1 -1 -1 -1
    === GCodes ===
    Segments left: 0
    Movement lock held by null
    HTTP* is idle in state(s) 0
    Telnet is idle in state(s) 0
    File* is idle in state(s) 0
    USB is ready with "M122" in state(s) 0
    Aux is idle in state(s) 0
    Trigger* is idle in state(s) 0
    Queue* is idle in state(s) 0
    LCD is idle in state(s) 0
    SBC* is idle in state(s) 0
    Daemon is idle in state(s) 0
    Aux2 is idle in state(s) 0
    Autopause is idle in state(s) 0
    Code queue is empty.
    === CAN ===
    Messages sent 27548, send timeouts 27522, longest wait 2ms for type 6012, free CAN buffers 47
    === SBC interface ===
    State: 0, failed transfers: 1
    Last transfer: 13752ms ago
    RX/TX seq numbers: 0/37238
    SPI underruns 1, overruns 1
    Number of disconnects: 1, IAP RAM available 0x20a78
    Buffer RX/TX: 0/0-0
    

    The network connection to the Pi is dead. I will sort out the power to the Pi and then try again.
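
    To compare M122 reports between runs without reading them line by line, a rough sketch like the one below pulls out the CAN and SBC interface figures. The function name and dictionary keys are made up for this example, and the regular expressions only match the line formats shown in the reports above:

```python
import re


def summarize_m122(text):
    """Extract a few CAN and SBC interface figures from pasted M122 output."""
    summary = {}

    can = re.search(r"Messages sent (\d+), send timeouts (\d+)", text)
    if can:
        sent, timeouts = int(can.group(1)), int(can.group(2))
        summary["can_sent"] = sent
        summary["can_send_timeouts"] = timeouts
        # Fraction of sent messages that were counted as send timeouts.
        summary["can_timeout_ratio"] = timeouts / sent if sent else 0.0

    last = re.search(r"Last transfer: (\d+)ms ago", text)
    if last:
        summary["sbc_last_transfer_ms"] = int(last.group(1))

    failed = re.search(r"failed transfers: (\d+)", text)
    if failed:
        summary["sbc_failed_transfers"] = int(failed.group(1))

    disconnects = re.search(r"Number of disconnects: (\d+)", text)
    if disconnects:
        summary["sbc_disconnects"] = int(disconnects.group(1))

    return summary
```

    Fed the two reports above, this shows the last SBC transfer going from 19ms ago to 13752ms ago and the disconnect count going from 0 to 1. Both reports count nearly every sent CAN message as a send timeout (10 of 10, then 27522 of 27548), so that ratio on its own does not separate the working state from the failed one.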

