Author Topic: Talos II reboots itself  (Read 6776 times)

DKnoto

  • Jr. Member
  • **
  • Posts: 83
  • Karma: +13/-0
    • View Profile
Talos II reboots itself
« on: December 01, 2023, 03:23:34 pm »
Since September I have been having problems with spontaneous reboots of my machine, it has happened four times. Recently it was very annoying, the machine rebooted over and over again without loading the operating system. Only removing the power plug helped.

I managed to take a picture of what the report looks like after such a fall:

https://www.dropbox.com/scl/fi/hk0zqvfjpidexnw7m1xln/Talos-II-2023-11-27-Crash.jpeg?rlkey=0jj34pjpv708zigt08kzfk3zl&dl=0

Any ideas what could cause this behavior?
Desktop: Talos II T2P9S01 REV 1.01 | IBM Power 9/18c DD2.3, 02CY646 | AMD Radeon Pro WX7100 | 64GB RAM | SSD 1TB

Hasturtium

  • Full Member
  • ***
  • Posts: 155
  • Karma: +10/-0
    • View Profile
Re: Talos II reboots itself
« Reply #1 on: December 02, 2023, 06:09:33 pm »
Looks like it might be tied to this, though I couldn’t say what’s triggering it… In any case it looks like updating the firmware to 2.18+ could mitigate the issue. I'm running a Blackbird, so I can't really provide further insight on this one. But I hope it helps.
« Last Edit: December 02, 2023, 08:16:32 pm by Hasturtium »

ClassicHasClass

  • Sr. Member
  • ****
  • Posts: 473
  • Karma: +37/-0
  • Talospace Earth Orbit
    • View Profile
    • Floodgap
Re: Talos II reboots itself
« Reply #2 on: December 03, 2023, 10:33:41 pm »
Not sure, but definitely worth having a connection open to the BMC and watching any activity. How far does it get through Hostboot?

Also, did anything guard out?

MPC7500

  • Hero Member
  • *****
  • Posts: 596
  • Karma: +41/-1
    • View Profile
    • Twitter
Re: Talos II reboots itself
« Reply #3 on: December 07, 2023, 03:50:44 pm »
I had the same problem some time ago. After a thunderstorm with a lightning strike. I had to re-flash the firmware.
It could also indicate a bad PSU, which happens surprisingly often.

DKnoto

  • Jr. Member
  • **
  • Posts: 83
  • Karma: +13/-0
    • View Profile
Re: Talos II reboots itself
« Reply #4 on: December 08, 2023, 01:01:22 am »
I have a power supply with a 10-year warranty ;) Thermaltake Toughpower TF1 1550W.
Desktop: Talos II T2P9S01 REV 1.01 | IBM Power 9/18c DD2.3, 02CY646 | AMD Radeon Pro WX7100 | 64GB RAM | SSD 1TB

Hasturtium

  • Full Member
  • ***
  • Posts: 155
  • Karma: +10/-0
    • View Profile
Re: Talos II reboots itself
« Reply #5 on: December 08, 2023, 06:05:15 pm »
I have a power supply with a 10-year warranty ;) Thermaltake Toughpower TF1 1550W.

Then that likely isn't it, though don't dismiss it outright. Have you considered re-flashing the firmware as MPC7500 suggested?

DKnoto

  • Jr. Member
  • **
  • Posts: 83
  • Karma: +13/-0
    • View Profile
Re: Talos II reboots itself
« Reply #6 on: December 10, 2023, 02:02:21 am »
Have you considered re-flashing the firmware as MPC7500 suggested?

Yes, I'm considering it but I'm a little afraid to mess something up. I've never done it and my Talos II is my critical resource at the moment.
Desktop: Talos II T2P9S01 REV 1.01 | IBM Power 9/18c DD2.3, 02CY646 | AMD Radeon Pro WX7100 | 64GB RAM | SSD 1TB

MPC7500

  • Hero Member
  • *****
  • Posts: 596
  • Karma: +41/-1
    • View Profile
    • Twitter
Re: Talos II reboots itself
« Reply #7 on: December 10, 2023, 06:07:32 am »
It's pretty simple. You only have to update the BMC and the OpenPOWER firmware, not the FPGA.
https://wiki.raptorcs.com/wiki/Updating_Firmware

DKnoto

  • Jr. Member
  • **
  • Posts: 83
  • Karma: +13/-0
    • View Profile
Re: Talos II reboots itself
« Reply #8 on: December 10, 2023, 02:08:19 pm »
I think I need to hurry up, a while ago I had another crash. I wasn't doing anything in particular I was listening to music on YT and reading Twitter in Firefox. The worst part is that my FreeBSD in Qemu VM has destroyed again...  :(
Desktop: Talos II T2P9S01 REV 1.01 | IBM Power 9/18c DD2.3, 02CY646 | AMD Radeon Pro WX7100 | 64GB RAM | SSD 1TB

ejfluhr

  • Newbie
  • *
  • Posts: 44
  • Karma: +3/-0
    • View Profile
Re: Talos II reboots itself
« Reply #9 on: December 13, 2023, 06:11:40 pm »
Is the error always on c4?   If yes, can you disable c4?   

DKnoto

  • Jr. Member
  • **
  • Posts: 83
  • Karma: +13/-0
    • View Profile
Re: Talos II reboots itself
« Reply #10 on: January 11, 2024, 04:45:13 am »
On the kernel 6.6.8-200.fc39.ppc64le I have recently had similar instances of reboots but they occurred when connecting a device to USB, printer, iPad. On kernel 6.6.9-200.fc39.ppc64le it has been correct for two days.
Desktop: Talos II T2P9S01 REV 1.01 | IBM Power 9/18c DD2.3, 02CY646 | AMD Radeon Pro WX7100 | 64GB RAM | SSD 1TB

DKnoto

  • Jr. Member
  • **
  • Posts: 83
  • Karma: +13/-0
    • View Profile
Re: Talos II reboots itself
« Reply #11 on: January 15, 2024, 01:26:10 am »
Over the past four days on kernel 6.6.9-200, I have made more than a dozen attempts to connect various devices to USB ports and the situation has not repeated.
Desktop: Talos II T2P9S01 REV 1.01 | IBM Power 9/18c DD2.3, 02CY646 | AMD Radeon Pro WX7100 | 64GB RAM | SSD 1TB