Show Posts

This section allows you to view all posts made by this member. Note that you can only see posts made in areas you currently have access to.


Messages - FlyingBlackbird

Pages: 1 2 3 [4] 5 6 7
46
Firmware / Re: [Guide] How to run firmware temporariliy
« on: February 16, 2020, 01:39:40 am »
Did you find a way to temporarily update PNOR and OpenBMC together (in sync) since this is the recommended procedure...?

https://wiki.raptorcs.com/wiki/Compiling_Firmware#General_advice

> Always upgrade PNOR and BMC together
>    Many mismatched PNOR/BMC version combinations lead to weird failures.

The wiki also describes how to run the OpenBMC firmware temporariliy but I am not sure how and if this works in sync with a PNOR update

https://wiki.raptorcs.com/wiki/Compiling_Firmware#Running_the_firmware_temporarily

47
Blackbird / Re: Regular users issues
« on: February 12, 2020, 04:12:18 pm »
I asked them via mail. This was the answer from today.

Great news! Do you have a URL of their home page for us?

48
Blackbird / Re: How to connect HDD activity LED?
« on: February 12, 2020, 04:11:27 pm »
You could use (I did) something like this with a 150 Ohm resistor inline on one of the legs and connect to pins 14 and 15.

https://www.sparkfun.com/products/9140

and use some shrink tubing to insulate.

Shrink tubing is a good idea, I already have thought about using my Rapsberry Pi jumper cables...

Why do I need the 150 Ohm resistor for Pin 15, is 3.3 V too much?

49
You can verify the firmware with Flashrom and a Bus Pirate.

I have a CH341A device which is supported by the flashrom Linux program but I could not yet find instructions which parts of the memory are relevant for calculating the checksums and
also want to avoid removing the flash chips from the socket just to read them (perhaps this it not required since the flash update is possible from within OpenBMC but if OpenBMC is
compromised it could manipulate the reported checksum too).

50
Blackbird / How to connect HDD activity LED?
« on: February 11, 2020, 03:52:36 pm »
The User Guide for Blackbird explains on pages 18 + 19 the J9800 Front Panel pins to connect eg. LEDs like HDD activity, Power LED and NIC LEDs:

https://wiki.raptorcs.com/w/images/c/ce/C1P9S01_users_guide_version_1_0.pdf

What is unclear to me is how I can connect a usual two-pin HDD LED plug with the front panel pins since

- I can see only one Pin (#14) for the "HDD activity LED cathode" but
- no obvious PIN for the HDD activity LED anode.

So the two-pin HDD LED plug from my case has no obvious two neighbored pins to be plugged into.

1. Which front panel pin should be used for the anode of the LED?
2. Which cable could I use to split the two-pin HDD LED plug from my case into two separate pins for a reliable connection?

51
How could I check the integrity of the installed (flashed) firmware of a Blackbird (or Talos) eg. against the officially downloadable firmware files (which are cryptographically signed)?

https://wiki.raptorcs.com/wiki/Blackbird/Firmware


52
User Zone / Re: petitboot error: PHB#0005[0:5]: PHB Freeze/Fence detected !
« on: February 11, 2020, 01:07:39 am »
That's sad. Would be interesting to know what caused this error.

I will come back here with the RMA result (but I guess it is hard to diagnose)

53
User Zone / Re: petitboot error: PHB#0005[0:5]: PHB Freeze/Fence detected !
« on: February 10, 2020, 03:03:16 pm »
I will try it out this weekend and come back with my results

OK, I have tested my Blackbird with a minimal hardware attached and the planar unmounted (detached) from the case's mouting points and I can reproduce the error
(see the attached log captured via the serial console + the picture of the planar).

Oh unlucky day, I have to open an RMA...

Code: [Select]
...
[    4.095231] IMC PMU (null) Register failed
[    7.230872] kAFS: failed to register: -97
[    7.613717] udevd[1694]: specified group 'kvm' unknown
[    7.621590] udevd[1695]: specified group 'kvm' unknown
nvram process returned non-zero exit status
dmesg: klogctl: Operation not permitted
 )0 [1;24r [m   [4l [?7h [?25l [?1c [H [J [H [J Petitboot (v1.10.3-pdd2d545)
 [2d ────────────────────────────────────────────────────────────────────────────── [4;3HSystem information [5;3HSystem configuration [6;3HSystem status log [7;3HLanguage [8;3HRescan devices [9;3HRetrieve config from URL [10;3HPlugins (0)
 [11d * [0;10;7m  Exit to shell            [22;2H [m  ──────────────────────────────────────────────────────────────────────────────
 [23d Enter=accept, e=edit, n=new, x=exit, l=language, g=log, h=help
 [24d Welcome to Petitboot
 Info: Waiting for device discovery[   86.421879937,3] PHB#0005[0:5]: PHB Freeze/Fence detected !
...
[   92.219717338,3] PHB#0005[0:5]: eeh_freeze_clear on fenced PHB

54
Sorry but are we obliged to register on the site just to download a patch? It seems excessive to me ... Now for example it denies me access ...

I guess the link is using the wrong path (perhaps to a private repo or the internal CI/CD artefacts).

Nevertheless I think you can find the patches here eg. in the master branch:

https://gitlab.com/chromium-ppc64le/chromium-ppc64le/-/tree/master/docker-root/patches/chrome

55
On Amazon a bestseller

THX :-) Is was mixing up this with the MSI M.2 Epander Aero (for 4 M.2 SSDs) which I couldn't find to buy at Amazon or my local computer stores (it was mentioned it ct 3.2020 page 77):

https://www.msi.com/PC-component/M.2-XPANDER-AERO/Overview

56
General Discussion / AMD Radeon Pro WX3200 working with Blackbird?
« on: February 09, 2020, 05:18:45 pm »
I am considering to buy a WX3200 because it is "cheap(er)" (about 220 USD) and supports newer standards like OpenGL 4.6 and Vulkan 1.1.

Has anybody tried out this GPU on a Blackbird (or Talos), does it work and with which OS and module (driver)?

https://www.amd.com/en/products/professional-graphics/radeon-pro-wx-3200

According to AMD this GPU uses Polaris architecture.
I have read rumors that it is a Polaris 12 GPU similar to the Radeon RX 550 LE.

57
User Zone / Re: petitboot error: PHB#0005[0:5]: PHB Freeze/Fence detected !
« on: February 07, 2020, 06:16:24 pm »
Thanks a lot for your quick response, I will try it out this weekend and come back with my results

58
I still have the problems described in the first post.

To avoid lengthy posts in this question I will close this question.

I have opened a new question that contains detailed logs now for better diagnostics and a more precise question:

https://forums.raptorcs.com/index.php/topic,52.0.html

59
User Zone / petitboot error: PHB#0005[0:5]: PHB Freeze/Fence detected !
« on: February 07, 2020, 05:28:15 pm »
My Blackbird does not boot into the petitboot menu anymore (black screen) even though I can see the execution log on screen and via the ssh console with:

Code: [Select]
ssh -p 2200 root@my.ip.address
The last available console output is a failure saying:

Code: [Select]
[   90.334100762,3] PHB#0005[0:5]: eeh_freeze_clear on fenced PHB
               XE autoconfiguration failed

I have only installed a SATA-HDD drive with a bootable Ubuntu Server 19.10 (no GPU).
I have not changed/updated any firmware.
Last thing I have done was testing a PCIe GPU (NVIDIA).
After removing the GPU card from the PCIe slot I could no longer boot into petitboot...

What is going wrong?

The relevant log is (full console output is attached as file):

Code: [Select]
--== Welcome to Hostboot hostboot-3beba24/hbicore.bin ==--

  3.06480|secure|SecureROM valid - enabling functionality
  8.29259|Booting from SBE side 0 on master proc=00050000
  8.46568|ISTEP  6. 5 - host_init_fsi
  8.93808|ISTEP  6. 6 - host_set_ipl_parms
  9.49036|ISTEP  6. 7 - host_discover_targets
 10.09037|HWAS|PRESENT> DIMM[03]=8080000000000000
 10.09038|HWAS|PRESENT> Proc[05]=8000000000000000
 10.09040|HWAS|PRESENT> Core[07]=5565000000000000
 10.49937|ISTEP  6. 8 - host_update_master_tpm
 10.52369|SECURE|Security Access Bit> 0x0000000000000000
 10.52370|SECURE|Secure Mode Disable (via Jumper)> 0x8000000000000000
...
 57.11902|ISTEP 21. 2 - host_verify_hdat
 57.20205|ISTEP 21. 3 - host_start_payload
[   58.179010391,5] OPAL skiboot-c81f9d6 starting...
[   58.179013552,7] initial console log level: memory 7, driver 5
[   58.179015737,6] CPU: P9 generation processor (max 4 threads/core)
...
[   65.266762790,5] PHB#0000:00:00.0 [ROOT] 1014 04c1 R:00 C:060400 B:01..ff SLOT=CPU1 Slot2 (16x)
 Petitboot (v1.10.3-pdd2d545)
 ──────────────────────────────────────────────────────────────────────────────

  System information
  System configuration
  System status log
  Language
  Rescan devices
  Retrieve config from URL
  Plugins (0)
 *Exit to shell           










 ──────────────────────────────────────────────────────────────────────────────
 Enter=accept, e=edit, n=new, x=exit, l=language, g=log, h=help
 Info: Waiting for device discovery[   85.086133287,3] PHB#0005[0:5]: PHB Freeze/Fence detected !
[   85.086197573,3] PHB#0005[0:5]:             PCI FIR=2000000000000000
[   85.086249297,3] PHB#0005[0:5]:         PCI FIR WOF=2000000000000000
[   85.086289203,3] PHB#0005[0:5]:            NEST FIR=0000800000000000
[   85.086354836,3] PHB#0005[0:5]:        NEST FIR WOF=0000800000000000
[   85.086394899,3] PHB#0005[0:5]:            ERR RPT0=0000000000000001
[   85.086489826,3] PHB#0005[0:5]:            ERR RPT1=0000000000000000
[   85.086534460,3] PHB#0005[0:5]:             AIB ERR=0000200000000000
[   85.086941635,3] PHB#0005[0:5]:                  brdgCtl = 00000002
[   85.087002150,3] PHB#0005[0:5]:             deviceStatus = 00200020
[   85.087036852,3] PHB#0005[0:5]:               slotStatus = 00402000
[   85.087081358,3] PHB#0005[0:5]:               linkStatus = a8120008
[   85.087137004,3] PHB#0005[0:5]:             devCmdStatus = 00100107
[   85.087181127,3] PHB#0005[0:5]:             devSecStatus = 00000000
[   85.087239088,3] PHB#0005[0:5]:          rootErrorStatus = 00000000
[   85.087285589,3] PHB#0005[0:5]:          corrErrorStatus = 00000000
[   85.087325009,3] PHB#0005[0:5]:        uncorrErrorStatus = 00000000
[   85.087370016,3] PHB#0005[0:5]:                   devctl = 00000020
[   85.087419580,3] PHB#0005[0:5]:                  devStat = 00000020
[   85.087466277,3] PHB#0005[0:5]:                  tlpHdr1 = 00000000
...
[   85.088610802,3] PHB#0005[0:5]:       phbRxeArbErrorLog1 = 0000000000000000
  [Disk: sda2 / ef49aa17-bb70-4fea-a8fc-29e235f7ab9f]
    Ubuntu, with Linux 5.3.0-26-generic (recovery mode)
    Ubuntu, with Linux 5.3.0-26-generic
    Ubuntu, with Linux 5.3.0-29-generic (recovery mode)
    Ubuntu, with Linux 5.3.0-29-generic
    Ubuntu
[   85.088655011,3] PHB#0005[0:5]:     phbRxeMrgErrorStatus = 0000000000000001
...
[   85.089573601,3] PHB#0005[0:5]:                PEST[0ff] = 3740002a01000000 0000000000000000
 [enP4p1s0f2] Probing from base tftp://192.168.178.1/pxelinux.cfg/[   90.311273403,3] PHB#0005[0:5]: PHB Freeze/Fence detected !
[   90.311357669,3] PHB#0005[0:5]:             PCI FIR=2000000000000000
...
[   90.315282185,3] PHB#0005[0:5]:         phbRegbErrorLog1 = 0001020000000000
[   90.315338900,3] PHB#0005[0:5]:                PEST[000] = 8000000000000000 8000000000000000
[   90.315413179,3] PHB#0005[0:5]:                PEST[001] = 8000000000000000 8000000000000000
[   90.315491213,3] PHB#0005[0:5]:                PEST[002] = 8000000000000000 8000000000000000
...
[   90.333937493,3] PHB#0005[0:5]:                PEST[0fe] = 8000000000000000 8000000000000000
[   90.334011680,3] PHB#0005[0:5]:                PEST[0ff] = b740002a01000000 8000000000000000
[   90.334100762,3] PHB#0005[0:5]: eeh_freeze_clear on fenced PHB
               XE autoconfiguration failed

PS: I have logged in into OpenBMC via SSH and see these strange error messages that may be related:

Code: [Select]
root@blackbird:~# journalctl | grep fail
May 10 19:37:25 blackbird kernel: g_mass_storage 1e6a0000.usb-vhub:p1: failed to start g_mass_storage: -22
May 10 19:37:27 blackbird systemd-udevd[789]: Process 'mtd_probe /dev/mtd2ro' failed with exit code 1.
May 10 19:37:27 blackbird systemd-udevd[790]: Process 'mtd_probe /dev/mtd3ro' failed with exit code 1.
May 10 19:37:27 blackbird systemd-udevd[837]: Process 'mtd_probe /dev/mtd4ro' failed with exit code 1.
May 10 19:37:27 blackbird systemd-udevd[792]: Process 'mtd_probe /dev/mtd5ro' failed with exit code 1.
May 10 19:37:28 blackbird systemd-udevd[788]: Process 'mtd_probe /dev/mtd0ro' failed with exit code 1.
May 10 19:37:28 blackbird systemd-udevd[791]: Process 'mtd_probe /dev/mtd1ro' failed with exit code 1.
May 10 19:37:28 blackbird systemd-udevd[836]: Process 'mtd_probe /dev/mtd6ro' failed with exit code 1.
May 10 19:37:29 blackbird kernel: A link change request failed with some changes committed already. Interface eth0 may have been left with an inconsistent configuration, please check.
May 10 19:37:31 blackbird kernel: A link change request failed with some changes committed already. Interface sit0 may have been left with an inconsistent configuration, please check.
Feb 03 22:05:40 blackbird kernel[1052]: [    3.810720] g_mass_storage 1e6a0000.usb-vhub:p1: failed to start g_mass_storage: -22
Feb 03 22:05:43 blackbird kernel[1052]: [   22.397690] A link change request failed with some changes committed already. Interface eth0 may have been left with an inconsistent configuration, please check.
Feb 03 22:05:43 blackbird kernel[1052]: [   24.051627] A link change request failed with some changes committed already. Interface sit0 may have been left with an inconsistent configuration, please check.
Feb 07 21:45:05 blackbird systemd[1]: Starting Stop the ethernet link failover...
Feb 07 21:45:07 blackbird systemd[1]: Started Stop the ethernet link failover.

PS2: This question is more precise follow-up to that question:
         https://forums.raptorcs.com/index.php?action=post;topic=49.0;last_msg=473

60
I have not yet experienced the problem myself (though I also run NVMe SSDs in my system)

Thanks for your feed-back :-)

Do you use the NVMe SSD together with any other SATA device (which is my constellation supposed to cause problems - in my case I suspect the optical drive)?

Pages: 1 2 3 [4] 5 6 7