Show Posts

This section allows you to view all posts made by this member. Note that you can only see posts made in areas you currently have access to.

Topics - rjzak

Pages: [1]
General Discussion / Thanks WebKit... [amdgpu crash]
« on: April 17, 2023, 09:25:11 am »
I'm surprised this happened:

Code: [Select]
[86046.568669] amdgpu 0000:01:00.0: amdgpu: IH ring buffer overflow (0x0008A700, 0x0000F9B0, 0x0000A710)
[86046.568709] amdgpu 0000:01:00.0: amdgpu: GPU fault detected: 147 0x02e0c802 for process WebKitWebProces pid 132102 thread WebKitWebP:cs0 pid 132125
[86046.568740] amdgpu 0000:01:00.0: amdgpu:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00101400
[86046.568753] amdgpu 0000:01:00.0: amdgpu:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x06048002
[86046.568776] amdgpu 0000:01:00.0: amdgpu: VM fault (0x02, vmid 3, pasid 32773) at page 1053696, read from 'TC4' (0x54433400) (72)

I'm using Debian 11.6 and figured I'd try Chromium. After a few pages loaded and having it running for ~12 hours, the applications seem to have frozen. The music in the background continued to play and the cursor would move, but the applications are all seemingly non-responsive. SSHing in and running `dmesg` showed that error. Kernel is 6.1.0-7. Anyone else see this before? I noticed that Chromium on Debian also reports it's x86_64, that should have been reason enough to go back to Firefox!

GPU Compute / Accelerators / GPU & disk encryption?
« on: January 07, 2023, 12:09:01 pm »
I flashed the AMD firmware to the BOOTKERNFW, and can see the Petitboot screen via the AMD GPU! However, when the OS boots, it goes dark for a few seconds, then a cursor appears. Only via the serial display via the BMC web interface am I able to see the prompt to unlock the disk to resume booting. Trying to enter the password on the keyboard doesn't work. Is there a way around this, so I don't need another computer to boot the encrypted disk? I used Debian's auto formatter, so /boot is unencrypted, and LVM has the encrypted swap and root partition. I wouldn't be opposed to having a way to inform Petitboot or BMC of the password somehow.

General Discussion / Raptor-provided SATA card issues
« on: December 27, 2022, 09:50:33 am »
I'm using the SATA card provided by Raptor when I bought the Talos II. lsusb identifies it as: SATA controller: Marvell Technology Group Ltd. 88SE9215 PCIe 2.0 x1 4-port SATA 6 Gb/s Controller (rev ff).

Void Linux can see the optical drive as /dev/cdrom which is a link to /dev/sr0. However, I cannot mount a disc, nor can fdisk open it, both with the same error: "no medium found". I was trying some other distros, and after a few warm reboots, the OS no longer sees the optical drive. I attached a hard drive as well, and the distros I tried couldn't see the hard drive at all (though the hard drive works via USB-SATA cable).

dmesg | grep ata has this output:
Code: [Select]
[    2.522439] libata version 3.00 loaded.
[    2.547068] ata1: SATA max UDMA/133 abar m2048@0x620c080040000 port 0x620c080040100 irq 128
[    2.547072] ata2: SATA max UDMA/133 abar m2048@0x620c080040000 port 0x620c080040180 irq 128
[    2.547075] ata3: SATA max UDMA/133 abar m2048@0x620c080040000 port 0x620c080040200 irq 128
[    2.547078] ata4: SATA max UDMA/133 abar m2048@0x620c080040000 port 0x620c080040280 irq 128
[    2.862532] ata1: SATA link down (SStatus 0 SControl 300)
[    2.862564] ata2: SATA link down (SStatus 0 SControl 300)
[    2.863697] ata4: SATA link down (SStatus 0 SControl 300)
[   12.544009] ata3: softreset failed (1st FIS failed)
[   22.544008] ata3: softreset failed (1st FIS failed)
[   57.544575] ata3: softreset failed (1st FIS failed)
[   62.544519] ata3: softreset failed (1st FIS failed)
[   62.544542] ata3: reset failed, giving up
[   62.598911] EXT4-fs (nvme0n1p1): mounted filesystem with ordered data mode. Quota mode: none.
[   62.671972] EXT4-fs (nvme0n1p1): mounted filesystem with ordered data mode. Quota mode: none.
[   65.356285] cfg80211: Loading compiled-in X.509 certificates for regulatory database
[60247.550077] PHB4 PHB#49 Diag-data (Version: 1)
[60247.572032] ata1: failed to stop engine (-19)
[60247.572050] ata2: failed to stop engine (-19)
[60247.572087] ata3: failed to stop engine (-19)
[60247.572103] ata4: failed to stop engine (-19)

So I'm wondering, is there a missing firmware file for this? Should I buy another SATA adapter?

Kernel: 6.0.13_1. Didn't work on older kernels either.

Applications and Porting / Interest in Porting Wasmtime & Cranelift
« on: October 05, 2022, 11:13:25 am »
I'm interested in getting Wasmtime (WebAssembly Wasi runtime library and program) and Cranelift (JIT used by Wasmtime, others) working on ppc64le. One thing that's interesting is that Power seems to be the only modern platform missing (RISC-V support was merged in recently, and they also support s390x, ARM, and of course, x86_64). As visible in the RISC-V PR, the amount of code required for such an undertaking is massive, and requires extensive knowledge of assembly. I'm interested in doing this work, and have been reading IBM's ISA and ELF ABI documentation, but could use a hand.

It would benefit Firefox, since Cranelift is used by Firefox for WebAssembly JIT compilation, I believe. It would help with some random apps, such as the text editor Lapce, which use Cranelift & Wasmtime for plugins.

Also relevant:,27.msg3125.html#msg3125

Talos II / Damaged motherboard?
« on: August 26, 2022, 04:17:52 pm »
I ordered my first Raptor product, the Talos II (+CPUs bundle), and it arrived today! However, after unpacking, I had some concerns.

In the attachments, there's an image of what looks like spilled solder on a memory slot, and a weird orange rust-looking dot on a chip which is not present in the high-resolution image from the Wiki. Yet, there was a note in the package showing that tests passed. How worried should I be? I'm afraid the solder spill could short something out, and I spent too much money (personal money for personal computer) on this so I'm going to nit-pick.

I did email support, waiting to hear back.

Talos II / Advice for installing OS via BMC?
« on: February 01, 2021, 05:18:54 pm »
I'm new to Raptor systems, and I have been granted remote access to a Talos II machine. I need to install the OS, but haven't been successful. So far, I tried the Ubuntu 20.10 PPC64LE and Debian PPC64LE full CD installer over the virtual media, and from the serial connection, I can see the kernel crashes.  I have been able to get to the menus in the serial connection for the Debian network installer, but I don't know the network settings. Is it ok to reuse the BMC IP? Is it a safe assumption that the BMC shares the first Ethernet connection port? Next I'm trying Void Linux to see if that's an improvement.

Pages: [1]