Hi,
I’m having a strange issue with my system lately. For about two weeks now, it has been freezing every now and then.
Checking journalctl, I found the following messages:
Jan 30 12:54:27 #hostname## kernel: gmc_v11_0_process_interrupt: 68 callbacks suppressed
Jan 30 12:54:27 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:153 vmid:0 pasid:0)
Jan 30 12:54:27 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: in page starting at address 0x0000000000000000 from client 10
Jan 30 12:54:27 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00000B32
Jan 30 12:54:27 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: Faulty UTCL2 client ID: CPC (0x5)
Jan 30 12:54:27 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: MORE_FAULTS: 0x0
Jan 30 12:54:27 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: WALKER_ERROR: 0x1
Jan 30 12:54:27 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: PERMISSION_FAULTS: 0x3
Jan 30 12:54:27 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: MAPPING_ERROR: 0x1
Jan 30 12:54:27 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: RW: 0x0
Jan 30 12:54:29 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Jan 30 12:54:29 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: failed to reg_write_reg_wait
Jan 30 12:54:32 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Jan 30 12:54:32 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: failed to reg_write_reg_wait
Jan 30 12:54:35 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Jan 30 12:54:35 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: failed to reg_write_reg_wait
Jan 30 12:54:38 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Jan 30 12:54:38 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: failed to reg_write_reg_wait
Jan 30 12:54:40 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Jan 30 12:54:40 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: failed to reg_write_reg_wait
Jan 30 12:54:43 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Jan 30 12:54:43 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: failed to reg_write_reg_wait
Jan 30 12:54:46 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Jan 30 12:54:46 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: failed to reg_write_reg_wait
Jan 30 12:54:49 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Jan 30 12:54:49 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: failed to reg_write_reg_wait
Jan 30 12:54:52 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Jan 30 12:54:52 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: failed to reg_write_reg_wait
Jan 30 12:54:54 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Jan 30 12:54:54 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: failed to reg_write_reg_wait
Jan 30 12:54:57 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)
Jan 30 12:54:57 #hostname## kernel: amdgpu 0000:c4:00.0: amdgpu: failed to reg_write_reg_wait
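As a sanity check on the page fault lines above, the raw GCVM_L2_PROTECTION_FAULT_STATUS value 0x00000B32 can be split into the fields the driver prints after it. This is only a minimal Python sketch: the bit positions in `FIELDS` are an assumption (they reproduce the values in this log, but they are not copied from the amdgpu source), and `decode_fault_status` is a made-up helper name.

```python
# Assumed field layout: (name, lowest bit, width in bits).
# These positions are a guess that happens to reproduce the logged values;
# they are not taken from the amdgpu driver source.
FIELDS = [
    ("MORE_FAULTS", 0, 1),
    ("WALKER_ERROR", 1, 3),
    ("PERMISSION_FAULTS", 4, 4),
    ("MAPPING_ERROR", 8, 1),
    ("CID", 9, 9),  # the "Faulty UTCL2 client ID" in the log
    ("RW", 18, 1),
]


def decode_fault_status(status):
    """Split a raw status value into named bit fields (hypothetical helper)."""
    return {
        name: (status >> shift) & ((1 << width) - 1)
        for name, shift, width in FIELDS
    }


decoded = decode_fault_status(0x00000B32)
for name, value in decoded.items():
    print(f"{name}: {hex(value)}")
```

With that layout the fields come out as 0x0, 0x1, 0x3, 0x1, 0x5 and 0x0, which matches the MORE_FAULTS, WALKER_ERROR, PERMISSION_FAULTS, MAPPING_ERROR, client ID and RW lines in the log, so the log itself at least looks internally consistent.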
The system locks up completely and I can’t do anything except a hard reboot.
I’ve found the following post describing a similar problem but the workaround listed didn’t help:
My laptop is a Lenovo ThinkPad P14s with an AMD AI Pro 370 CPU and Radeon 890M graphics.
I’m running the latest Fedora 42 release with kernel 6.18.7-100.fc42.x86_64.
Upgrading to Fedora 43 is currently not an option, so I can’t check whether that would fix things. Any help or insights are greatly appreciated! Since I’m fairly new to Linux, I’m not sure what other information is needed to troubleshoot or understand the issue. Please let me know if you need anything else.
Thanks!