Constant freezes after suspend on Fedora 40 with AMD GPU (RX 6700 XT)

Fresh F40 installation with all updates freezes right after suspend in > 50% of the time. Besides these many suspend freezes, I get random gnome-shell crashes and randomly usb-c / thunderbolt issues (flaky usb-c). I wonder what’s the reason for so much instability? Recent kernel updates? I always try to send reports, but this never works because of not enough information to submit. What to do?

There have been reports of crashes after suspend because of gnome extensions. GNOME Shell crashes can be connected to the same issue.

Do you have any extensions enabled? If so, you could try disabling them and performing some tests. Afterwards they can be enabled one by one, in order to see which one causes the crash, or by looking into the logs.

I’m noticing the same since a day or so on my AMD laptop, it might be related to the last kernel update (6.10.10).

I’ve booted into 6.10.9 now to see if that improves things.

Hello,
I’m having issues too (Silverblue 40, Lenovo P14s gen4 with AMD Ryzen 7 PRO 7840U w/ Radeon™ 780M), when closing the lid it goes to suspend (but if I plug or unplug the charger it goes out then go back to suspend; I don’t know if it’s related but it wasn’t like that before crash/freezes) and about 25% of the time when I open the lid it kind of reboot without going through the BIOS, I have directly the LUKS password screen, and sometimes it doesn’t detect my WiFi card, I have to reboot.
I had also one complete freeze like 10s after waking up the laptop.
In the journal app, I see things like:

libinput error: event1 - Lid Switch: client bug: event processing lagging behind by 4542ms, your system is too slow - [tilingshell] [extension] extension is disabled
[tilingshell] [TilingLayout] win rect: 6,38,951,1156 Error in size change accounting.

and

[drm:amdgpu_mes_reg_write_reg_wait [amdgpu]] ERROR failed to reg_write_reg_wait
amdgpu 0000:64:00.0: amdgpu: MES failed to respond to msg=MISC (WAIT_REG_MEM)

Just a quick update that this issue is still occurring with the two latest kernels, 6.10.10 and 6.10.11, but not with 6.10.9. There are also reports of it occurring on kernel 6.11.

The issue appears related to PSR (Panel Self Refresh), a powersaving method of the GPU. There are two kernel patches available but they are not yet integrated into the mainline kernel.

A quick GPU recovery that may work as a temporary fix when it happens:

$ sudo cat /sys/kernel/debug/dri/1/amdgpu_gpu_recover

A kernel parameter that you may use to disable PSR completely:

amdgpu.dcdebugmask=0x210

(will potentially increase power usage slightly)

1 Like