AMDGPU random error "retry page fault"

Problem

Sometimes my pc freezes or goes blank and back several times but isn’t responding to any input (mouse nor keyboard). Music is still playing in background. Linux Kernel 6.3.8-200.fc38.x86_64
I don’t know anything to do with the kernel errors, but probably you can? I would be glad to get help or to get to know where I have to ask.

Cause

Not yet known.

Journald Output:

12.06.2023

Jun 12 16:27:47 fedora kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma0 timeout, signaled seq=59421, emitted seq=59423
Jun 12 16:27:47 fedora kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process  pid 0 thread  pid 0
Jun 12 16:27:47 fedora kernel: amdgpu 0000:05:00.0: amdgpu: GPU reset begin!
Jun 12 16:27:47 fedora kernel: ------------[ cut here ]------------
Jun 12 16:27:47 fedora kernel: WARNING: CPU: 3 PID: 20481 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:599 amdgpu_irq_put+0x46/0x70 [amdgpu]
Jun 12 16:27:47 fedora kernel: Modules linked in: uinput rfcomm snd_seq_dummy snd_hrtimer nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ip_set nf_tables nf>
Jun 12 16:27:47 fedora kernel:  snd_hda_core think_lmi rapl snd_acp_config firmware_attributes_class wmi_bmof snd_hwdep pcspkr mc snd_seq cm109 cfg80211 snd_soc_acpi k10temp snd_seq_device thinkpad_acpi i2c_piix4 snd_pci_acp3x snd_pcm ledtrig_audio platform_profile snd_timer rfkill snd vfat soundcore fat i2c_scmi acpi_cpufreq joy>
Jun 12 16:27:47 fedora kernel: CPU: 3 PID: 20481 Comm: kworker/u32:5 Not tainted 6.3.6-200.fc38.x86_64 #1
Jun 12 16:27:47 fedora kernel: Hardware name: LENOVO 20NECTO1WW/20NECTO1WW, BIOS R11ET32W (1.12 ) 12/23/2019
Jun 12 16:27:47 fedora kernel: Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Jun 12 16:27:47 fedora kernel: RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
Jun 12 16:27:47 fedora kernel: Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 3f eb 6f e6 e9 5a fd ff ff <0f> 0b b8 ea ff ff ff e9 2e eb 6f e6 b8 ea ff ff ff e9 24 eb 6f e6
Jun 12 16:27:47 fedora kernel: RSP: 0018:ffff95310c743c90 EFLAGS: 00010246
Jun 12 16:27:47 fedora kernel: RAX: ffff8962c1af74b0 RBX: ffff8962d0060000 RCX: 0000000000000000
Jun 12 16:27:47 fedora kernel: RDX: 0000000000000000 RSI: ffff8962d006bef0 RDI: ffff8962d0060000
Jun 12 16:27:47 fedora kernel: RBP: ffff8962d0060000 R08: 000000000003ae80 R09: 0000000000000006
Jun 12 16:27:47 fedora kernel: R10: ffffedd78e4b8008 R11: 0000000000000000 R12: 0000000000001050
Jun 12 16:27:47 fedora kernel: R13: ffff8962d00789a8 R14: ffff8963c670dc00 R15: 0000000000000000
Jun 12 16:27:47 fedora kernel: FS:  0000000000000000(0000) GS:ffff896570ac0000(0000) knlGS:0000000000000000
Jun 12 16:27:47 fedora kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 12 16:27:47 fedora kernel: CR2: 00007f02f4053280 CR3: 000000012592a000 CR4: 00000000003506e0
Jun 12 16:27:47 fedora kernel: Call Trace:
Jun 12 16:27:47 fedora kernel:  <TASK>
Jun 12 16:27:47 fedora kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jun 12 16:27:47 fedora kernel:  ? __warn+0x81/0x130
Jun 12 16:27:47 fedora kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jun 12 16:27:47 fedora kernel:  ? report_bug+0x171/0x1a0
Jun 12 16:27:47 fedora kernel:  ? handle_bug+0x3c/0x80
Jun 12 16:27:47 fedora kernel:  ? exc_invalid_op+0x17/0x70
Jun 12 16:27:47 fedora kernel:  ? asm_exc_invalid_op+0x1a/0x20
Jun 12 16:27:47 fedora kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jun 12 16:27:47 fedora kernel:  gfx_v9_0_hw_fini+0x35/0x700 [amdgpu]
Jun 12 16:27:47 fedora kernel:  amdgpu_device_ip_suspend_phase2+0x101/0x1a0 [amdgpu]
Jun 12 16:27:47 fedora kernel:  ? amdgpu_device_ip_suspend_phase1+0x6f/0xe0 [amdgpu]
Jun 12 16:27:47 fedora kernel:  amdgpu_device_ip_suspend+0x36/0x70 [amdgpu]
Jun 12 16:27:47 fedora kernel:  amdgpu_device_pre_asic_reset+0xd3/0x2b0 [amdgpu]
Jun 12 16:27:47 fedora kernel:  amdgpu_device_gpu_recover+0x4c7/0xd60 [amdgpu]
Jun 12 16:27:47 fedora kernel:  amdgpu_job_timedout+0x18d/0x240 [amdgpu]
Jun 12 16:27:47 fedora kernel:  drm_sched_job_timedout+0x7a/0x110 [gpu_sched]
Jun 12 16:27:47 fedora kernel:  process_one_work+0x1c7/0x3d0
Jun 12 16:27:47 fedora kernel:  worker_thread+0x51/0x390
Jun 12 16:27:47 fedora kernel:  ? __pfx_worker_thread+0x10/0x10
Jun 12 16:27:47 fedora kernel:  kthread+0xde/0x110
Jun 12 16:27:47 fedora kernel:  ? __pfx_kthread+0x10/0x10
Jun 12 16:27:47 fedora kernel:  ret_from_fork+0x2c/0x50
Jun 12 16:27:47 fedora kernel:  </TASK>
Jun 12 16:27:47 fedora kernel: ---[ end trace 0000000000000000 ]---
Jun 12 16:27:47 fedora kernel: ------------[ cut here ]------------
un 12 16:27:47 fedora kernel: WARNING: CPU: 3 PID: 20481 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:599 amdgpu_irq_put+0x46/0x70 [amdgpu]
Jun 12 16:27:47 fedora kernel: Modules linked in: uinput rfcomm snd_seq_dummy snd_hrtimer nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ip_set nf_tables nf>
Jun 12 16:27:47 fedora kernel:  snd_hda_core think_lmi rapl snd_acp_config firmware_attributes_class wmi_bmof snd_hwdep pcspkr mc snd_seq cm109 cfg80211 snd_soc_acpi k10temp snd_seq_device thinkpad_acpi i2c_piix4 snd_pci_acp3x snd_pcm ledtrig_audio platform_profile snd_timer rfkill snd vfat soundcore fat i2c_scmi acpi_cpufreq joy>
Jun 12 16:27:47 fedora kernel: CPU: 3 PID: 20481 Comm: kworker/u32:5 Tainted: G        W          6.3.6-200.fc38.x86_64 #1
Jun 12 16:27:47 fedora kernel: Hardware name: LENOVO 20NECTO1WW/20NECTO1WW, BIOS R11ET32W (1.12 ) 12/23/2019
Jun 12 16:27:47 fedora kernel: Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Jun 12 16:27:47 fedora kernel: RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
Jun 12 16:27:47 fedora kernel: Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 3f eb 6f e6 e9 5a fd ff ff <0f> 0b b8 ea ff ff ff e9 2e eb 6f e6 b8 ea ff ff ff e9 24 eb 6f e6
Jun 12 16:27:47 fedora kernel: RSP: 0018:ffff95310c743c90 EFLAGS: 00010246
Jun 12 16:27:47 fedora kernel: RAX: ffff8962c1af75e0 RBX: ffff8962d0060000 RCX: 0000000000000000
Jun 12 16:27:47 fedora kernel: RDX: 0000000000000000 RSI: ffff8962d006bf08 RDI: ffff8962d0060000
Jun 12 16:27:47 fedora kernel: RBP: ffff8962d0060000 R08: 000000000003ae80 R09: 0000000000000006
Jun 12 16:27:47 fedora kernel: R10: ffffedd78e4b8008 R11: 0000000000000000 R12: 0000000000001050
Jun 12 16:27:47 fedora kernel: R13: ffff8962d00789a8 R14: ffff8963c670dc00 R15: 0000000000000000
Jun 12 16:27:47 fedora kernel: FS:  0000000000000000(0000) GS:ffff896570ac0000(0000) knlGS:0000000000000000
Jun 12 16:27:47 fedora kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 12 16:27:47 fedora kernel: CR2: 00007f02f4053280 CR3: 000000012592a000 CR4: 00000000003506e0
Jun 12 16:27:47 fedora kernel: Call Trace:
Jun 12 16:27:47 fedora kernel:  <TASK>
Jun 12 16:27:47 fedora kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jun 12 16:27:47 fedora kernel:  ? __warn+0x81/0x130
Jun 12 16:27:47 fedora kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jun 12 16:27:47 fedora kernel:  ? report_bug+0x171/0x1a0
Jun 12 16:27:47 fedora kernel:  ? handle_bug+0x3c/0x80
Jun 12 16:27:47 fedora kernel:  ? exc_invalid_op+0x17/0x70
Jun 12 16:27:47 fedora kernel:  ? asm_exc_invalid_op+0x1a/0x20
Jun 12 16:27:47 fedora kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jun 12 16:27:47 fedora kernel:  gfx_v9_0_hw_fini+0x46/0x700 [amdgpu]
Jun 12 16:27:47 fedora kernel:  amdgpu_device_ip_suspend_phase2+0x101/0x1a0 [amdgpu]
Jun 12 16:27:47 fedora kernel:  ? amdgpu_device_ip_suspend_phase1+0x6f/0xe0 [amdgpu]
Jun 12 16:27:47 fedora kernel:  amdgpu_device_ip_suspend+0x36/0x70 [amdgpu]
Jun 12 16:27:47 fedora kernel:  amdgpu_device_pre_asic_reset+0xd3/0x2b0 [amdgpu]
Jun 12 16:27:47 fedora kernel:  amdgpu_device_gpu_recover+0x4c7/0xd60 [amdgpu]
Jun 12 16:27:47 fedora kernel:  amdgpu_job_timedout+0x18d/0x240 [amdgpu]
Jun 12 16:27:47 fedora kernel:  drm_sched_job_timedout+0x7a/0x110 [gpu_sched]
Jun 12 16:27:47 fedora kernel:  process_one_work+0x1c7/0x3d0
Jun 12 16:27:47 fedora kernel:  worker_thread+0x51/0x390
Jun 12 16:27:47 fedora kernel:  ? __pfx_worker_thread+0x10/0x10
Jun 12 16:27:47 fedora kernel:  kthread+0xde/0x110
Jun 12 16:27:47 fedora kernel:  ? __pfx_kthread+0x10/0x10
Jun 12 16:27:47 fedora kernel:  ret_from_fork+0x2c/0x50
Jun 12 16:27:47 fedora kernel:  </TASK>
Jun 12 16:27:47 fedora kernel: ---[ end trace 0000000000000000 ]---
Jun 12 16:27:47 fedora kernel: ------------[ cut here ]------------
Jun 12 16:27:47 fedora kernel: WARNING: CPU: 3 PID: 20481 at drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c:599 amdgpu_irq_put+0x46/0x70 [amdgpu]
Jun 12 16:27:47 fedora kernel: Modules linked in: uinput rfcomm snd_seq_dummy snd_hrtimer nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ip_set nf_tables nf>
Jun 12 16:27:47 fedora kernel:  snd_hda_core think_lmi rapl snd_acp_config firmware_attributes_class wmi_bmof snd_hwdep pcspkr mc snd_seq cm109 cfg80211 snd_soc_acpi k10temp snd_seq_device thinkpad_acpi i2c_piix4 snd_pci_acp3x snd_pcm ledtrig_audio platform_profile snd_timer rfkill snd vfat soundcore fat i2c_scmi acpi_cpufreq joy>
Jun 12 16:27:47 fedora kernel: CPU: 3 PID: 20481 Comm: kworker/u32:5 Tainted: G        W          6.3.6-200.fc38.x86_64 #1
Jun 12 16:27:47 fedora kernel: Hardware name: LENOVO 20NECTO1WW/20NECTO1WW, BIOS R11ET32W (1.12 ) 12/23/2019
Jun 12 16:27:47 fedora kernel: Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Jun 12 16:27:47 fedora kernel: RIP: 0010:amdgpu_irq_put+0x46/0x70 [amdgpu]
Jun 12 16:27:47 fedora kernel: Code: c0 74 33 48 8b 4e 10 48 83 39 00 74 29 89 d1 48 8d 04 88 8b 08 85 c9 74 11 f0 ff 08 74 07 31 c0 e9 3f eb 6f e6 e9 5a fd ff ff <0f> 0b b8 ea ff ff ff e9 2e eb 6f e6 b8 ea ff ff ff e9 24 eb 6f e6
Jun 12 16:27:47 fedora kernel: RSP: 0018:ffff95310c743ca8 EFLAGS: 00010246
Jun 12 16:27:47 fedora kernel: RAX: ffff8962cdf77e68 RBX: ffff8962d0060000 RCX: 0000000000000000
Jun 12 16:27:47 fedora kernel: RDX: 0000000000000000 RSI: ffff8962d0060c48 RDI: ffff8962d0060000
Jun 12 16:27:47 fedora kernel: RBP: ffff8962d0060000 R08: 0000000000000000 R09: 0000000000000000
Jun 12 16:27:47 fedora kernel: R10: 0000000000000001 R11: 0000000000000100 R12: 0000000000001050
Jun 12 16:27:47 fedora kernel: R13: ffff8962d00789a8 R14: ffff8963c670dc00 R15: 0000000000000000
Jun 12 16:27:47 fedora kernel: FS:  0000000000000000(0000) GS:ffff896570ac0000(0000) knlGS:0000000000000000
Jun 12 16:27:47 fedora kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 12 16:27:47 fedora kernel: CR2: 00007f02f4053280 CR3: 000000012592a000 CR4: 00000000003506e0
Jun 12 16:27:47 fedora kernel: Call Trace:
Jun 12 16:27:47 fedora kernel:  <TASK>
Jun 12 16:27:47 fedora kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jun 12 16:27:47 fedora kernel:  ? __warn+0x81/0x130
Jun 12 16:27:47 fedora kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jun 12 16:27:47 fedora kernel:  ? report_bug+0x171/0x1a0
Jun 12 16:27:47 fedora kernel:  ? handle_bug+0x3c/0x80
Jun 12 16:27:47 fedora kernel:  ? exc_invalid_op+0x17/0x70
Jun 12 16:27:47 fedora kernel:  ? asm_exc_invalid_op+0x1a/0x20
Jun 12 16:27:47 fedora kernel:  ? amdgpu_irq_put+0x46/0x70 [amdgpu]
Jun 12 16:27:47 fedora kernel:  ? __pfx_mmhub_v1_0_update_power_gating+0x10/0x10 [amdgpu]
Jun 12 16:27:47 fedora kernel:  gmc_v9_0_hw_fini+0x6d/0x90 [amdgpu]
Jun 12 16:27:47 fedora kernel:  amdgpu_device_ip_suspend_phase2+0x101/0x1a0 [amdgpu]
Jun 12 16:27:47 fedora kernel:  ? amdgpu_device_ip_suspend_phase1+0x6f/0xe0 [amdgpu]
Jun 12 16:27:47 fedora kernel:  amdgpu_device_ip_suspend+0x36/0x70 [amdgpu]
Jun 12 16:27:47 fedora kernel:  amdgpu_device_pre_asic_reset+0xd3/0x2b0 [amdgpu]
Jun 12 16:27:47 fedora kernel:  amdgpu_device_gpu_recover+0x4c7/0xd60 [amdgpu]
Jun 12 16:27:47 fedora kernel:  amdgpu_job_timedout+0x18d/0x240 [amdgpu]
Jun 12 16:27:47 fedora kernel:  drm_sched_job_timedout+0x7a/0x110 [gpu_sched]
Jun 12 16:27:47 fedora kernel:  process_one_work+0x1c7/0x3d0
Jun 12 16:27:47 fedora kernel:  worker_thread+0x51/0x390
Jun 12 16:27:47 fedora kernel:  ? __pfx_worker_thread+0x10/0x10
Jun 12 16:27:47 fedora kernel:  kthread+0xde/0x110
Jun 12 16:27:47 fedora kernel:  ? __pfx_kthread+0x10/0x10
Jun 12 16:27:47 fedora kernel:  ret_from_fork+0x2c/0x50
Jun 12 16:27:47 fedora kernel:  </TASK>
Jun 12 16:27:47 fedora kernel: ---[ end trace 0000000000000000 ]---
Jun 12 16:27:47 fedora kernel: amdgpu 0000:05:00.0: amdgpu: MODE2 reset
Jun 12 16:27:47 fedora kernel: amdgpu 0000:05:00.0: amdgpu: GPU reset succeeded, trying to resume
Jun 12 16:27:47 fedora kernel: [drm] PCIE GART of 1024M enabled.
Jun 12 16:27:47 fedora kernel: [drm] PTB located at 0x000000F400A00000
Jun 12 16:27:47 fedora kernel: [drm] PSP is resuming...
Jun 12 16:27:47 fedora kernel: [drm] reserve 0x400000 from 0xf401c00000 for PSP TMR
Jun 12 16:27:47 fedora kernel: amdgpu 0000:05:00.0: amdgpu: RAS: optional ras ta ucode is not available
Jun 12 16:27:47 fedora kernel: amdgpu 0000:05:00.0: amdgpu: RAP: optional rap ta ucode is not available
Jun 12 16:27:48 fedora kernel: [drm] kiq ring mec 2 pipe 1 q 0
Jun 12 16:27:48 fedora kernel: [drm] VCN decode and encode initialized successfully(under SPG Mode).
Jun 12 16:27:48 fedora kernel: amdgpu 0000:05:00.0: amdgpu: ring gfx uses VM inv eng 0 on hub 0
Jun 12 16:27:48 fedora kernel: amdgpu 0000:05:00.0: amdgpu: ring gfx_low uses VM inv eng 1 on hub 0
Jun 12 16:27:48 fedora kernel: amdgpu 0000:05:00.0: amdgpu: ring gfx_high uses VM inv eng 4 on hub 0
Jun 12 16:27:48 fedora kernel: amdgpu 0000:05:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 5 on hub 0
Jun 12 16:27:48 fedora kernel: amdgpu 0000:05:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 6 on hub 0
Jun 12 16:27:48 fedora kernel: amdgpu 0000:05:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 7 on hub 0
Jun 12 16:27:48 fedora kernel: amdgpu 0000:05:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 8 on hub 0
Jun 12 16:27:48 fedora kernel: amdgpu 0000:05:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 9 on hub 0
Jun 12 16:27:48 fedora kernel: amdgpu 0000:05:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 10 on hub 0
Jun 12 16:27:48 fedora kernel: amdgpu 0000:05:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 11 on hub 0
Jun 12 16:27:48 fedora kernel: amdgpu 0000:05:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 12 on hub 0
Jun 12 16:27:48 fedora kernel: amdgpu 0000:05:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 13 on hub 0
Jun 12 16:27:48 fedora kernel: amdgpu 0000:05:00.0: amdgpu: ring sdma0 uses VM inv eng 0 on hub 1
Jun 12 16:27:48 fedora kernel: amdgpu 0000:05:00.0: amdgpu: ring vcn_dec uses VM inv eng 1 on hub 1
Jun 12 16:27:48 fedora kernel: amdgpu 0000:05:00.0: amdgpu: ring vcn_enc0 uses VM inv eng 4 on hub 1
Jun 12 16:27:48 fedora kernel: amdgpu 0000:05:00.0: amdgpu: ring vcn_enc1 uses VM inv eng 5 on hub 1
Jun 12 16:27:48 fedora kernel: amdgpu 0000:05:00.0: amdgpu: ring jpeg_dec uses VM inv eng 6 on hub 1
Jun 12 16:27:48 fedora kernel: amdgpu 0000:05:00.0: amdgpu: recover vram bo from shadow start
Jun 12 16:27:48 fedora kernel: amdgpu 0000:05:00.0: amdgpu: recover vram bo from shadow done
Jun 12 16:27:48 fedora kernel: amdgpu 0000:05:00.0: amdgpu: GPU reset(3) succeeded!
Jun 12 16:27:48 fedora gnome-shell[2024]: amdgpu: The CS has been rejected (-125), but the context isn't robust.
Jun 12 16:27:48 fedora kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Jun 12 16:27:48 fedora gnome-shell[2024]: amdgpu: The process will be terminated.

24.06.2023

Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:1 pasid:32769, for process gnome-s>
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x0000800106820000 from IH client 0x1b (UTCL2)
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00141051
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: TCP (0x8)
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x1
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x5
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:1 pasid:32769, for process gnome-s>
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x0000800106821000 from IH client 0x1b (UTCL2)
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00141051
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: TCP (0x8)
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x1
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x5
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:1 pasid:32769, for process gnome-s>
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x0000800106823000 from IH client 0x1b (UTCL2)
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00141051
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: TCP (0x8)
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x1
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x5
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:1 pasid:32769, for process gnome-s>
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x0000800106827000 from IH client 0x1b (UTCL2)
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00141051
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: TCP (0x8)
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x1
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x5
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:1 pasid:32769, for process gnome-s>
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x0000800106822000 from IH client 0x1b (UTCL2)
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00141051
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: TCP (0x8)
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x1
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x5
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:1 pasid:32769, for process gnome-s>
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x0000800106825000 from IH client 0x1b (UTCL2)
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00141051
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: TCP (0x8)
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x1
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x5
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:1 pasid:32769, for process gnome-s>
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x0000800106824000 from IH client 0x1b (UTCL2)
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00141051
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: TCP (0x8)
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x1
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x5
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:1 pasid:32769, for process gnome-s>
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x0000800106826000 from IH client 0x1b (UTCL2)
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00141051
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: TCP (0x8)
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x1
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x5
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:1 pasid:32769, for process gnome-s>
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x0000800106828000 from IH client 0x1b (UTCL2)
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00141051
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: TCP (0x8)
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x1
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x5
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
Jun 24 15:22:33 fedora kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_low timeout, but soft recovered
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu: [gfxhub0] retry page fault (src_id:0 ring:0 vmid:1 pasid:32769, for process gnome-s>
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:   in page starting at address 0x0000800106829000 from IH client 0x1b (UTCL2)
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00141051
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          Faulty UTCL2 client ID: TCP (0x8)
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          MORE_FAULTS: 0x1
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          WALKER_ERROR: 0x0
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          PERMISSION_FAULTS: 0x5
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          MAPPING_ERROR: 0x0
Jun 24 15:22:33 fedora kernel: amdgpu 0000:05:00.0: amdgpu:          RW: 0x1

I’m no expert. But …

It doesn’t look like you are the only one seeing this error.

ABRT Analytics

From the above link, it looks like the error started showing up around 2023-05-31 with 6.3.5+ kernels on Fedora Linux releases 37 and 38. Presumably, you could work around the problem by selecting an older kernel.

I see mentions of “suspend” and “resume” in the AMD GPU driver in the bug trace you posted. A little googling appears to show that there indeed has been recent work on the power saving functionality in the AMD GPU driver.

If you can temporarily disable power saving somehow, that might provide another workaround.

Thank you for the information.

Is there anything I can do? Default to e.g. the 6.1 lts kernel?
I’m quite new to fedora, so I’m not familiar with the system.

On parallel, I will check, if I find some place to report this error(s) to the kernel community.

There are some instructions on how to “pin” an earlier kernel version so that it will be automatically selected during boot here:

Running the command rpm -q kernel should work to list what kernels you currently have installed on your system. Hopefully one of them is older than 6.3.5.