I don’t know if anyone find any of these dmesg logs entries look unusual. Unless I remember wrong, the log [tor jul 18 20:07:26 2024] was also present when booting on 6.9.7 kernel, so it should be fine as well.
sudo dmesg -T | grep 0000:03:00.0
[tor jul 18 20:07:24 2024] pci 0000:03:00.0: [8086:56a1] type 00 class 0x030000 PCIe Endpoint
[tor jul 18 20:07:24 2024] pci 0000:03:00.0: BAR 0 [mem 0x82000000-0x82ffffff 64bit]
[tor jul 18 20:07:24 2024] pci 0000:03:00.0: BAR 2 [mem 0x4000000000-0x41ffffffff 64bit pref]
[tor jul 18 20:07:24 2024] pci 0000:03:00.0: ROM [mem 0x83000000-0x831fffff pref]
[tor jul 18 20:07:24 2024] pci 0000:03:00.0: ASPM: overriding L1 acceptable latency from 0x0 to 0x7
[tor jul 18 20:07:24 2024] pci 0000:03:00.0: PME# supported from D0 D3hot
[tor jul 18 20:07:24 2024] pci 0000:03:00.0: vgaarb: setting as boot VGA device
[tor jul 18 20:07:24 2024] pci 0000:03:00.0: vgaarb: bridge control possible
[tor jul 18 20:07:24 2024] pci 0000:03:00.0: vgaarb: VGA device added: decodes=io+mem,owns=none,locks=none
[tor jul 18 20:07:24 2024] pci 0000:03:00.0: Adding to iommu group 20
[tor jul 18 20:07:26 2024] i915 0000:03:00.0: [drm] VT-d active for gfx access
[tor jul 18 20:07:26 2024] i915 0000:03:00.0: vgaarb: deactivate vga console
[tor jul 18 20:07:26 2024] i915 0000:03:00.0: vgaarb: VGA decodes changed: olddecodes=io+mem,decodes=io+mem:owns=none
[tor jul 18 20:07:26 2024] i915 0000:03:00.0: [drm] Finished loading DMC firmware i915/dg2_dmc_ver2_08.bin (v2.8)
[tor jul 18 20:07:26 2024] i915 0000:03:00.0: [drm] GT0: GuC firmware i915/dg2_guc_70.bin version 70.20.0
[tor jul 18 20:07:26 2024] i915 0000:03:00.0: [drm] GT0: HuC firmware i915/dg2_huc_gsc.bin version 7.10.15
[tor jul 18 20:07:26 2024] i915 0000:03:00.0: [drm] GT0: GUC: submission enabled
[tor jul 18 20:07:26 2024] i915 0000:03:00.0: [drm] GT0: GUC: SLPC enabled
[tor jul 18 20:07:26 2024] i915 0000:03:00.0: [drm] GT0: GUC: RC enabled
[tor jul 18 20:07:26 2024] [drm] Initialized i915 1.6.0 20230929 for 0000:03:00.0 on minor 1
[tor jul 18 20:07:26 2024] rc rc0: DP-3 as /devices/pci0000:00/0000:00:01.0/0000:01:00.0/0000:02:01.0/0000:03:00.0/rc/rc0
[tor jul 18 20:07:26 2024] input: DP-3 as /devices/pci0000:00/0000:00:01.0/0000:01:00.0/0000:02:01.0/0000:03:00.0/rc/rc0/input3
[tor jul 18 20:07:26 2024] i915 0000:03:00.0: [drm] Reducing the compressed framebuffer size. This may lead to less power savings than a non-reduced-size. Try to increase stolen memory size if available in BIOS.
[tor jul 18 20:07:26 2024] i915 0000:03:00.0: [drm] fb0: i915drmfb frame buffer device
[tor jul 18 20:07:36 2024] i915 0000:03:00.0: [drm] GT0: HuC: timed out waiting for MEI GSC
[tor jul 18 20:09:53 2024] snd_hda_intel 0000:04:00.0: bound 0000:03:00.0 (ops i915_audio_component_bind_ops [i915])
[tor jul 18 20:09:53 2024] i915 0000:03:00.0: [drm] GT0: HuC: authenticated for all workloads
[tor jul 18 20:09:53 2024] mei_pxp i915.mei-gsc.768-fbf6fcf1-96cf-4e2e-a6a6-1bab8cbe36b1: bound 0000:03:00.0 (ops i915_pxp_tee_component_ops [i915])
Which leads me to believe the driver itself are probably good, unless it causes the race condition or something else does. Any advice on how I would go by to find out?
I reinstalled my workstation with the exact same setup. upgrading directly to 6.9.9 everything works as it should. I however upgraded my raspberry pi 4 from 6.9.7 to 6.9.8 and it also failed the at the same place as my workstation did. So something tells me that 6.9.8 aren’t 100% and whatever it is, it lingers around in future upgrades.