Arc GPU won't load on Fedora 38 kernel versions > 6.4.12

No kernel version after 6.4.12 will boot. I pulled these errors out of the logs:

Nov 05 17:56:01 fedora kernel: integrity: Problem loading X.509 certificate -126
Nov 05 17:56:01 fedora kernel: integrity: Problem loading X.509 certificate -126
Nov 05 17:56:04 fedora kernel: i915 0000:03:00.0: [drm] *ERROR* Device is non-operational; MMIO access returns 0xFFFFFFFF!
Nov 05 17:56:04 fedora kernel: i915 0000:03:00.0: Device initialization failed (-5)
Nov 05 17:56:06 fedora kernel: usb 3-3.2.1: 3:1: cannot get freq at ep 0x84
Nov 05 17:56:06 fedora kernel: iwlwifi 0000:00:14.3: WRT: Invalid buffer destination
Nov 05 17:56:08 fedora kernel: iwlwifi 0000:00:14.3: WRT: Invalid buffer destination
Nov 05 17:56:08 fedora kernel: Bluetooth: hci0: Malformed MSFT vendor event: 0x02
Nov 05 17:56:08 fedora kernel: iwlwifi 0000:00:14.3: WRT: Invalid buffer destination
Nov 05 17:56:10 fedora kernel: usb 3-3.2.1: 3:1: cannot get freq at ep 0x84
Nov 05 17:56:10 fedora kernel: usb 3-3.2.1: 3:1: cannot get freq at ep 0x84

The system crashes immediately on boot, and the output above is the only -p3 output in journalctl. These lines look like the relevant ones:

Nov 05 17:56:04 fedora kernel: i915 0000:03:00.0: [drm] *ERROR* Device is non-operational; MMIO access returns 0xFFFFFFFF!
Nov 05 17:56:04 fedora kernel: i915 0000:03:00.0: Device initialization failed (-5)

This is the Arc A770M GPU in my Intel NUC. It works on every kernel up to 6.4.12; anything newer won’t boot at all (I have tried as far as 6.5.10). I have pinned the default to 6.4.12 so I can continue to boot and do my work.

I have tried everything I can think of, but something has definitely been introduced in (or removed from) 6.4.13+ that is crashing the Arc GPU.

Does anyone have any idea what I can try to get around this? I looked at what changed in 6.4.13, but nothing jumped out as the cause. I am still loading an i915 conf file with force_probe to make sure the Arc GPU is picked up. Is this still required?

If I boot into 6.4.12, then reboot and select the 6.5.10 kernel from the grub2 menu, it boots just fine with a recognized GPU. But it won’t cold boot into the newer kernels. This is just like the issue I had a year ago, except that none of the things that worked then are working this time.

After a warm reboot from 6.4.12 into 6.5.10, the logs look the same as a successful 6.4.12 boot.

I am at a complete loss as to what to do. Does anyone know?

Additional log output from a failed boot:


Nov 08 21:42:07 fedora kernel: Setting dangerous option force_probe - tainting kernel
Nov 08 21:42:07 fedora kernel: i915 0000:00:02.0: enabling device (0006 -> 0007)
Nov 08 21:42:07 fedora systemd[1]: Finished initrd-parse-etc.service - Mountpoints Configured in the Real Root.
Nov 08 21:42:07 fedora kernel: i915 0000:00:02.0: [drm] VT-d active for gfx access
Nov 08 21:42:07 fedora kernel: i915 0000:00:02.0: [drm] Using Transparent Hugepages
Nov 08 21:42:07 fedora systemd[1]: Reached target initrd-fs.target - Initrd File Systems.
Nov 08 21:42:07 fedora systemd[1]: Reached target initrd.target - Initrd Default Target.
Nov 08 21:42:07 fedora systemd[1]: dracut-mount.service - dracut mount hook was skipped because no trigger condition checks were met.
Nov 08 21:42:07 fedora systemd[1]: Starting dracut-pre-pivot.service - dracut pre-pivot and cleanup hook...
Nov 08 21:42:07 fedora kernel: i915 0000:00:02.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=none:owns=mem
Nov 08 21:42:08 fedora kernel: i915 0000:00:02.0: [drm] Finished loading DMC firmware i915/adlp_dmc.bin (v2.20)
Nov 08 21:42:08 fedora kernel: i915 0000:00:02.0: [drm] GT0: GuC firmware i915/adlp_guc_70.bin version 70.13.1
Nov 08 21:42:08 fedora kernel: i915 0000:00:02.0: [drm] GT0: HuC firmware i915/tgl_huc.bin version 7.9.3
Nov 08 21:42:08 fedora kernel: i915 0000:00:02.0: [drm] GT0: HuC: authenticated for all workloads
Nov 08 21:42:08 fedora kernel: i915 0000:00:02.0: [drm] GT0: GUC: submission enabled
Nov 08 21:42:08 fedora kernel: i915 0000:00:02.0: [drm] GT0: GUC: SLPC enabled
Nov 08 21:42:08 fedora kernel: i915 0000:00:02.0: [drm] GT0: GUC: RC enabled
Nov 08 21:42:08 fedora kernel: i915 0000:00:02.0: [drm] Protected Xe Path (PXP) protected content support initialized
Nov 08 21:42:08 fedora kernel: [drm] Initialized i915 1.6.0 20201103 for 0000:00:02.0 on minor 1
Nov 08 21:42:08 fedora kernel: ACPI: video: Video Device [PEGE] (multi-head: yes  rom: no  post: no)
Nov 08 21:42:08 fedora kernel: input: Video Bus as /devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A08:00/device:00/device:01/device:02/LNXVIDEO:00/input/input5
Nov 08 21:42:08 fedora kernel: ACPI: video: Video Device [GFX0] (multi-head: yes  rom: no  post: no)
Nov 08 21:42:08 fedora kernel: input: Video Bus as /devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A08:00/LNXVIDEO:01/input/input6
Nov 08 21:42:08 fedora kernel: i915 0000:00:02.0: [drm] Cannot find any crtc or sizes
Nov 08 21:42:08 fedora kernel: i915 0000:00:02.0: [drm] Cannot find any crtc or sizes
Nov 08 21:42:08 fedora kernel: i915 0000:00:02.0: [drm] Cannot find any crtc or sizes
Nov 08 21:42:08 fedora kernel: usb 3-3.2.4: New USB device found, idVendor=046d, idProduct=0ab7, bcdDevice= 0.20
Nov 08 21:42:08 fedora kernel: usb 3-3.2.4: New USB device strings: Mfr=3, Product=1, SerialNumber=2
Nov 08 21:42:08 fedora kernel: usb 3-3.2.4: Product: Blue Microphones
Nov 08 21:42:08 fedora kernel: usb 3-3.2.4: Manufacturer: Generic
Nov 08 21:42:08 fedora kernel: usb 3-3.2.4: SerialNumber: 2107BAB010J8
Nov 08 21:42:08 fedora kernel: input: Generic Blue Microphones Consumer Control as /devices/pci0000:00/0000:00:14.0/usb3/3-3/3-3.2/3-3.2.4/3-3.2.4:1.3/0003:046D:0AB7.0003/input/input7
Nov 08 21:42:08 fedora kernel: input: Generic Blue Microphones as /devices/pci0000:00/0000:00:14.0/usb3/3-3/3-3.2/3-3.2.4/3-3.2.4:1.3/0003:046D:0AB7.0003/input/input8
Nov 08 21:42:08 fedora kernel: hid-generic 0003:046D:0AB7.0003: input,hiddev97,hidraw2: USB HID v1.11 Device [Generic Blue Microphones] on usb-0000:00:14.0-3.2.4/input3
Nov 08 21:42:10 fedora kernel: i915 0000:03:00.0: [drm] *ERROR* Device is non-operational; MMIO access returns 0xFFFFFFFF!
Nov 08 21:42:10 fedora kernel: i915 0000:03:00.0: Device initialization failed (-5)
Nov 08 21:42:10 fedora kernel: i915: probe of 0000:03:00.0 failed with error -5
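
For anyone comparing their own logs against this one, here is a rough sketch of how the i915 lines could be grouped by PCI address to see which device came up and which one failed. It assumes the default journalctl line format shown above; the function name summarize_i915 and the "failed"/"*ERROR*" heuristic are made up for illustration and are not part of any tool. In the log above, 0000:00:02.0 loads Alder Lake (adlp) firmware and initializes, so it looks like the integrated GPU, while 0000:03:00.0 is the device that fails.

```python
import re
from collections import defaultdict

# Matches kernel lines emitted by the i915 driver for a specific PCI device,
# e.g. "... kernel: i915 0000:03:00.0: Device initialization failed (-5)".
I915_LINE = re.compile(
    r"kernel: i915 (?P<pci>[0-9a-f]{4}:[0-9a-f]{2}:[0-9a-f]{2}\.[0-9a-f]): (?P<msg>.*)"
)


def summarize_i915(lines):
    """Group i915 messages by PCI address and flag devices that logged a failure.

    The failure check is a simple heuristic (an assumption, not a kernel
    convention): a message counts as a failure if it contains "*ERROR*"
    or the word "failed".
    """
    messages = defaultdict(list)
    for line in lines:
        match = I915_LINE.search(line)
        if match:
            messages[match.group("pci")].append(match.group("msg"))
    return {
        pci: {
            "failed": any("*ERROR*" in msg or "failed" in msg for msg in msgs),
            "messages": msgs,
        }
        for pci, msgs in messages.items()
    }


# A few lines copied from the failed boot above.
SAMPLE = """\
Nov 08 21:42:08 fedora kernel: i915 0000:00:02.0: [drm] GT0: GUC: submission enabled
Nov 08 21:42:10 fedora kernel: i915 0000:03:00.0: [drm] *ERROR* Device is non-operational; MMIO access returns 0xFFFFFFFF!
Nov 08 21:42:10 fedora kernel: i915 0000:03:00.0: Device initialization failed (-5)
""".splitlines()

if __name__ == "__main__":
    for pci, info in summarize_i915(SAMPLE).items():
        status = "FAILED" if info["failed"] else "ok"
        print(pci, status, len(info["messages"]), "message(s)")
```

To run it against a real boot, the kernel messages from the previous boot (for example the output of journalctl -k -b -1 saved to a file) could be read line by line and passed to summarize_i915 in place of SAMPLE.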

You could raise a kernel bug in the Fedora bug tracker.

You could also ask on the intel-gfx mailing list.

Will do.

Found another thread of someone saying the same exact thing:

https://www.reddit.com/r/Fedora/comments/17amzh9/kernel_versions_65x_wont_work_on_intel_nuc_12/

The reason I suggested the intel-gfx list is that it is where the developers who write the Arc driver hang out.
