Sporadic kernel panics

Hello,

I’ve HP EliteBook 840 14 inch G11 Notebook with 32GB of RAM and Intel Core Ultra 7 155H CPU. The GPU is embedded in this CPU. The BIOS is latest: BIOS W70 Ver. 01.07.02 09/24/2025. I use Fedora 42 with Cinnamon DE and with all latest updates. This is the only OS on this machine.

Sporadically this system hangs forever. Today, with recently installed kernel 6.17.4 (previously used 6.16.x) this happened again but this time, after a few more seconds, two screens became black with a message at the center of the compound two screens (internal and external) telling that there is a kernel panic and I have to reboot. Looks like an improved panic message of the same panic I had with 6.16.x too.

Is it a known issue? Is there any known workaround?

On the following URL you can find the output of the sudo journalctl -b -1 -k command that I ran right after the next boot after the panic:

The main cause of this crash seems to be i915 (your onboard graphics chip).

You say it’s sporadic, but can you replicate it with this kernel reliably - it’s triggered by suspend/resume and is likely to be some interaction with the kernel and firmware, so forcing a suspend/resume would be a good way to test its repeatability.

  • Replicate it.
  • Update the firmware with fwupdmgr refresh --force && fwupdmgr update
  • Replicate it again.
  • Add the kernel parameter i915.enable_guc=0 to this kernel, boot it and replicate the issue again. This will force the i915 chip to not use its driver and encourage the CPU to schedule tasks on it. Obviously we’d prefer to not have to do this permanently, so it’s a one-time attempt to see if you can still replicate the issue.

Post back the results of that testing - it’ll help show if the issue is firmware, the kernel, an interaction between the two and so on.

1 Like

Unfortunately I don’t know how to replicate it, i.e. this panic is completely sporadic. It could happen once in a few weeks or a few times in a day.

All the firmware is the latest. I used both the fwupdmgr and the internal ability of the BIOS to update itself from the hp.com site. The fwupdmgr brought the previous 01.07.01 version of the BIOS and such panics happened with that version and with the previous version too. Later the internal BIOS updater brought the latest 01.07.02 version, so now there is no any newer BIOS/firmware that I can install or update.

Can the kernel panic mechanism itself or logging of panics be configure to be more verbosely and informative? So next time such a panic happens I would have more information to share.

You can - you can add the kernel parameter ignore_loglevel to the curently booted kernel - it’ll log everything all of the time though, so unless you can find a way to trigger this issue fairly reliably, you’re going to have lots of output in the journal.

That’s why I wanted to you try to replicate the issue - if you can’t replicate it, you never know if it’s fixed.

See this for more details on logging.

I just want to find more information that will help the kernel maintainers to fix this bug and probably to find a way of replicating it programmatically.

If I add that ignore_loglevel kernel parameter, could it slow my computer down or could it allocate too much disk space? For example during 2 - 3 months.

Will I have to add that ignore_loglevel kernel parameter again and again after each kernel update?

Yes, it will have an impact on the overall performance of the kernel, as it’ll be logging more data, more frequently and subsequently consuming more disk space. It’ll be using more CPU and more battery as it’s doing more work.

As for how much disk space it’ll use - no idea - depends on how much it logs which will vary from machine to machine.

If you add it to the kernel at boot time, by hitting e on the grub screen and placing it on the end of the existing kernel parameters, it’ll only be in effect for that boot. Subsequent boots will revert to normal.

If you want it logging more all the time, you can add it to the list of standard parameters which are applied to all kernels using grubby. I suspect you do not want to do this.

In other words setting verbose logging for a long time is not practical. Since I can’t reproduce this bug reliable maybe there is something else that I can do? Maybe enabling crash dumps of the kernel will be useful? Will it help to find a reliable way of reproducing this issue later programmatically? How to do that? Should I also install the kernel-debug package?

I suppose it only has to be triggered once with verbose logging on (and to be honest, I’m not even sure if you’ll get any additional information that’s useful - I’ve never needed to apply full-blast logging myself so I don’t know how much extra it spits out.

You oculd always try it and perform a few suspend/resume cycles which often is the trigger for i915 causes - there’s evidently something in the hardware or the standard driver which doesn’t deal perfectly with suspend and resume, I suspect.

I’ll have a crack at it now with my own system - rebooting with an ignore_loglevel parameter.

Doesn’t seem particularly different to me - I’d say give it a shot; I’m not sure if you’ll actively notice any difference. It’s not a fire-hose of output spewing out, until something does crash when I assume you’ll get a lot more debug output.

After further observations I can say that this is not always a kernel panic but in most cases just a complete hang of the system. Today I experienced the same hang but fortunately it freed after 20 - 30 seconds. Then I found the following message printed in a terminal:

libva info: VA-API version 1.22.0
libva info: Trying to open /usr/lib64/dri-nonfree/iHD_drv_video.so
libva info: Found init function __vaDriverInit_1_22
libva info: va_openDriver() returns 0
[GFX1-]: RenderCompositorSWGL failed mapping default framebuffer, no dt

Google say the last line about RenderCompositorSWGL relates to Firefox. Could all this trigger a complete system hang in other circumstances? I mean a chain of failures like: Firfox –> libva –> i915 –> hang

Hope this information could help to fix this bug properly.

Yet another system hang just happened. Following logs are taken from the output of journalctl -b -1

Nov 09 16:11:01 fedora kernel: i915 0000:00:02.0: [drm] *ERROR* [CRTC:202:pipe C] PLANE ATS fault
Nov 09 16:11:01 fedora kernel: i915 0000:00:02.0: [drm] *ERROR* [CRTC:202:pipe C][PLANE:147:plane 1C] fault (CTL=0x84000000, SURF=0x5474000, SURFLIVE=0x5474000)
Nov 09 16:11:01 fedora kernel: i915 0000:00:02.0: [drm] *ERROR* [CRTC:202:pipe C] PLANE ATS fault
Nov 09 16:11:01 fedora kernel: i915 0000:00:02.0: [drm] *ERROR* [CRTC:202:pipe C][PLANE:147:plane 1C] fault (CTL=0x84000000, SURF=0x5474000, SURFLIVE=0x5474000)
Nov 09 16:11:01 fedora kernel: i915 0000:00:02.0: [drm] *ERROR* CPU pipe B FIFO underrun
Nov 09 16:11:01 fedora kernel: BUG: unable to handle page fault for address: fffffeffc0d397c0
Nov 09 16:11:01 fedora kernel: #PF: supervisor read access in kernel mode
Nov 09 16:11:01 fedora kernel: #PF: error_code(0x0000) - not-present page
Nov 09 16:11:01 fedora kernel: PGD 0 P4D 0 
Nov 09 16:11:01 fedora kernel: Oops: Oops: 0000 [#1] SMP NOPTI
Nov 09 16:11:01 fedora kernel: CPU: 3 UID: 1000 PID: 2324 Comm: cinnamon Not tainted 6.17.7-200.fc42.x86_64 #1 PREEMPT(lazy) 
Nov 09 16:11:01 fedora kernel: Hardware name: HP HP EliteBook 840 14 inch G11 Notebook PC/8C26, BIOS W70 Ver. 01.07.02 09/24/2025
Nov 09 16:11:01 fedora kernel: RIP: 0010:find_fw_domain+0x39/0xf0 [i915]
Nov 09 16:11:01 fedora kernel: Code: 03 00 77 04 44 03 47 24 41 8b 72 38 31 d2 39 f2 73 4c 89 f0 29 d0 d1 e8 8d 0c 10 48 8d 3c 49 48 89 c8 49 8b 4a 30 4c 8d 0c b9 <45> 3b 01 72 36 45 39 41 04 72 22 4d 85 c9 74 24 41 8b 41 08 41 8b
Nov 09 16:11:01 fedora kernel: RSP: 0018:ffffd1e0ceac35a0 EFLAGS: 00010002
Nov 09 16:11:01 fedora kernel: RAX: 0000000000000010 RBX: 0000000000000000 RCX: fffffeffc0d39700
Nov 09 16:11:01 fedora kernel: RDX: 0000000000000000 RSI: 0000000000000020 RDI: 0000000000000030
Nov 09 16:11:01 fedora kernel: RBP: ffff88f5e0fac2a0 R08: 000000000000a500 R09: fffffeffc0d397c0
Nov 09 16:11:01 fedora kernel: R10: ffff88f5e0f92748 R11: ffff88f5cf54bc70 R12: ffff88f5e0fac028
Nov 09 16:11:01 fedora kernel: R13: 0000000000000001 R14: ffff88f5e0f92748 R15: ffff88f5cf54bc00
Nov 09 16:11:01 fedora kernel: FS:  00007f8a7a4de680(0000) GS:ffff88fd8ebc4000(0000) knlGS:0000000000000000
Nov 09 16:11:01 fedora kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Nov 09 16:11:01 fedora kernel: CR2: fffffeffc0d397c0 CR3: 000000015b0ba003 CR4: 0000000000f72ef0
Nov 09 16:11:01 fedora kernel: PKRU: 55555554
Nov 09 16:11:01 fedora kernel: Call Trace:
Nov 09 16:11:01 fedora kernel:  <TASK>
Nov 09 16:11:01 fedora kernel:  intel_uncore_forcewake_for_reg+0x36/0x130 [i915]
Nov 09 16:11:01 fedora kernel:  guc_update_pm_timestamp+0x41/0x140 [i915]
Nov 09 16:11:01 fedora kernel:  intel_guc_busyness_unpark+0x6d/0xc0 [i915]
Nov 09 16:11:01 fedora kernel:  __gt_unpark+0x5c/0x80 [i915]
Nov 09 16:11:01 fedora kernel:  __intel_wakeref_get_first+0x44/0xd0 [i915]
Nov 09 16:11:01 fedora kernel:  eb_select_engine+0x4d2/0x600 [i915]
Nov 09 16:11:01 fedora kernel:  i915_gem_do_execbuffer+0x46f/0x1140 [i915]
Nov 09 16:11:01 fedora kernel:  i915_gem_execbuffer2_ioctl+0x125/0x250 [i915]
Nov 09 16:11:01 fedora kernel:  ? __pfx_i915_gem_execbuffer2_ioctl+0x10/0x10 [i915]
Nov 09 16:11:01 fedora kernel:  drm_ioctl_kernel+0xab/0x100
Nov 09 16:11:01 fedora kernel:  drm_ioctl+0x2a8/0x550
Nov 09 16:11:01 fedora kernel:  ? __pfx_i915_gem_execbuffer2_ioctl+0x10/0x10 [i915]
Nov 09 16:11:01 fedora kernel:  __x64_sys_ioctl+0x94/0xe0
Nov 09 16:11:01 fedora kernel:  do_syscall_64+0x7e/0x250
Nov 09 16:11:01 fedora kernel:  ? __sys_recvmsg+0xca/0xe0
Nov 09 16:11:01 fedora kernel:  ? do_syscall_64+0xb6/0x250
Nov 09 16:11:01 fedora kernel:  ? __smp_call_single_queue+0xdb/0x190
Nov 09 16:11:01 fedora kernel:  ? __sys_recvmsg+0xca/0xe0
Nov 09 16:11:01 fedora kernel:  ? __task_pid_nr_ns+0x67/0x110
Nov 09 16:11:01 fedora kernel:  ? do_syscall_64+0xb6/0x250
Nov 09 16:11:01 fedora kernel:  ? do_syscall_64+0xb6/0x250
Nov 09 16:11:01 fedora kernel:  ? sched_clock+0x10/0x30
Nov 09 16:11:01 fedora kernel:  ? sched_clock_cpu+0xb/0x30
Nov 09 16:11:01 fedora kernel:  ? irqtime_account_irq+0x3c/0xc0
Nov 09 16:11:01 fedora kernel:  ? handle_softirqs+0x1ae/0x340
Nov 09 16:11:01 fedora kernel:  ? irqentry_exit_to_user_mode+0x2c/0x1c0
Nov 09 16:11:01 fedora kernel:  entry_SYSCALL_64_after_hwframe+0x76/0x7e
Nov 09 16:11:01 fedora kernel: RIP: 0033:0x7f8a7f2ce0ed
Nov 09 16:11:01 fedora kernel: Code: 04 25 28 00 00 00 48 89 45 c8 31 c0 48 8d 45 10 c7 45 b0 10 00 00 00 48 89 45 b8 48 8d 45 d0 48 89 45 c0 b8 10 00 00 00 0f 05 <89> c2 3d 00 f0 ff ff 77 1a 48 8b 45 c8 64 48 2b 04 25 28 00 00 00
Nov 09 16:11:01 fedora kernel: RSP: 002b:00007ffd8bd48020 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Nov 09 16:11:01 fedora kernel: RAX: ffffffffffffffda RBX: 00005639ba253970 RCX: 00007f8a7f2ce0ed
Nov 09 16:11:01 fedora kernel: RDX: 00007ffd8bd480a0 RSI: 0000000040406469 RDI: 000000000000000c
Nov 09 16:11:01 fedora kernel: RBP: 00007ffd8bd48070 R08: 00005639ba1d2560 R09: 0000000000000001
Nov 09 16:11:01 fedora kernel: R10: 00000000000004f0 R11: 0000000000000246 R12: 00005639bcea4b80
Nov 09 16:11:01 fedora kernel: R13: 000000000000000c R14: 00007f8a5ffba090 R15: 00005639ba1d257c
Nov 09 16:11:01 fedora kernel:  </TASK>
Nov 09 16:11:01 fedora kernel: Modules linked in: rfcomm snd_seq_dummy snd_hrtimer nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nf_tables qrtr bnep sunrpc binfmt_misc vfat fat snd_ctl_led snd_soc_skl_hda_dsp snd_soc_intel_sof_board_helpers snd_sof_probes snd_soc_intel_hda_dsp_common snd_hda_codec_intelhdmi snd_hda_codec_alc269 snd_hda_scodec_component snd_hda_codec_realtek_lib snd_hda_codec_generic snd_soc_dmic snd_hda_intel snd_sof_pci_intel_mtl snd_sof_intel_hda_generic soundwire_intel snd_sof_intel_hda_sdw_bpt snd_sof_intel_hda_common snd_soc_hdac_hda snd_sof_intel_hda_mlink snd_sof_intel_hda snd_hda_codec_hdmi soundwire_cadence iwlmvm snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_sof_utils snd_hda_ext_core intel_uncore_frequency snd_hda_codec intel_uncore_frequency_common mac80211 x86_pkg_temp_thermal intel_powerclamp snd_hda_core coretemp r8153_ecm snd_intel_dspcfg cdc_ether snd_intel_sdw_acpi usbnet
Nov 09 16:11:01 fedora kernel:  snd_soc_acpi_intel_match snd_soc_acpi_intel_sdca_quirks soundwire_generic_allocation kvm_intel snd_soc_acpi crc8 soundwire_bus snd_soc_sdca libarc4 snd_usb_audio snd_soc_core kvm snd_usbmidi_lib snd_hwdep snd_ump snd_compress ac97_bus snd_rawmidi snd_pcm_dmaengine uvcvideo snd_seq iwlwifi btusb snd_seq_device btrtl snd_hda_scodec_cs35l41_i2c snd_hda_scodec_cs35l41_spi snd_hda_scodec_cs35l41 uvc snd_pcm btintel videobuf2_vmalloc spi_nor irqbypass videobuf2_memops snd_soc_cs_amp_lib rapl processor_thermal_device_pci videobuf2_v4l2 iTCO_wdt btbcm snd_soc_cs35l41_lib hid_sensor_als intel_pmc_bxt processor_thermal_device mei_gsc_proxy mtd spd5118 iTCO_vendor_support intel_rapl_msr regmap_spi r8152 videobuf2_common btmtk cs_dsp snd_timer hp_wmi hid_sensor_trigger intel_cstate processor_thermal_wt_hint cfg80211 bluetooth intel_uncore platform_profile wmi_bmof mii hid_sensor_iio_common platform_temperature_control videodev mei_me industrialio_triggered_buffer kfifo_buf processor_thermal_rfim snd mei pcspkr
Nov 09 16:11:01 fedora kernel:  spi_intel_pci i2c_i801 processor_thermal_rapl idma64 industrialio spi_intel i2c_smbus mc intel_rapl_common rfkill soundcore processor_thermal_wt_req processor_thermal_power_floor igen6_edac intel_pmc_core processor_thermal_mbox int3403_thermal pmt_telemetry int340x_thermal_zone pmt_discovery serial_multi_instantiate pmt_class intel_hid int3400_thermal sparse_keymap acpi_tad intel_pmc_ssram_telemetry acpi_thermal_rel joydev acpi_pad dm_multipath loop nfnetlink zram lz4hc_compress lz4_compress dm_crypt hid_jabra xe drm_ttm_helper drm_suballoc_helper gpu_sched drm_gpuvm drm_exec hid_sensor_hub intel_ishtp_hid i915 ucsi_acpi typec_ucsi typec i2c_algo_bit drm_buddy ttm nvme drm_display_helper nvme_core intel_ish_ipc spi_pxa2xx_platform hid_multitouch nvme_keyring dw_dmac polyval_clmulni thunderbolt intel_vpu ghash_clmulni_intel intel_ishtp cec spi_pxa2xx_core video nvme_auth i2c_hid_acpi intel_vsec i2c_hid wmi pinctrl_meteorlake serio_raw scsi_dh_rdac scsi_dh_emc scsi_dh_alua i2c_dev fuse
Nov 09 16:11:01 fedora kernel: CR2: fffffeffc0d397c0
Nov 09 16:11:01 fedora kernel: ---[ end trace 0000000000000000 ]---
Nov 09 16:11:01 fedora kernel: RIP: 0010:find_fw_domain+0x39/0xf0 [i915]
Nov 09 16:11:01 fedora kernel: Code: 03 00 77 04 44 03 47 24 41 8b 72 38 31 d2 39 f2 73 4c 89 f0 29 d0 d1 e8 8d 0c 10 48 8d 3c 49 48 89 c8 49 8b 4a 30 4c 8d 0c b9 <45> 3b 01 72 36 45 39 41 04 72 22 4d 85 c9 74 24 41 8b 41 08 41 8b
Nov 09 16:11:01 fedora kernel: RSP: 0018:ffffd1e0ceac35a0 EFLAGS: 00010002
Nov 09 16:11:01 fedora kernel: RAX: 0000000000000010 RBX: 0000000000000000 RCX: fffffeffc0d39700
Nov 09 16:11:01 fedora kernel: RDX: 0000000000000000 RSI: 0000000000000020 RDI: 0000000000000030
Nov 09 16:11:01 fedora kernel: RBP: ffff88f5e0fac2a0 R08: 000000000000a500 R09: fffffeffc0d397c0
Nov 09 16:11:01 fedora kernel: R10: ffff88f5e0f92748 R11: ffff88f5cf54bc70 R12: ffff88f5e0fac028
Nov 09 16:11:01 fedora kernel: R13: 0000000000000001 R14: ffff88f5e0f92748 R15: ffff88f5cf54bc00
Nov 09 16:11:01 fedora kernel: FS:  00007f8a7a4de680(0000) GS:ffff88fd8ebc4000(0000) knlGS:0000000000000000
Nov 09 16:11:01 fedora kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Nov 09 16:11:01 fedora kernel: CR2: fffffeffc0d397c0 CR3: 000000015b0ba003 CR4: 0000000000f72ef0
Nov 09 16:11:01 fedora kernel: PKRU: 55555554
Nov 09 16:11:01 fedora kernel: note: cinnamon[2324] exited with irqs disabled
Nov 09 16:11:01 fedora kernel: note: cinnamon[2324] exited with preempt_count 1
Nov 09 16:11:03 fedora abrt-dump-journal-oops[1606]: abrt-dump-journal-oops: Found oopses: 1
Nov 09 16:11:03 fedora abrt-dump-journal-oops[1606]: abrt-dump-journal-oops: Creating problem directories
Nov 09 16:11:03 fedora abrt-server[4352]: Can't find a meaningful backtrace for hashing in '.'
Nov 09 16:11:03 fedora abrt-server[4352]: Deleting non-reportable oops '.' because DropNotReportableOopses is set to 'yes'
Nov 09 16:11:03 fedora abrt-server[4352]: 'post-create' on '/var/spool/abrt/oops-2025-11-09-16:11:03-1606-0' exited with 1
Nov 09 16:11:03 fedora abrt-server[4352]: Deleting problem directory '/var/spool/abrt/oops-2025-11-09-16:11:03-1606-0'
Nov 09 16:11:03 fedora abrt-server[4352]: Lock file '.lock' was locked by process 4362, but it crashed?
Nov 09 16:11:04 fedora abrt-dump-journal-oops[1606]: Reported 1 kernel oopses to Abrt

Is it helpful to understand why it happens?

Also, that session was never suspended. Today I booted my laptop after it was properly shut down. Why there is intel_uncore_forcewake_for_reg in the call stack? From what state it tried to wake?

This is our old friend i915 soiling the bed once again. It would appear to be a bug in the resumption/wakeup code given the modules which were active in the stack trace.

Long story short - I915 bug with your meteor lake hardware.

I’m led to believe that the xe drivers are the replacement for i915. They are apparently more performant too, so given the number of issues you’re seeing on this kit with the i915 driver, perhaps trying the Xe drivers might calm stuff down a bit. The counter argument is that they are not yet fully stable and so might be even worse, but you never know until you try.

/etc/modprobe.d/disable-i915.conf:

options i915 force_probe=*
blacklist i915
options xe force_probe=*

This should (I can’t test) disable i915 and switch on the xe driver. If it’s no good, just revert the file to the original contents.

That’s not a wake-up from suspend - it’s a “give the graphics drivers a kick, we want to use it to play a video” or do whatever you were trying to do when you got the oops. Cinnamon was trying to call an openGL/vulkan command, which may have been as simple as “repaint this window content” or as complex as render this animation of a menu fly-out.

I’ve added i915.enable_dc=0 to the kernel arguments. Hope this workaround will work. If it doesn’t I will report here.