NVIDIA driver crashes system

I’ve been trying for a while to get NVIDIA working on my laptop with a recent version of GEForce but using akmod, I consistently have 1 or 2 boots that work before the system either freezes shortly after logging in or I get kernel panic just turning the computer on. I’m not sure if this is just an NVIDIA problem with my specific hardware because when I tried Ubuntu using their automated install of NVIDIA drivers, I got the exact same behavior. I’m happy to provide exact specs of my GPU, laptop, etc. I’m at a loss for what to try because it seems like even rebooting is sometimes enough to cause the next boot not to work. Any ideas?

When reporting issues it can be helpful to provide enough detail for others with similar hardware to reproduce the issue. A useful way to get hardware details as plain text is to run inxi -Fzxx in a terminal (post the output as pre-formatted text using the </> button from the top line of the text entry panel) so it can be found with web searches.

You should run some hardware tests, starting with RAM and mass storage. Fedora provides memtest86+, which is a standalone tester so you have to choose it from the grub2 menu at boot time, and is best used for a couple overnight tests. Many vendors have diagnostic tests for their systems, but often as Windows programs. You should also use the S.M.A.R.T tests (using, e.g., Gnome Disks) or the command-line tool.

A marginal power supply can cause the symptoms you report. Some power supplies loose capacity with age, so if your system’s power requirements are close to the capacity of the supply, you may need to upgrade the power supply.

Hi @gnwiii, I’ll boot up my live CD later tonight to provide the output because I can’t use any of the kernels currently on the system to successfully boot.

I’m confident that it is not hardware only because Windows on the same machine works perfectly fine, I’m using the power supply that came with the laptop, and it is a very new machine. I believe it is a configuration issue or just a mismatch between the version of the NVIDIA driver I’m using and the kernel.

It appears that the graphics setup has two cards. One is NVIDIA GeForce RTX 4060 and the other is Intel UHD.

12System:
  12Kernel 6.11.4-301.fc41.x86_64 12arch x86_64 12bits 64 12compiler gcc 12v 14.2.1
  12Console N/A 12wm gnome-shell 12Distro Fedora Linux 41 (Workstation Edition)
12Machine:
  12Type Laptop 12System Dell 12product Dell G16 7630 12v N/A 12serial <filter> 12Chassis 12type 10
    12serial <filter>
  12Mobo Dell 12model 0GT7NV 12v A00 12serial <filter> 12part-nu 0BFA 12UEFI Dell 12v 1.23.0 12date 03/04/2025
12Battery:
  12ID-1 BAT0 12charge 73.1 Wh (100.0%) 12condition 73.1/84.3 Wh (86.7%) 12volts 12.0 12min 11.4
    12model BYD DELL M59JH33 12serial <filter> 12status not charging
12CPU:
  12Info 14-core (6-mt/8-st) 12model 13th Gen Intel Core i7-13650HX 12bits 64 12type MST AMCP
    12arch Raptor Lake 12rev 1 12cache 12L1 1.2 MiB 12L2 11.5 MiB 12L3 24 MiB
  12Speed (MHz) 12avg 800 12min/max 800/4700:4900:3600 12cores 121 800 122 800 123 800 124 800 125 800 126 800 127 800
    128 800 129 800 1210 800 1211 800 1212 800 1213 800 1214 800 1215 800 1216 800 1217 800 1218 800 1219 800 1220 800
    12bogomips 112127
  12Flags avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 ssse3 vmx
12Graphics:
  12Device-1 Intel Raptor Lake-S UHD Graphics 12vendor Dell 12driver i915 12v kernel 12arch Xe 12ports
    12active DP-1,eDP-1 12empty none 12bus-ID 0000:00:02.0 12chip-ID 8086:a78b
  12Device-2 NVIDIA AD107M [GeForce RTX 4060 Max-Q / Mobile] 12vendor Dell 12driver nouveau 12v kernel
    12arch Lovelace 12ports 12active none 12empty HDMI-A-1,eDP-2 12bus-ID 0000:01:00.0 12chip-ID 10de:28e0
  12Device-3 Logitech Brio 101 12driver snd-usb-audio,uvcvideo 12type USB 12rev 2.0 12speed 480 Mb/s
    12lanes 1 12bus-ID 1-11.2.1:10 12chip-ID 046d:094d
  12Device-4 Realtek Integrated_Webcam_HD 12driver uvcvideo 12type USB 12rev 2.0 12speed 480 Mb/s 12lanes 1
    12bus-ID 1-3:2 12chip-ID 0bda:5557
  12Display 12server X.org 12v 1.21.1.16 12with Xwayland 12v 24.1.6 12compositor gnome-shell 12driver 12X
    12loaded modesetting,nvidia 12alternate fbdev,nouveau,nv,vesa 12dri iris 12gpu i915 12display-ID :0
  12Monitor-1 DP-1 12model LG (GoldStar) FHD 12res 1920x1080 12dpi 82 12diag 685mm (27")
  12Monitor-2 eDP-1 12model AU Optronics 0x93a2 12res 2560x1600 12dpi 189 12diag 406mm (16")
  12API OpenGL 12Message GL data unavailable for root.
  12API Vulkan 12Message No Vulkan data available.
  12API EGL 12Message EGL data requires eglinfo. Check --recommends.
  12Info 12Tools 12api glxinfo,vulkaninfo 12gpu nvidia-settings,nvidia-smi 12x11 xdriinfo, xdpyinfo,
    xprop, xrandr
12Audio:
  12Device-1 Intel Raptor Lake High Definition Audio 12vendor Dell 12driver sof-audio-pci-intel-tgl
    12bus-ID 0000:00:1f.3 12chip-ID 8086:7a50
  12Device-2 NVIDIA AD107 High Definition Audio 12driver snd_hda_intel 12v kernel 12bus-ID 0000:01:00.1
    12chip-ID 10de:22be
  12Device-3 C-Media Audio Adapter (Unitek Y-247A) 12driver cmedia_hs100b,snd-usb-audio,usbhid
    12type USB 12rev 1.1 12speed 12 Mb/s 12lanes 1 12bus-ID 1-11.1:7 12chip-ID 0d8c:0014
  12Device-4 Logitech Brio 101 12driver snd-usb-audio,uvcvideo 12type USB 12rev 2.0 12speed 480 Mb/s
    12lanes 1 12bus-ID 1-11.2.1:10 12chip-ID 046d:094d
  12API ALSA 12v k6.11.4-301.fc41.x86_64 12status kernel-api
  12Server-1 JACK 12v 1.9.22 12status off
  12Server-2 PipeWire 12v 1.2.7 12status n/a (root, process) 12with 121 pipewire-pulse 12status active
    122 wireplumber 12status active 123 pipewire-alsa 12type plugin 124 pw-jack 12type plugin
12Network:
  12Device-1 Intel Raptor Lake-S PCH CNVi WiFi 12driver iwlwifi 12v kernel 12bus-ID 0000:00:14.3
    12chip-ID 8086:7a70
  12IF wlp0s20f3 12state down 12mac <filter>
  12Device-2 Realtek RTL8111/8168/8211/8411 PCI Express Gigabit Ethernet 12vendor Dell 12driver r8169
    12v kernel 12port 3000 12bus-ID 0000:02:00.0 12chip-ID 10ec:8168
  12IF enp2s0 12state down 12mac <filter>
  12Device-3 Realtek RTL8153 Gigabit Ethernet Adapter 12driver r8152 12type USB 12rev 3.2 12speed 5 Gb/s
    12lanes 1 12bus-ID 2-7.2:5 12chip-ID 0bda:8153
  12IF enp0s20f0u7u2 12state up 12speed 1000 Mbps 12duplex full 12mac <filter>
12Bluetooth:
  12Device-1 Intel AX201 Bluetooth 12driver btusb 12v 0.8 12type USB 12rev 2.0 12speed 12 Mb/s 12lanes 1
    12bus-ID 1-14:6 12chip-ID 8087:0026
  12Report btmgmt 12ID hci0 12rfk-id 0 12state up 12address <filter> 12bt-v 5.2 12lmp-v 11
12RAID:
  12Hardware-1 Intel Volume Management Device NVMe RAID Controller Intel 12driver vmd 12v 0.6
    12bus-ID 0000:00:0e.0 12chip-ID 8086:a77f
12Drives:
  12Local Storage 12total 1011.64 GiB 12used 89.46 GiB (8.8%)
  12ID-1 /dev/nvme0n1 12vendor SK Hynix 12model BC901 NVMe 1024GB 12size 953.87 GiB 12speed 63.2 Gb/s
    12lanes 4 12serial <filter> 12temp 55.9 C
  12ID-2 /dev/sda 12vendor Memorex 12model USB Flash Drive 12size 57.77 GiB 12type USB 12rev 3.2 12spd 5 Gb/s
    12lanes 1 12serial <filter>
12Partition:
  12ID-1 / 12size 138.33 GiB 12used 22.3 GiB (16.1%) 12fs xfs 12dev /dev/dm-1 12mapped root
  12ID-2 /boot 12size 914.2 MiB 12used 537.3 MiB (58.8%) 12fs xfs 12dev /dev/nvme0n1p7
  12ID-3 /home 12size 207.01 GiB 12used 66.63 GiB (32.2%) 12fs xfs 12dev /dev/dm-0 12mapped home
12Swap:
  12ID-1 swap-1 12type zram 12size 8 GiB 12used 0 KiB (0.0%) 12priority 100 12dev /dev/zram0
12Sensors:
  12System Temperatures 12cpu 72.0 C 12mobo 55.8 C 12sodimm 55.0 C
  12Fan Speeds (rpm) 12cpu 0
12Info:
  12Memory 12total 16 GiB 12available 15.3 GiB 12used 3.81 GiB (24.9%) 12igpu 64 MiB
  12Processes 546 12Power 12uptime 11m 12wakeups 0 12Init systemd 12v 256 12default graphical
  12Packages 12pm rpm 12pkgs N/A 12note see --rpm 12pm flatpak 12pkgs 9 12Compilers 12gcc 14.2.1 12Client Sudo
    12v 1.9.15p5 12inxi 3.3.38

You are using the nouveau open-source driver. For recently introduced cards, you should look for a driver from rpmfusion.

I know for a fact that I am not using the nouveau driver due to the fact that the kernel panic has a stack trace referencing the NVIDIA driver, but it may appear that way given that the LiveCD that I used to generate the output probably is using nouveau. As I previously stated, all of the kernels, including the rescue kernel, panic when I try to boot into them with an NVIDIA stack trace so I can only provide the output of that script from the LiveCD.

Without any info directly from the crashed system you might as well just do a reinstall as the info from the livecd is useless.

you can browse the logs by mounting your offline system in the live session with gnome-disks, then run journalctl --directory /run/media/liveuser/root/root/var/log/journal/ (exact path may differ on your system)

my next guess would be to edit the grub commands on next boot from hard disk by pressing “e” on the GRUB boot screen and replace the values from rd.driver.blacklist= and modprobe.blacklist= from nouveau to nvidia, remove the nvidia-drm.modeset entirely and see if the system boots up correctly…

1 Like

@truster I have already tried replacing the nouveau blacklist with nvidia on previous installs and this immediately resolves the problem, another reason I’m sure this is an NVIDIA problem. Just to give a few other things I’ve tried, reinstalling the kernel does not help, reinstalling the akmod does not help, and using the open NVIDIA kmod does not help. All of these things result in the same behavior. I’ve tried capturing the kernel panic to post here but I think the best I can do is a picture of the screen. What would you like to see from the logs? Would a picture of the kernel panic be helpful?

You should be able to view details of kernel panics using journalctl --no-hostname -b [-N] -p [P] (in terminal read man journalctl for option details. You can also search for a string using -g [string]. If you are booting with a LIve USB you can use the -D [directory] option, where “directory” is <mount_point>/var/log/journalctl/<long hex ID string>.