RTX 3060 Mobile not initializing — RmInitAdapter failed / GSP firmware load errors on Fedora 42 (kernel 6.17.4, driver 580.95.05)

I’m running Fedora 42 with an NVIDIA GeForce RTX 3060 Mobile (GA106M / Max-Q) GPU, and I’m unable to get the proprietary NVIDIA driver working on kernel 6.17.x.
The system boots to a black screen or fails to load the GPU with RmInitAdapter failed errors in dmesg.

[ 63.829430] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100, regvalue: 0xbadf5620, error code: Unknown SYS_PRI_ERROR_CODE
[ 63.829432] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100, regvalue: 0xbadf5620, error code: Unknown SYS_PRI_ERROR_CODE
[ 63.829434] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100, regvalue: 0xbadf5620, error code: Unknown SYS_PRI_ERROR_CODE
[ 63.829436] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100, regvalue: 0xbadf5620, error code: Unknown SYS_PRI_ERROR_CODE
[ 63.829438] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100, regvalue: 0xbadf5620, error code: Unknown SYS_PRI_ERROR_CODE
[ 63.829441] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100, regvalue: 0xbadf5620, error code: Unknown SYS_PRI_ERROR_CODE
[ 63.829444] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100, regvalue: 0xbadf5620, error code: Unknown SYS_PRI_ERROR_CODE
[ 63.829446] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100, regvalue: 0xbadf5620, error code: Unknown SYS_PRI_ERROR_CODE
[ 63.829448] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100, regvalue: 0xbadf5620, error code: Unknown SYS_PRI_ERROR_CODE
[ 63.829451] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100, regvalue: 0xbadf5620, error code: Unknown SYS_PRI_ERROR_CODE
[ 63.829453] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100, regvalue: 0xbadf5620, error code: Unknown SYS_PRI_ERROR_CODE
[ 63.829458] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100, regvalue: 0xbadf5620, error code: Unknown SYS_PRI_ERROR_CODE
[ 63.829463] NVRM: kflcnWaitForHalt_TU102: Timeout waiting for Falcon to halt
[ 63.829466] NVRM: gpuWaitForGfwBootComplete_TU102: GSP failed to halt with GFW_BOOT: (progress 0xff)
[ 63.829467] NVRM: kgspWaitForGfwBootOk_TU102: failed to wait for GFW boot complete: 0x65 VBIOS version 94.06.01.27.01
[ 63.829468] NVRM: kgspWaitForGfwBootOk_TU102: (the GPU may be in a bad state and may need to be reset)
[ 63.829471] NVRM: nvCheckOkFailedNoLog: Check failed: Call timed out [NV_ERR_TIMEOUT] (0x00000065) returned from kgspWaitForGfwBootOk_HAL(pGpu, pKernelGsp) @ kernel_gsp.c:3874
[ 63.829519] NVRM: RmInitAdapter: Cannot initialize GSP firmware RM
[ 63.830743] NVRM: GPU 0000:01:00.0: RmInitAdapter failed! (0x62:0x65:2015)
[ 63.832298] NVRM: GPU 0000:01:00.0: rm_init_adapter failed, device minor number 0

uname -r
6.17.4-200.fc42.x86_64
➜ ~ lspci -nn | grep -E “VGA|3D”
01:00.0 VGA compatible controller [0300]: NVIDIA Corporation GA106M [GeForce RTX 3060 Mobile / Max-Q] [10de:2520] (rev a1)

mokutil --sb-state
SecureBoot disabled
Platform is in Setup Mode

rpm -qa | grep nvidia
nvidia-modprobe-580.95.05-1.fc42.x86_64
xorg-x11-drv-nvidia-cuda-libs-580.95.05-1.fc42.x86_64
xorg-x11-drv-nvidia-libs-580.95.05-1.fc42.x86_64
nvidia-persistenced-580.95.05-1.fc42.x86_64
xorg-x11-drv-nvidia-kmodsrc-580.95.05-1.fc42.x86_64
nvidia-settings-580.95.05-1.fc42.x86_64
xorg-x11-drv-nvidia-580.95.05-1.fc42.x86_64
akmod-nvidia-580.95.05-1.fc42.x86_64
xorg-x11-drv-nvidia-cuda-580.95.05-1.fc42.x86_64
xorg-x11-drv-nvidia-power-580.95.05-1.fc42.x86_64
kmod-nvidia-6.17.4-200.fc42.x86_64-580.95.05-1.fc42.x86_64
nvidia-gpu-firmware-20251021-1.fc42.noarch

What I have already tried
Removed and reinstalled all NVIDIA packages (akmod-nvidia, xorg-x11-drv-nvidia, etc.)
Rebuilt kernel modules manually:
sudo akmods --force --kernels $(uname -r)
Verified firmware presence
Rebuilt initramfs and confirmed firmware inclusion

  • Tried disabling GSP firmware via:
  • /etc/modprobe.d/nvidia.confoptions nvidia NVreg_EnableGpuFirmware=0
  • Kernel parameter nvidia.NVreg_EnableGpuFirmware=0
  • Booted with previous kernel 6.16.12-200.fc42.x86_64 (same issue)
  • Disabled fedora-nvidia repo, only kept rpmfusion-nonfree-nvidia-driver active.

Current behavior
The NVIDIA module loads partially but fails GSP initialization every time

nvidia-smi outputs No devices were found

The /sys/module/nvidia/parameters/NVreg_EnableGpuFirmware file does not exist.

Please show us the output of dnf list --installed \*nvidia\* and post that as preformatted text so it remains formatted exactly as seen on your screen. This is done by pasting the text, then highlight it and clidk the </> button on the toolbar of the text entry window.
This output will show us exactly what version as well as what repo the packages are installed from (and is sorted). The rpm info you provided shows the packages but not the source.

Also show us the output of dnf repolist so we can see the enabled repos for this.
Also of interest would be cat /proc/cmdline

A full removal of all nvidia packages is done easily with sudo dnf remove \*nvidia\* --exclude nvidia-gpu-firmware

I have an RTX 3050 and am not using those kernel parameters. Please remove them from both the kernel command line and /etc/modprobe.d/ before continuing.

$ lspci | grep -i vga
06:00.0 VGA compatible controller: NVIDIA Corporation GA107 [GeForce RTX 3050 6GB] (rev a1)


$ cat /proc/cmdline
BOOT_IMAGE=(hd6,gpt2)/vmlinuz-6.17.4-200.fc42.x86_64 root=/dev/mapper/fedora_raptor-root ro rd.lvm.lv=fedora_raptor/root rhgb quiet rd.driver.blacklist=nouveau,nova_core modprobe.blacklist=nouveau,nova_core

It is also quite possible that there is a hardware problem. You did not tell us what your system is (inxi -Fzxx would do that for us) but you did show mobile so I suspect it is a laptop.

dnf list --installed \*nvidia\* 
Installed packages
akmod-nvidia.x86_64                       3:580.95.05-1.fc42 rpmfusion-nonfree-n
kmod-nvidia-6.17.4-200.fc42.x86_64.x86_64 3:580.95.05-1.fc42 @commandline
nvidia-gpu-firmware.noarch                20251021-1.fc42    updates
nvidia-modprobe.x86_64                    3:580.95.05-1.fc42 rpmfusion-nonfree-n
nvidia-persistenced.x86_64                3:580.95.05-1.fc42 rpmfusion-nonfree-n
nvidia-settings.x86_64                    3:580.95.05-1.fc42 rpmfusion-nonfree-n
xorg-x11-drv-nvidia.x86_64                3:580.95.05-1.fc42 rpmfusion-nonfree-n
xorg-x11-drv-nvidia-cuda.x86_64           3:580.95.05-1.fc42 rpmfusion-nonfree-n
xorg-x11-drv-nvidia-cuda-libs.x86_64      3:580.95.05-1.fc42 rpmfusion-nonfree-n
xorg-x11-drv-nvidia-kmodsrc.x86_64        3:580.95.05-1.fc42 rpmfusion-nonfree-n
xorg-x11-drv-nvidia-libs.x86_64           3:580.95.05-1.fc42 rpmfusion-nonfree-n
xorg-x11-drv-nvidia-power.x86_64          3:580.95.05-1.fc42 rpmfusion-nonfree-n

cat /proc/cmdline
BOOT_IMAGE=(hd0,gpt2)/vmlinuz-6.17.4-200.fc42.x86_64 root=UUID=b359d009-13a2-448e-b5ac-3e9aa0bf9d44 ro rootflags=subvol=root rhgb quiet modprobe.blacklist=nouveau rd.driver.blacklist=nouveau pcie_aspm=off nvidia_drm.modset=1
lspci | grep -i vga
01:00.0 VGA compatible controller: NVIDIA Corporation GA106M [GeForce RTX 3060 Mobile / Max-Q] (rev a1)

inxi -Fzxx
System:
  Kernel: 6.17.4-200.fc42.x86_64 arch: x86_64 bits: 64 compiler: gcc v: 15.2.1
  Desktop: GNOME v: 48.6 tk: GTK v: 3.24.49 wm: gnome-shell dm: GDM
    Distro: Fedora Linux 42 (Workstation Edition)
Machine:
  Type: Desktop Mobo: N/A model: N/A serial: <superuser required>
    UEFI: American Megatrends LLC. v: 5.27 date: 05/20/2025
CPU:
  Info: 16-core (8-mt/8-st) model: 13th Gen Intel Core i7-13700HX bits: 64
    type: MST AMCP arch: Raptor Lake rev: 2 cache: L1: 1.4 MiB L2: 14 MiB
    L3: 30 MiB
  Speed (MHz): avg: 4410 min/max: 800/4800:5000:3700 cores: 1: 4410 2: 4410
    3: 4410 4: 4410 5: 4410 6: 4410 7: 4410 8: 4410 9: 4410 10: 4410 11: 4410
    12: 4410 13: 4410 14: 4410 15: 4410 16: 4410 17: 4410 18: 4410 19: 4410
    20: 4410 21: 4410 22: 4410 23: 4410 24: 4410 bogomips: 110592
  Flags-basic: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 ssse3 vmx
Graphics:
  Device-1: NVIDIA GA106M [GeForce RTX 3060 Mobile / Max-Q] driver: nvidia
    v: 580.95.05 arch: Ampere pcie: speed: 16 GT/s lanes: 16 bus-ID: 01:00.0
    chip-ID: 10de:2520
  Device-2: Fic 48MP USB Camera driver: uvcvideo type: USB rev: 2.0
    speed: 480 Mb/s lanes: 1 bus-ID: 1-2:3 chip-ID: 32e4:4802
  Display: wayland server: X.Org v: 24.1.8 with: Xwayland v: 24.1.8
    compositor: gnome-shell driver: dri: swrast gpu: nvidia display-ID: :0
    screens: 1
  Screen-1: 0 s-res: 3440x1440 s-dpi: 96
  Monitor-1: Unknown-1 mapped: None-1 res: 3440x1440 hz: 60 dpi: 96
  API: OpenGL v: 4.5 vendor: mesa v: 25.1.9 glx-v: 1.4 es-v: 3.2
    direct-render: yes renderer: llvmpipe (LLVM 20.1.8 256 bits)
    device-ID: ffffffff:ffffffff
  API: EGL Message: EGL data requires eglinfo. Check --recommends.
  Info: Tools: api: glxinfo gpu: nvidia-settings,nvidia-smi x11: xdriinfo,
    xdpyinfo, xprop, xrandr
Audio:
  Device-1: Intel Raptor Lake High Definition Audio driver: snd_hda_intel
    v: kernel bus-ID: 00:1f.3 chip-ID: 8086:7a50
  Device-2: NVIDIA GA106 High Definition Audio driver: snd_hda_intel
    v: kernel pcie: speed: 16 GT/s lanes: 16 bus-ID: 01:00.1 chip-ID: 10de:228e
  API: ALSA v: k6.17.4-200.fc42.x86_64 status: kernel-api
  Server-1: PipeWire v: 1.4.9 status: active with: 1: pipewire-pulse
    status: active 2: wireplumber status: active 3: pipewire-alsa type: plugin
    4: pw-jack type: plugin
Network:
  Device-1: Intel Ethernet I226-V driver: igc v: kernel pcie: speed: 5 GT/s
    lanes: 1 port: N/A bus-ID: 03:00.0 chip-ID: 8086:125c
  IF: enp3s0 state: down mac: <filter>
  Device-2: Intel Ethernet I226-V driver: igc v: kernel pcie: speed: 5 GT/s
    lanes: 1 port: N/A bus-ID: 04:00.0 chip-ID: 8086:125c
  IF: enp4s0 state: up speed: 1000 Mbps duplex: full mac: <filter>
Drives:
  Local Storage: total: 238.47 GiB used: 12.93 GiB (5.4%)
  ID-1: /dev/nvme0n1 vendor: Intel model: HBRPEKNX0101AH size: 238.47 GiB
    speed: 15.8 Gb/s lanes: 2 serial: <filter> temp: 44.9 C
Partition:
  ID-1: / size: 236.89 GiB used: 12.32 GiB (5.2%) fs: btrfs
    dev: /dev/nvme0n1p3
  ID-2: /boot size: 973.4 MiB used: 609.2 MiB (62.6%) fs: ext4
    dev: /dev/nvme0n1p2
  ID-3: /boot/efi size: 598.8 MiB used: 19.3 MiB (3.2%) fs: vfat
    dev: /dev/nvme0n1p1
  ID-4: /home size: 236.89 GiB used: 12.32 GiB (5.2%) fs: btrfs
    dev: /dev/nvme0n1p3
Swap:
  ID-1: swap-1 type: zram size: 8 GiB used: 0 KiB (0.0%) priority: 100
    dev: /dev/zram0
Sensors:
  System Temperatures: cpu: 55.2 C mobo: N/A
  Fan Speeds (rpm): N/A
Info:
  Memory: total: 32 GiB available: 31.17 GiB used: 3.43 GiB (11.0%)
  Processes: 479 Power: uptime: 4m wakeups: 0 Init: systemd v: 257
    target: graphical (5) default: graphical
  Packages: pm: rpm pkgs: N/A note: see --rpm Compilers: gcc: 15.2.1
    Shell: Zsh v: 5.9 running-in: ptyxis-agent inxi: 3.3.39

It is actually an industrial PC

Looking at that kernel command line I have a couple suggestions that likely will fix the issues.
Some of the parameters are not compatible with the newer drivers.

  1. Remove the old parameters
    sudo grubby --update-kernel=ALL --remove-args='modprobe.blacklist=nouveau rd.driver.blacklist=nouveau nvidia_drm.modset=1'
  2. Add updated parameters
    sudo grubby --update-kernel=ALL --args='modprobe.blacklist=nouveau,nova_core rd.driver.blacklist=nouveau,nova_core'
  3. reboot

You can put a specific kernel version in place of ALL if you only want the change to apply to one kernel version. The booted kernel could be updated by using $(uname -r)

I also suggest that you run modinfo -l nvidia after rebooting to see if it shows (as it should)

$ modinfo -l nvidia
Dual MIT/GPL

Having the driver loaded properly should also fix the problem in inxi about Monitor1 being Unknown-1 and more.

I updated the kernel parameters

 cat /proc/cmdline
BOOT_IMAGE=(hd0,gpt2)/vmlinuz-6.17.4-200.fc42.x86_64 root=UUID=b359d009-13a2-448e-b5ac-3e9aa0bf9d44 ro rootflags=subvol=root rhgb quiet pcie_aspm=off modprobe.blacklist=nouveau,nova_core rd.driver.blacklist=nouveau,nova_core

output from modinfo -l nvidia

Dual MIT/GPL

but when I run nvidia-smi

nvidia-smi                                 
No devices were found

On dmesg I still have NVRM: GPU 0000:01:00.0: rm_init_adapter failed, device minor number 0

[   77.741536] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741538] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741540] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741542] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741544] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741546] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741548] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741551] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741553] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741555] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741557] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741559] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741561] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741563] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741565] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741567] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741569] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741571] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741573] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741575] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741578] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741580] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741582] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741584] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741586] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741588] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741590] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741592] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741594] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741596] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741598] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741602] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   77.741606] NVRM: kflcnWaitForHalt_TU102: Timeout waiting for Falcon to halt
[   77.741609] NVRM: gpuWaitForGfwBootComplete_TU102: GSP failed to halt with GFW_BOOT: (progress 0xff)
[   77.741610] NVRM: kgspWaitForGfwBootOk_TU102: failed to wait for GFW boot complete: 0x65 VBIOS version 94.06.01.27.01
[   77.741611] NVRM: kgspWaitForGfwBootOk_TU102: (the GPU may be in a bad state and may need to be reset)
[   77.741613] NVRM: nvCheckOkFailedNoLog: Check failed: Call timed out [NV_ERR_TIMEOUT] (0x00000065) returned from kgspWaitForGfwBootOk_HAL(pGpu, pKernelGsp) @ kernel_gsp.c:3874
[   77.741660] NVRM: RmInitAdapter: Cannot initialize GSP firmware RM
[   77.743101] NVRM: GPU 0000:01:00.0: RmInitAdapter failed! (0x62:0x65:2015)
[   77.744279] NVRM: GPU 0000:01:00.0: rm_init_adapter failed, device minor number 0

Try disabling nvidia-persistenced.service. See https://docs.nvidia.com/deploy/driver-persistence/index.html.

I disabled it

systemctl status nvidia-persistenced --no-pager
○ nvidia-persistenced.service - NVIDIA Persistence Daemon
     Loaded: loaded (/usr/lib/systemd/system/nvidia-persistenced.service; disabled; preset: disabled)
    Drop-In: /usr/lib/systemd/system/service.d
             └─10-timeout-abort.conf
     Active: inactive (dead)

systemctl status nvidia-powerd --no-pager
○ nvidia-powerd.service
     Loaded: masked (Reason: Unit nvidia-powerd.service is masked.)
     Active: inactive (dead)

but are still the same errors.

nvidia-smi
No devices were found
[   96.363825] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   96.363827] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   96.363829] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   96.363832] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   96.363834] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   96.363836] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   96.363838] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   96.363840] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   96.363842] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   96.363844] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   96.363846] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   96.363848] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   96.363850] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   96.363853] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   96.363855] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   96.363857] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   96.363859] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   96.363861] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   96.363863] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   96.363865] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   96.363867] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   96.363869] NVRM: gpuHandleSanityCheckRegReadError_GM107: Possible bad register read: addr: 0x110100,  regvalue: 0xbadf5620,  error code: Unknown SYS_PRI_ERROR_CODE
[   96.363875] NVRM: kflcnWaitForHalt_TU102: Timeout waiting for Falcon to halt
[   96.363878] NVRM: gpuWaitForGfwBootComplete_TU102: GSP failed to halt with GFW_BOOT: (progress 0xff)
[   96.363880] NVRM: kgspWaitForGfwBootOk_TU102: failed to wait for GFW boot complete: 0x65 VBIOS version 94.06.01.27.01
[   96.363880] NVRM: kgspWaitForGfwBootOk_TU102: (the GPU may be in a bad state and may need to be reset)
[   96.363882] NVRM: nvCheckOkFailedNoLog: Check failed: Call timed out [NV_ERR_TIMEOUT] (0x00000065) returned from kgspWaitForGfwBootOk_HAL(pGpu, pKernelGsp) @ kernel_gsp.c:3874
[   96.363929] NVRM: RmInitAdapter: Cannot initialize GSP firmware RM
[   96.365287] NVRM: GPU 0000:01:00.0: RmInitAdapter failed! (0x62:0x65:2015)
[   96.366807] NVRM: GPU 0000:01:00.0: rm_init_adapter failed, device minor number 0

I also tried kernel version 6.16, but after booting, I don’t even have an image.

  1. Try if you can boot with the nouveau kernel driver instead of the nvidia driver.
    Interrupt boot process at the grub stage and edit kernel cmdline and delete
    rd.driver.blacklist=nouveau,nova_core modprobe.blacklist=nouveau,nova_core

I don’t see the iGPU listed in the inxi output. Maybe try to enable the iGPU in bios.

3,

This looks strange to me, because I would not expect TU102 (turing ) functions called on an Ampere GPU, but that’s implementation details.

AFAIK those drivers won’t work w/o the GSP.

Also try to power down the system, disconnect the AC adapter and remove / disable battery for a few minutes.