desmond97
(Indexador Segundo)
May 7, 2024, 8:29pm
1
Hi, I’m somewhat new to this, and am not exactly sure what I’m doing
I followed instructions from rpmfusion to install Nvidia drivers, but when I restarted the computer, after selecting the kernel, it froze on the loading screen.
I couldn’t boot on the latest kernel (6.8.8-300), so I’m using the previous one (6.8.8-200)
Here’s some information that I’m not sure if it’s useful
dnf list installed '*nvidia*'
Installed Packages
akmod-nvidia.x86_64 3:550.78-1.fc40 @rpmfusion-nonfree-nvidia-driver
kmod-nvidia-6.8.8-300.fc40.x86_64.x86_64
3:550.78-1.fc40 @@commandline
nvidia-gpu-firmware.noarch 20240410-1.fc40 @updates
nvidia-modprobe.x86_64 3:550.78-1.fc40 @rpmfusion-nonfree-nvidia-driver
nvidia-persistenced.x86_64 3:550.78-1.fc40 @rpmfusion-nonfree-nvidia-driver
nvidia-settings.x86_64 3:550.78-1.fc40 @rpmfusion-nonfree-nvidia-driver
xorg-x11-drv-nvidia.x86_64 3:550.78-1.fc40 @rpmfusion-nonfree-nvidia-driver
xorg-x11-drv-nvidia-cuda.x86_64 3:550.78-1.fc40 @rpmfusion-nonfree-nvidia-driver
xorg-x11-drv-nvidia-cuda-libs.x86_64
3:550.78-1.fc40 @rpmfusion-nonfree-nvidia-driver
xorg-x11-drv-nvidia-kmodsrc.x86_64
3:550.78-1.fc40 @rpmfusion-nonfree-nvidia-driver
xorg-x11-drv-nvidia-libs.x86_64 3:550.78-1.fc40 @rpmfusion-nonfree-nvidia-driver
xorg-x11-drv-nvidia-power.x86_64
3:550.78-1.fc40 @rpmfusion-nonfree-nvidia-driver
$ rpm -qa | grep -e akmod -e nvidia
nvidia-gpu-firmware-20240410-1.fc40.noarch
xorg-x11-drv-nvidia-kmodsrc-550.78-1.fc40.x86_64
xorg-x11-drv-nvidia-cuda-libs-550.78-1.fc40.x86_64
nvidia-modprobe-550.78-1.fc40.x86_64
xorg-x11-drv-nvidia-libs-550.78-1.fc40.x86_64
akmods-0.5.8-8.fc40.noarch
nvidia-settings-550.78-1.fc40.x86_64
xorg-x11-drv-nvidia-power-550.78-1.fc40.x86_64
xorg-x11-drv-nvidia-550.78-1.fc40.x86_64
akmod-nvidia-550.78-1.fc40.x86_64
nvidia-persistenced-550.78-1.fc40.x86_64
xorg-x11-drv-nvidia-cuda-550.78-1.fc40.x86_64
kmod-nvidia-6.8.8-300.fc40.x86_64-550.78-1.fc40.x86_64
And here’s the crash report(?)
Any help would be much appreciated.
Very possibly the fact that the kernel command line seems to be missing the nvidia-drm.modeset=1
option.
To test that when booting hold the shift key to display the grub menu then press e
to edit the commands.
On the line that begins with linux=
add the option above then continue booting.
If it boots properly then it can be made permanent with editing the file /etc/default/grub
and add the same option into the line that begins with GRUB_CMDLINE_LINUX=
. Save the file then run sudo grub2-mkconfig -o /boot/grub2/grub.cfg
. Once that is done a reboot should work properly.
desmond97
(Indexador Segundo)
May 7, 2024, 9:36pm
3
Hm, unfortunately that didn’t seem to work
desmond97
(Indexador Segundo)
May 7, 2024, 11:17pm
4
Also, when I use the lsmod command, nvidia or nvidia-drm don’t appear, I don’t know if that’s a problem
$ lsmod | grep nvidia
nvidia_wmi_ec_backlight 12288 0
video 77824 6 nvidia_wmi_ec_backlight,dell_wmi,dell_laptop,xe,i915,nouveau
wmi 36864 12 dell_wmi_sysman,video,nvidia_wmi_ec_backlight,dell_wmi_ddv,alienware_wmi,dell_wmi,wmi_bmof,dell_smm_hwmon,dell_smbios,dell_wmi_descriptor,mxm_wmi,nouveau
desmond97
(Indexador Segundo)
May 7, 2024, 11:54pm
5
Okay, this keeps getting weirder. After rebooting a few times, it will sometimes load past the boot screen and into the login screen.
It doesn’t completely freeze in the login screen, I can still move the mouse, but I can’t click or type anything. It’s as if it’s teasing me.
That may be a problem.
Do you have secure boot enabled? If so the first step would be to boot into the bios setup menu and disable secure boot, so the nvidia drivers can be loaded.
Once booted with the nvidia drivers there are steps available to make it possible using secure boot as well but getting the drivers to load first is important.
When booted you can check the status of secure boot with mokutil --sb-state
desmond97
(Indexador Segundo)
May 8, 2024, 12:02am
7
Jeff V:
mokutil --sb-state
Now that I think about it, I don’t think they should be loaded, since I’ve been booting on the older kernel from before I installed the driver. I couldn’t get the newer one to boot at all.
And the secure boot is disabled
The key information on the crash is hidden, namely the backtrace.
Can you post the backtrace please, and as text not a picture?
It will be in the journalctl output.
desmond97
(Indexador Segundo)
May 8, 2024, 7:59pm
9
The backtrace that shows in the problem report is
#PF: supervisor read access in kernel mode
#PF: error_code(0x0000) - not-present page
PGD 0 P4D 0
Oops: 0000 [#2] PREEMPT SMP NOPTI
CPU: 10 PID: 296 Comm: kworker/10:1 Tainted: P D OE 6.8.8-300.fc40.x86_64 #1
Hardware name: Dell Inc. Dell G15 5530/04HG9Y, BIOS 1.14.0 03/19/2024
Workqueue: kacpi_notify acpi_os_execute_deferred
RIP: 0010:_nv012501rm+0xe7/0x310 [nvidia]
Code: 0f 84 94 00 00 00 48 8b 42 08 80 78 20 00 0f 84 86 00 00 00 48 8b 70 08 48 8b 4e 10 48 39 c8 75 aa 48 8b 4e 18 48 85 c9 74 06 <80> 79 20 00 75 ae 48 39 50 18 0f 84 99 01 00 00 c6 40 20 00 48 8b
RSP: 0018:ffff9d9b807cbd18 EFLAGS: 00010006
RAX: ffff9d9b8033bdc0 RBX: ffffffffc2909c8d RCX: 0000000000004000
RDX: ffff9d9b807cbdc0 RSI: ffff9d9b806afdc0 RDI: ffffffffc2af3ff8
RBP: ffff906ff359e000 R08: 0000000000000000 R09: ffff9d9b807cbde8
R10: 000000000000000d R11: 000000000000000d R12: ffff9d9b807cbd70
R13: ffff906fd8ac0000 R14: 0000000000004001 R15: ffff906fd13c80d8
FS: 0000000000000000(0000) GS:ffff90733f680000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000004020 CR3: 00000002f0428000 CR4: 0000000000f50ef0
PKRU: 55555554
Call Trace:
<TASK>
? __die+0x23/0x70
? page_fault_oops+0x174/0x540
? exc_page_fault+0x7f/0x180
? asm_exc_page_fault+0x26/0x30
? _nv000779rm+0x1d/0x70 [nvidia]
? _nv012501rm+0xe7/0x310 [nvidia]
? _nv000779rm+0x1d/0x70 [nvidia]
_nv049766rm+0xd6/0x1d0 [nvidia]
? pick_eevdf+0x160/0x1a0
_nv000779rm+0x1d/0x70 [nvidia]
rm_acpi_notify+0xf1/0x280 [nvidia]
acpi_ev_notify_dispatch+0x48/0x80
acpi_os_execute_deferred+0x17/0x30
process_one_work+0x16f/0x330
worker_thread+0x273/0x3c0
? __pfx_worker_thread+0x10/0x10
kthread+0xe5/0x120
? __pfx_kthread+0x10/0x10
ret_from_fork+0x31/0x50
? __pfx_kthread+0x10/0x10
ret_from_fork_asm+0x1b/0x30
</TASK>
Modules linked in: nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nvidia_drm(POE+) nvidia_modeset(POE) ip_set nf_tables nvidia_uvm(POE) qrtr bnep nvidia(POE) sunrpc binfmt_misc vfat fat snd_ctl_led snd_soc_skl_hda_dsp snd_soc_hdac_hdmi snd_sof_probes snd_soc_intel_hda_dsp_common snd_hda_codec_realtek snd_hda_codec_generic snd_soc_dmic iwlmvm snd_sof_pci_intel_tgl snd_sof_intel_hda_common soundwire_intel snd_sof_intel_hda_mlink soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp snd_sof mac80211 snd_sof_utils snd_soc_hdac_hda snd_hda_ext_core snd_soc_acpi_intel_match snd_soc_acpi soundwire_generic_allocation soundwire_bus snd_soc_core intel_uncore_frequency intel_uncore_frequency_common intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp coretemp snd_compress snd_hda_codec_hdmi ac97_bus libarc4 uvcvideo
snd_pcm_dmaengine kvm_intel snd_hda_intel snd_intel_dspcfg snd_intel_sdw_acpi uvc videobuf2_vmalloc snd_hda_codec videobuf2_memops kvm btusb videobuf2_v4l2 btrtl snd_hda_core processor_thermal_device_pci videobuf2_common snd_hwdep processor_thermal_device snd_seq iwlwifi spi_nor snd_seq_device processor_thermal_wt_hint videodev snd_pcm btintel iTCO_wdt processor_thermal_rfim mtd processor_thermal_rapl intel_rapl_msr intel_pmc_bxt irqbypass dell_laptop btbcm iTCO_vendor_support btmtk intel_rapl_common uas rapl mei_pxp mei_hdcp dell_wmi bluetooth intel_cstate intel_pmc_core snd_timer cfg80211 mc usb_storage intel_uncore dell_wmi_ddv alienware_wmi dell_smbios dell_smm_hwmon pcspkr snd dcdbas processor_thermal_wt_req i2c_i801 dell_wmi_sysman spi_intel_pci processor_thermal_power_floor firmware_attributes_class ledtrig_audio dell_wmi_descriptor intel_vsec wmi_bmof nvidia_wmi_ec_backlight soundcore i2c_smbus rfkill spi_intel idma64 processor_thermal_mbox int3403_thermal int340x_thermal_zone pmt_telemetry
int3400_thermal intel_hid pmt_class mei_me acpi_thermal_rel acpi_pad sparse_keymap acpi_tad joydev mei loop nfnetlink zram xe drm_ttm_helper gpu_sched drm_suballoc_helper drm_gpuvm drm_exec hid_sensor_hub intel_ishtp_hid hid_logitech_hidpp wacom hid_logitech_dj i915 nvme nvme_core nvme_auth crct10dif_pclmul crc32_pclmul i2c_algo_bit crc32c_intel drm_buddy polyval_clmulni polyval_generic ttm drm_display_helper r8169 ghash_clmulni_intel intel_ish_ipc hid_multitouch sha512_ssse3 ucsi_acpi video sha256_ssse3 typec_ucsi sha1_ssse3 cec intel_ishtp typec realtek vmd i2c_hid_acpi i2c_hid wmi pinctrl_alderlake serio_raw ip6_tables ip_tables fuse
CR2: 0000000000004020
That shows its in the nvidia driver and its a memory issue.
Maybe a driver bug, but might also point to a hardware issue.
Try removing the GPU and cleaning the slot it was in then reinstalling.
(Happen to me with AMD GPU recently)
Also try running memory test.
desmond97
(Indexador Segundo)
May 8, 2024, 10:08pm
11
I don’t think it’s a hardware issue, the GPU seems to work on Windows.
I tried running a memory test and it showed no errors
You will need to report the problem to nvidia so they can fix their code.
1 Like