Something went wrong on update while installing Nvidia drivers

Hello!

I was trying to install the Nvidia drivers. I think it went well after many attempts. Anyways, I had a kernel upgrade from 6.6.6.200 to 6.6.7.200 while following this tutorial and I think something went wrong because the order was:

  • Determine card model
  • dnf update -y
  • sudo dnf install xorg-x11-drv-nvidia-470xx akmod-nvidia-470xx
  • sudo dnf install xorg-x11-drv-nvidia-470xx-cuda #optional for cuda up to 11.4 support

And then I rebooted.

Now when I try to boot from 6.6.7, the system won’t completely boot. It seems like it will, but it stays in the “Lenovo” screen.

I believe the Nvidia drivers are correcly installed because I see something with the nvidia-smi command (tbh, I don’t know what I see, but is not an error like before), and I see the Nvidia card configuration in the Nvidia X Server app, which I didn’t see neither before.

Based on all that, I believe that I should update again, since I don’t want this mistake to be kept on further updates. How should I “reupdate” the system?

Thanks!

Did you wait at least 5 minutes after upgrading before you rebooted?

Updating the nvidia drivers is one thing, updating the entire machine is something different.
I hope the update was done using the command you show in the second step above.

The steps you show in the order performed should have worked, but if you rebooted too early after the installation then it may be an issue.

If possible please show the output of both dnf list installed \*nvidia\* and inxi -Fzxx so we may see the details of the system and have more info to assist. Use the preformatted text button </> to retain on-screen formatting of the text you post.

I did. I actually waited like 20 minutes.

On the update I didn’t add the “-y” part, but of course typed “y” after I was asked.

Here the outputs:

  • dnf list installed \*nvidia\*
Installed Packages
akmod-nvidia-470xx.x86_64                3:470.223.02-1.fc39  @rpmfusion-nonfree
kmod-nvidia-470xx-6.6.6-200.fc39.x86_64.x86_64
                                         3:470.223.02-1.fc39  @@commandline     
kmod-nvidia-470xx-6.6.7-200.fc39.x86_64.x86_64
                                         3:470.223.02-1.fc39  @@commandline     
nvidia-gpu-firmware.noarch               20231211-1.fc39      @updates          
nvidia-persistenced.x86_64               3:545.29.06-1.fc39   @rpmfusion-nonfree-nvidia-driver
nvidia-settings-470xx.x86_64             3:470.223.02-1.fc39  @rpmfusion-nonfree
xorg-x11-drv-nvidia-470xx.x86_64         3:470.223.02-1.fc39  @rpmfusion-nonfree
xorg-x11-drv-nvidia-470xx-cuda.x86_64    3:470.223.02-1.fc39  @rpmfusion-nonfree
xorg-x11-drv-nvidia-470xx-cuda-libs.i686 3:470.223.02-1.fc39  @rpmfusion-nonfree
xorg-x11-drv-nvidia-470xx-cuda-libs.x86_64
                                         3:470.223.02-1.fc39  @rpmfusion-nonfree
xorg-x11-drv-nvidia-470xx-kmodsrc.x86_64 3:470.223.02-1.fc39  @rpmfusion-nonfree
xorg-x11-drv-nvidia-470xx-libs.i686      3:470.223.02-1.fc39  @rpmfusion-nonfree
xorg-x11-drv-nvidia-470xx-libs.x86_64    3:470.223.02-1.fc39  @rpmfusion-nonfree
  • inxi -Fzxx
System:
  Kernel: 6.6.6-200.fc39.x86_64 arch: x86_64 bits: 64 compiler: gcc
    v: 2.40-13.fc39 Desktop: GNOME v: 45.2 tk: GTK v: 3.24.38 wm: gnome-shell
    dm: GDM Distro: Fedora release 39 (Thirty Nine)
Machine:
  Type: Laptop System: LENOVO product: 82EY v: IdeaPad Gaming 3 15ARH05
    serial: <superuser required> Chassis: type: 10 v: IdeaPad Gaming 3 15ARH05
    serial: <superuser required>
  Mobo: LENOVO model: LNVNB161216 v: SDK0J40709 WIN
    serial: <superuser required> UEFI: LENOVO v: FCCN13WW date: 06/29/2020
Battery:
  ID-1: BAT0 charge: 20.0 Wh (55.1%) condition: 36.3/45.0 Wh (80.6%)
    volts: 11.1 min: 11.4 model: SMP L19M3PF7 serial: <filter>
    status: discharging
CPU:
  Info: 8-core model: AMD Ryzen 7 4800H with Radeon Graphics bits: 64
    type: MT MCP arch: Zen 2 rev: 1 cache: L1: 512 KiB L2: 4 MiB L3: 8 MiB
  Speed (MHz): avg: 1398 high: 1400 min/max: 1400/2900 boost: enabled cores:
    1: 1400 2: 1400 3: 1397 4: 1400 5: 1397 6: 1400 7: 1397 8: 1397 9: 1398
    10: 1397 11: 1400 12: 1400 13: 1400 14: 1400 15: 1400 16: 1400
    bogomips: 92630
  Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm
Graphics:
  Device-1: NVIDIA TU117M [GeForce GTX 1650 Ti Mobile] vendor: Lenovo
    driver: nvidia v: 470.223.02 arch: Turing pcie: speed: 2.5 GT/s lanes: 8
    ports: active: none empty: HDMI-A-1 bus-ID: 01:00.0 chip-ID: 10de:1f95
  Device-2: AMD Renoir [Radeon RX Vega 6 ] vendor: Lenovo driver: amdgpu
    v: kernel arch: GCN-5 pcie: speed: 16 GT/s lanes: 16 ports: active: eDP-1
    empty: none bus-ID: 05:00.0 chip-ID: 1002:1636 temp: 40.0 C
  Device-3: IMC Networks [] driver: uvcvideo type: USB rev: 2.0
    speed: 480 Mb/s lanes: 1 bus-ID: 1-3:2 chip-ID: 13d3:56ff
  Display: x11 server: X.Org v: 1.20.14 with: Xwayland v: 23.2.3
    compositor: gnome-shell driver: X: loaded: amdgpu,nvidia
    unloaded: fbdev,modesetting,nouveau,vesa alternate: nv dri: radeonsi
    gpu: amdgpu display-ID: :1 screens: 1
  Screen-1: 0 s-res: 1920x1080 s-dpi: 96
  Monitor-1: eDP-1 mapped: eDP model: BOE Display 0x08e8 res: 1920x1080
    dpi: 142 diag: 395mm (15.5")
  API: OpenGL v: 4.6 vendor: amd mesa v: 23.3.1 glx-v: 1.4 es-v: 3.2
    direct-render: yes renderer: AMD Radeon Graphics (radeonsi renoir LLVM
    17.0.6 DRM 3.54 6.6.6-200.fc39.x86_64) device-ID: 1002:1636
  API: EGL Message: EGL data requires eglinfo. Check --recommends.
Audio:
  Device-1: NVIDIA driver: snd_hda_intel v: kernel pcie: speed: 2.5 GT/s
    lanes: 8 bus-ID: 01:00.1 chip-ID: 10de:10fa
  Device-2: AMD ACP/ACP3X/ACP6x Audio Coprocessor vendor: Lenovo driver: N/A
    pcie: speed: 16 GT/s lanes: 16 bus-ID: 05:00.5 chip-ID: 1022:15e2
  Device-3: AMD Family 17h/19h HD Audio vendor: Lenovo driver: snd_hda_intel
    v: kernel pcie: speed: 16 GT/s lanes: 16 bus-ID: 05:00.6 chip-ID: 1022:15e3
  API: ALSA v: k6.6.6-200.fc39.x86_64 status: kernel-api
  Server-1: JACK v: 1.9.22 status: off
  Server-2: PipeWire v: 1.0.0 status: active with: 1: pipewire-pulse
    status: active 2: wireplumber status: active 3: pipewire-alsa type: plugin
Network:
  Device-1: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet
    vendor: Lenovo driver: r8169 v: kernel pcie: speed: 2.5 GT/s lanes: 1
    port: 2000 bus-ID: 03:00.0 chip-ID: 10ec:8168
  IF: eno1 state: down mac: <filter>
  Device-2: Qualcomm Atheros QCA6174 802.11ac Wireless Network Adapter
    vendor: Lenovo driver: ath10k_pci v: kernel pcie: speed: 2.5 GT/s lanes: 1
    bus-ID: 04:00.0 chip-ID: 168c:003e temp: 50.0 C
  IF: wlp4s0 state: up mac: <filter>
Bluetooth:
  Device-1: Qualcomm Atheros QCA61x4 Bluetooth 4.0 driver: btusb v: 0.8
    type: USB rev: 2.0 speed: 12 Mb/s lanes: 1 bus-ID: 3-3:2 chip-ID: 0cf3:e300
  Report: btmgmt ID: hci0 rfk-id: 5 state: up address: <filter> bt-v: 4.2
    lmp-v: 8
Drives:
  Local Storage: total: 476.94 GiB used: 170.04 GiB (35.7%)
  ID-1: /dev/nvme0n1 vendor: Toshiba model: KBG40ZNT512G MEMORY
    size: 476.94 GiB speed: 31.6 Gb/s lanes: 4 serial: <filter> temp: 31.9 C
Partition:
  ID-1: / size: 475.01 GiB used: 169.67 GiB (35.7%) fs: btrfs dev: /dev/dm-0
    mapped: luks-56a3363a-685a-46b1-83fd-49a2eac1d90f
  ID-2: /boot size: 942.8 MiB used: 356.2 MiB (37.8%) fs: ext4
    dev: /dev/nvme0n1p2
  ID-3: /boot/efi size: 974.6 MiB used: 17.4 MiB (1.8%) fs: vfat
    dev: /dev/nvme0n1p1
  ID-4: /home size: 475.01 GiB used: 169.67 GiB (35.7%) fs: btrfs
    dev: /dev/dm-0 mapped: luks-56a3363a-685a-46b1-83fd-49a2eac1d90f
Swap:
  ID-1: swap-1 type: zram size: 8 GiB used: 0 KiB (0.0%) priority: 100
    dev: /dev/zram0
Sensors:
  System Temperatures: cpu: 41.5 C mobo: N/A gpu: amdgpu temp: 40.0 C
  Fan Speeds (rpm): N/A
Info:
  Processes: 482 Uptime: 20m Memory: total: 16 GiB note: est.
  available: 14.99 GiB used: 3.96 GiB (26.4%) Init: systemd v: 254
  target: graphical (5) default: graphical Compilers: gcc: 13.2.1 Packages:
  pm: rpm pkgs: N/A note: see --rpm pm: flatpak pkgs: 46 Shell: Bash v: 5.2.21
  running-in: gnome-terminal inxi: 3.3.31

That shows you installed an incorrect driver. The 1650 is perfectly supported by the latest nvidia driver.

Please remove the 470xx driver and install the latest driver.

  1. sudo dnf remove \*nvidia\* --exclude nvidia-gpu-firmware to remove all the invalid packages.
  2. sudo dnf install akmod-nvidia xorg-x11-drv-nvidia-cuda to install all the proper versioned packages.
  3. Wait at least 5 minutes after step 2 completes then reboot.

After this the dnf list installed \*nvidia\* command should show all the packages at the 545.29.06-1 version (as the nvidia-persistenced package is now).

One of the major failings of using the 470xx driver with that GPU is not that the gpu is unsupported, but rather that the driver does not support wayland at all and forces use of xorg, in addition to the fact there have been no updates to actually support new features in kernels for several years.

1 Like

Good to know that I can use the latest driver!

I did as you say, sadly I’m still unable to boot into the 6.6.7 kernel and now my laptop has this strange behaviour, where the screen slows down for like half a secod (on a video is really obvious)

The kernel booted but still I’m facing this kind of microfreezing.

I made some changes in the grub to try some stuff, since the microfreezing thing is something I faced before and I thought I fixed.

Before the whole Nvidia + updating I did:

  • Add drm.debug=0 drm.vblankoffdelay=1 i915.semaphores=0 i915.modeset=1 i915.use_mmio_flip=1 i915.powersave=1 i915.enable_ips=1 i915.disable_power_well=1 i915.enable_hangcheck=1 i915.enable_cmd_parser=1 i915.fastboot=0 i915.enable_ppgtt=1 i915.reset=0 i915.lvds_use_ssc=0 i915.enable_psr=0 to the the /etc/default/grub file.
  • Change GRUB_ENABLE_BLSCFG=true in /etc/default/grub to false. This has to be done before generating the grub config or if not you won’t be able to boot to fedora.
  • Then remove old grub config : sudo rm /boot/grub2/grub.cfg
  • After that generate new : sudo grub2-mkconfig -o /boot/grub2/grub.cfg
  • Reboot

And it was like that so far, till I experienced the microfreezings again after booting into 6.6.7. To try to fix that I changed the first step to just add GRUB_CMDLINE_LINUX_DEFAULT="i915.enable_psr=0", but like said, it started showing again the microfreezings (which were back before I rebooted into 6.6.7) any idea of how can I get rid of that behaviour?