Graphical Issues and System Instability

Been using Fedora 40 for a few months now. Machine is a custom-built i9-13900k with a 3080ti. My biggest issue since day 1 is putting the machine to sleep. It will…

  • Occasionally resume normally.
  • Not at all, black screen requiring a power reset.
  • Resume but with extreme graphical issues (small gray blocks, strange visualizations, screen freezing). This includes applications crashing. Only resolved by a restart.

I’m using whatever NVIDIA drivers were pulled using dnf. Is there anything I can do to improve this?

Hi and welcome to :fedora:

The issue could be related to the graphics driver. What is the output of inxi -Fzxx and that of dnf repolist. Please post as preformatted text, using the </> button.

I assume you are saying an nvidia RTX 3080 Ti.
Please post the output of dnf list installed '*nvidia*' so we can see the packages installed as well as the repo they came from.

One thing of note that may be of concern.
If you enabled the 3rd party repos during the first boot setup then you likely have the rpmfusion-nonfree-nvidia-driver repo (which would show with the dnf repolist output asked for above) that may have been used to install the nvidia drivers. That repo contains only the 555 driver version.

There is a newer driver version (560.35.03) in the rpmfusion-nonfree repo but that requires an explicit enabling of the additional repos (free & nonfree) as shown at https://rpmfusion.org/Configuration then an upgrade using sudo dnf upgrade akmod-nvidia --disablerepo rpmfusion-nonfree-nvidia-driver to force the system to select the newer driver for the upgrade.

inxi

System:
  Kernel: 6.11.3-200.fc40.x86_64 arch: x86_64 bits: 64 compiler: gcc
    v: 2.41-37.fc40
  Desktop: GNOME v: 46.5 tk: GTK v: 3.24.43 wm: gnome-shell dm: GDM
    Distro: Fedora Linux 40 (Workstation Edition)
Machine:
  Type: Desktop Mobo: Micro-Star model: MEG Z690 UNIFY-X (MS-7D28) v: 2.0
    serial: <superuser required> UEFI: American Megatrends LLC. v: A.I0
    date: 08/20/2024
CPU:
  Info: 24-core (8-mt/16-st) model: 13th Gen Intel Core i9-13900K bits: 64
    type: MST AMCP arch: Raptor Lake rev: 1 cache: L1: 2.1 MiB L2: 32 MiB
    L3: 36 MiB
  Speed (MHz): avg: 800 min/max: 800/5500:5800:4300 cores: 1: 800 2: 800
    3: 800 4: 800 5: 800 6: 800 7: 800 8: 800 9: 800 10: 800 11: 800 12: 800
    13: 800 14: 800 15: 800 16: 800 17: 800 18: 800 19: 800 20: 800 21: 800
    22: 800 23: 800 24: 800 25: 800 26: 800 27: 800 28: 800 29: 800 30: 800
    31: 800 32: 800 bogomips: 191692
  Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 ssse3 vmx
Graphics:
  Device-1: NVIDIA GA102 [GeForce RTX 3080 Ti] vendor: ZOTAC driver: nouveau
    v: kernel arch: Ampere pcie: speed: 2.5 GT/s lanes: 16 ports:
    active: DP-3,HDMI-A-1 empty: DP-1,DP-2 bus-ID: 01:00.0 chip-ID: 10de:2208
  Device-2: Logitech BRIO Ultra HD Webcam
    driver: hid-generic,snd-usb-audio,usbhid,uvcvideo type: USB rev: 3.1
    speed: 5 Gb/s lanes: 1 bus-ID: 2-2:2 chip-ID: 046d:085e
  Display: wayland server: X.org v: 1.20.14 with: Xwayland v: 24.1.3
    compositor: gnome-shell driver: gpu: nouveau display-ID: 0
  Monitor-1: DP-3 model: MSI Optix MAG27CQ res: 2560x1440 dpi: 110
    diag: 686mm (27")
  Monitor-2: HDMI-A-1 model: BenQ ZOWIE RL LCD res: 1920x1080 dpi: 92
    diag: 609mm (24")
  API: OpenGL v: 4.3 vendor: mesa v: 24.1.7 glx-v: 1.4 es-v: 3.2
    direct-render: yes renderer: NV172 device-ID: 10de:2208 display-ID: :0.0
  API: EGL Message: EGL data requires eglinfo. Check --recommends.
Audio:
  Device-1: Intel Alder Lake-S HD Audio vendor: Micro-Star MSI
    driver: snd_hda_intel v: kernel bus-ID: 00:1f.3 chip-ID: 8086:7ad0
  Device-2: NVIDIA GA102 High Definition Audio vendor: ZOTAC
    driver: snd_hda_intel v: kernel pcie: speed: 2.5 GT/s lanes: 16
    bus-ID: 01:00.1 chip-ID: 10de:1aef
  Device-3: Micro Star USB Audio driver: hid-generic,snd-usb-audio,usbhid
    type: USB rev: 2.0 speed: 480 Mb/s lanes: 1 bus-ID: 1-11:9
    chip-ID: 0db0:95bb
  Device-4: Logitech BRIO Ultra HD Webcam
    driver: hid-generic,snd-usb-audio,usbhid,uvcvideo type: USB rev: 3.1
    speed: 5 Gb/s lanes: 1 bus-ID: 2-2:2 chip-ID: 046d:085e
  API: ALSA v: k6.11.3-200.fc40.x86_64 status: kernel-api
  Server-1: JACK v: 1.9.22 status: off
  Server-2: PipeWire v: 1.0.8 status: active with: 1: pipewire-pulse
    status: active 2: wireplumber status: active 3: pipewire-alsa type: plugin
Network:
  Device-1: Intel Alder Lake-S PCH CNVi WiFi driver: iwlwifi v: kernel
    bus-ID: 00:14.3 chip-ID: 8086:7af0
  IF: wlo1 state: down mac: <filter>
  Device-2: Intel Ethernet I225-V vendor: Micro-Star MSI driver: igc
    v: kernel pcie: speed: 5 GT/s lanes: 1 port: N/A bus-ID: 06:00.0
    chip-ID: 8086:15f3
  IF: enp6s0 state: down mac: <filter>
  Device-3: Intel Ethernet I225-V vendor: Micro-Star MSI driver: igc
    v: kernel pcie: speed: 5 GT/s lanes: 1 port: N/A bus-ID: 07:00.0
    chip-ID: 8086:15f3
  IF: enp7s0 state: up speed: 2500 Mbps duplex: full mac: <filter>
Bluetooth:
  Device-1: Intel AX211 Bluetooth driver: btusb v: 0.8 type: USB rev: 2.0
    speed: 12 Mb/s lanes: 1 bus-ID: 1-14:12 chip-ID: 8087:0033
  Report: btmgmt ID: hci0 rfk-id: 0 state: up address: <filter> bt-v: 5.3
    lmp-v: 12
Drives:
  Local Storage: total: 5.46 TiB used: 104.78 GiB (1.9%)
  ID-1: /dev/nvme0n1 vendor: Smart Modular Tech. model: SHPP41-2000GM
    size: 1.82 TiB speed: 63.2 Gb/s lanes: 4 serial: <filter> temp: 38.9 C
  ID-2: /dev/nvme1n1 vendor: Samsung model: SSD 990 PRO 4TB size: 3.64 TiB
    speed: 63.2 Gb/s lanes: 4 serial: <filter> temp: 33.9 C
Partition:
  ID-1: / size: 55.43 GiB used: 40.34 GiB (72.8%) fs: btrfs
    dev: /dev/nvme0n1p6
  ID-2: /boot size: 973.4 MiB used: 455.3 MiB (46.8%) fs: ext4
    dev: /dev/nvme0n1p5
  ID-3: /boot/efi size: 96 MiB used: 51.6 MiB (53.7%) fs: vfat
    dev: /dev/nvme0n1p1
  ID-4: /home size: 55.43 GiB used: 40.34 GiB (72.8%) fs: btrfs
    dev: /dev/nvme0n1p6
Swap:
  ID-1: swap-1 type: partition size: 977 MiB used: 0 KiB (0.0%) priority: -2
    dev: /dev/nvme0n1p9
  ID-2: swap-2 type: zram size: 8 GiB used: 512 KiB (0.0%) priority: 100
    dev: /dev/zram0
Sensors:
  System Temperatures: cpu: 29.5 C mobo: N/A
  Fan Speeds (rpm): N/A
Info:
  Memory: total: 32 GiB available: 31.18 GiB used: 3.17 GiB (10.2%)
  Processes: 572 Power: uptime: 23h 35m wakeups: 0 Init: systemd v: 255
    target: graphical (5) default: graphical
  Packages: pm: rpm pkgs: N/A note: see --rpm Compilers: gcc: 14.2.1
    Shell: Bash v: 5.2.26 running-in: gnome-terminal inxi: 3.3.36

dnf

nvidia-gpu-firmware.noarch                                             20240909-1.fc40                                             @updates

The command outputs show that you have no Nvidia drivers installed, and using the generic nouveau drivers instead.

You should follow the instructions as suggested by @computersavvy above, in order to install the necessary drivers.

Please also have a look at RPM Fusion’s Nividia HowTo, so that you get familiar with the concepts of installing and using Nvidia drivers on Fedora systems. Normally, one would use those instructions. However, there is currently a special situation, which doesn’t usually occur, in which two branches of the driver are being provided simultaneously:

  • the 555.xx versions, available in the rpmfusion-nonfree-nvidia-driver repo
  • the 560.xx versions, available in the rpmfusion-nonfreerepo.

Hence the suggestion above to disable the rpmfusion-nonfree-nvidia-driver repo when installing the drivers.

1 Like

Ouch, rookie mistake. Like only using the Microsoft Basic Display Adapter.

I’ve got 560 installed, will see how that goes.

2 Likes

Still having the same issues, although I feel like they’re not as frequent.

Graphical issues: Gnome flatpaks apps are crashing or not opening - #3 by Espionage724

Probably need to set GSK_RENDERER=gl globally in profile.d


Suspend issues: Fedora 40 kde battery percentage not updating/always in charge - #19 by Espionage724

From this list I think Windows 11 might work for Raptor Lake:

acpi_osi='Windows 2021'

I’ve just set that environment variable now, will see how it goes.

sudo strings '/sys/firmware/acpi/tables/DSDT' | grep -i 'windows ' | sort output is…

Windows 2001
Windows 2001.1
Windows 2001 SP1
Windows 2001 SP2
Windows 2006
Windows 2009
Windows 2012
Windows 2013
Windows 2015

Set it to “Windows 2021”, even if it’s not reported?

Nah in that case I’d use Windows 2015. Iirc you can see what OSI does kind-of specifically in the DSDT with a text editor (mine made mention to Touchpad but I don’t understand what exactly it’s doing :stuck_out_tongue:).

I’m thinking Windows 10 support should be as good as it can get but if the computer mobo was manufacturer during a certain Windows version year (like 8 or 8.1) then it might have better compatibility/support settings from an older version with newer OSI support making it faster/dropping legacy.

It’s a Z690 chipset, so Q4 of 2021. Although that’s roughly when Windows 11 released, I would’ve expected more than “Windows 2015”.

All the same, I’ll apply that.

After applying that to GRUB and rebooting, I caught an odd screen I’d see a few times before (a gray screen with 3 square white dots). What I don’t recall seeing is the message “NVIDIA kernel missing. Falling back to nouveau”.

Since I have secure boot enabled, this has likely been an issue since the beginning. (Well, aside from the fact I didn’t have the NVIDIA drivers installed.) Imported the public key, from the README, message didn’t reappear. Hopefully that’s the root of my issues.

Experiencing different issues now…

  • Sometimes, when I put it to sleep, it will black screen but stay awake. Eventually, the monitors resume. I try again, it sleeps.
  • After waking the displays and logging in, both monitors will go black for a few seconds then resume. This can happen when switching between application windows. Sometimes, the monitors just lose signal. Replugging the display cables brings them back.

After upgrading to Fedora 41, I ran into the same issue "NVIDIA kernel missing. Falling back to nouveau”. Seems I am in fact still on 40, I’m probably confusing it with the other upgrades I did. Not sure what changed, but definitely after updates.

Following the same instructions above, the key was already enrolled. I ran sudo akmods --force --rebuild, rebooted, and lspci -nnk | grep -iA2 vga reports the NVIDIA kernel driver is being used.