Black screen on boot | F41 | NVIDIA GPU

Hi,

This seems closely related to Black screen on boot | F40 | NVIDIA GPU. My hardware is AMD3950X and GA104 [GeForce RTX 3070] video. This runs happily using the nouveau driver. However, I need to enable CUDA. Secure boot is disabled in the BIOS. I followed Howto/NVIDIA - RPM Fusion to install the driver. System is fully up-to-date, running kernel 6.11.10-300.fc41.x86_64. NVidia modules installed are here (first also installed cuda, but after the failure I first wanted to get the plan driver in place).

akmod-nvidia.x86_64                        3:565.57.01-1.fc41 rpmfusion-nonfree-nvidia-driver
kmod-nvidia-6.11.10-300.fc41.x86_64.x86_64 3:565.57.01-1.fc41 @commandline
nvidia-gpu-firmware.noarch                 20241110-1.fc41    <unknown>
nvidia-modprobe.x86_64                     3:565.57.01-1.fc41 rpmfusion-nonfree-nvidia-driver
nvidia-settings.x86_64                     3:565.57.01-1.fc41 rpmfusion-nonfree-nvidia-driver
xorg-x11-drv-nvidia.x86_64                 3:565.57.01-3.fc41 rpmfusion-nonfree-nvidia-driver
xorg-x11-drv-nvidia-cuda-libs.x86_64       3:565.57.01-3.fc41 rpmfusion-nonfree-nvidia-driver
xorg-x11-drv-nvidia-kmodsrc.x86_64         3:565.57.01-3.fc41 rpmfusion-nonfree-nvidia-driver
xorg-x11-drv-nvidia-libs.i686              3:565.57.01-3.fc41 rpmfusion-nonfree-nvidia-driver
xorg-x11-drv-nvidia-libs.x86_64            3:565.57.01-3.fc41 rpmfusion-nonfree-nvidia-driver
xorg-x11-drv-nvidia-power.x86_64           3:565.57.01-3.fc41 rpmfusion-nonfree-nvidia-driver
xorg-x11-drv-nvidia-xorg-libs.x86_64       3:565.57.01-3.fc41 rpmfusion-nonfree-nvidia-driver

Output of inxi -Fzxx:

System:
  Kernel: 6.11.10-300.fc41.x86_64 arch: x86_64 bits: 64 compiler: gcc
    v: 2.43.1-2.fc41
  Desktop: GNOME v: 47.1 tk: GTK v: 3.24.43 wm: gnome-shell dm: 1: GDM
    2: LightDM note: stopped Distro: Fedora Linux 41 (Workstation Edition)
Machine:
  Type: Desktop Mobo: ASUSTeK model: PRIME B450-PLUS v: Rev X.0x
    serial: <superuser required> part-nu: SKU UEFI: American Megatrends v: 4622
    date: 09/29/2024
CPU:
  Info: 16-core model: AMD Ryzen 9 3950X bits: 64 type: MT MCP arch: Zen 2
    rev: 0 cache: L1: 1024 KiB L2: 8 MiB L3: 64 MiB
  Speed (MHz): avg: 3593 min/max: 550/4762 boost: enabled cores: 1: 3593
    2: 3593 3: 3593 4: 3593 5: 3593 6: 3593 7: 3593 8: 3593 9: 3593 10: 3593
    11: 3593 12: 3593 13: 3593 14: 3593 15: 3593 16: 3593 17: 3593 18: 3593
    19: 3593 20: 3593 21: 3593 22: 3593 23: 3593 24: 3593 25: 3593 26: 3593
    27: 3593 28: 3593 29: 3593 30: 3593 31: 3593 32: 3593 bogomips: 223585
  Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3
Graphics:
  Device-1: NVIDIA GA104 [GeForce RTX 3070] vendor: Micro-Star MSI
    driver: nouveau v: kernel arch: Ampere pcie: speed: 2.5 GT/s lanes: 16
    ports: active: HDMI-A-1 empty: DP-1,DP-2,DP-3 bus-ID: 09:00.0
    chip-ID: 10de:2484
  Device-2: Logitech B525 HD Webcam driver: snd-usb-audio,uvcvideo type: USB
    rev: 2.0 speed: 480 Mb/s lanes: 1 bus-ID: 3-1.3:5 chip-ID: 046d:0836
  Display: wayland server: X.org v: 1.21.1.14 with: Xwayland v: 24.1.4
    compositor: gnome-shell driver: X: loaded: modesetting,nouveau
    unloaded: fbdev,vesa gpu: nouveau display-ID: 0
  Monitor-1: HDMI-A-1 model: LG (GoldStar) ULTRAWIDE res: 3440x1440 dpi: 109
    diag: 867mm (34.1")
  API: OpenGL v: 4.3 vendor: mesa v: 24.2.7 glx-v: 1.4 es-v: 3.2
    direct-render: yes renderer: NV174 device-ID: 10de:2484 display-ID: :0.0
  API: Vulkan v: 1.3.296 surfaces: xcb,xlib,wayland device: 0
    type: discrete-gpu driver: N/A device-ID: 10de:2484 device: 1 type: cpu
    driver: N/A device-ID: 10005:0000
  API: EGL Message: EGL data requires eglinfo. Check --recommends.
Audio:
  Device-1: NVIDIA GA104 High Definition Audio vendor: Micro-Star MSI
    driver: snd_hda_intel v: kernel pcie: speed: 5 GT/s lanes: 16
    bus-ID: 09:00.1 chip-ID: 10de:228b
  Device-2: Advanced Micro Devices [AMD] Starship/Matisse HD Audio
    vendor: ASUSTeK driver: snd_hda_intel v: kernel pcie: speed: 16 GT/s
    lanes: 16 bus-ID: 0b:00.4 chip-ID: 1022:1487
  Device-3: Generalplus USB Audio Device
    driver: hid-generic,snd-usb-audio,usbhid type: USB rev: 1.1 speed: 12 Mb/s
    lanes: 1 bus-ID: 1-2:2 chip-ID: 1b3f:2008
  Device-4: Logitech B525 HD Webcam driver: snd-usb-audio,uvcvideo type: USB
    rev: 2.0 speed: 480 Mb/s lanes: 1 bus-ID: 3-1.3:5 chip-ID: 046d:0836
  API: ALSA v: k6.11.10-300.fc41.x86_64 status: kernel-api
  Server-1: JACK v: 1.9.22 status: off
  Server-2: PipeWire v: 1.2.7 status: active with: 1: pipewire-pulse
    status: active 2: wireplumber status: active 3: pipewire-alsa type: plugin
Network:
  Device-1: Realtek RTL8111/8168/8211/8411 PCI Express Gigabit Ethernet
    vendor: ASUSTeK RTL8111H driver: r8169 v: kernel pcie: speed: 2.5 GT/s
    lanes: 1 port: f000 bus-ID: 04:00.0 chip-ID: 10ec:8168
  IF: enp4s0 state: up speed: 1000 Mbps duplex: full mac: <filter>
  IF-ID-1: br-11c2c9b3104c state: down mac: <filter>
  IF-ID-2: docker0 state: down mac: <filter>
  IF-ID-3: mpqemubr0 state: down mac: <filter>
Drives:
  Local Storage: total: 5.01 TiB used: 1.03 TiB (20.6%)
  ID-1: /dev/nvme0n1 vendor: Kingston model: SFYRD4000G size: 3.64 TiB
    speed: 63.2 Gb/s lanes: 4 serial: <filter> temp: 35.9 C
  ID-2: /dev/sda model: 512GB PCS 2.5 SSD size: 476.94 GiB speed: 6.0 Gb/s
    serial: <filter> temp: 40 C
  ID-3: /dev/sdb vendor: Crucial model: CT1000X6SSD8 size: 931.51 GiB
    type: USB rev: 3.1 spd: 10 Gb/s lanes: 1 serial: <filter>
Partition:
  ID-1: / size: 195.8 GiB used: 86.46 GiB (44.2%) fs: ext4 dev: /dev/dm-0
    mapped: vgubuntu-fedora
  ID-2: /boot size: 25.86 GiB used: 573.5 MiB (2.2%) fs: ext4 dev: /dev/sda3
  ID-3: /boot/efi size: 511 MiB used: 24.4 MiB (4.8%) fs: vfat
    dev: /dev/sda1
  ID-4: /home size: 983.23 GiB used: 777.37 GiB (79.1%) fs: ext4
    dev: /dev/dm-2 mapped: janw4t-home
Swap:
  ID-1: swap-1 type: zram size: 8 GiB used: 0 KiB (0.0%) priority: 100
    dev: /dev/zram0
Sensors:
  System Temperatures: cpu: 51.2 C mobo: N/A
  Fan Speeds (rpm): N/A
Info:
  Memory: total: 128 GiB note: est. available: 125.69 GiB
    used: 5.14 GiB (4.1%)
  Processes: 679 Power: uptime: 12m wakeups: 0 Init: systemd v: 256
    target: graphical (5) default: graphical
  Packages: 141 pm: nix-default pkgs: 53 pm: rpm pkgs: N/A note: see --rpm
    pm: flatpak pkgs: 23 pm: snap pkgs: 12 Compilers: clang: 19.1.4 gcc: 14.2.1
    Shell: Bash v: 5.2.32 running-in: terminator inxi: 3.3.36

Iā€™ve played around with nomodeset, and nvidia_drm.modeset=[01] and I updated the BIOS. It all makes no difference:

I captured nvidia-bug-report.log.gz Guess it is a bit too large to attach here.

Anything I have missed?

In the meanwhile we have seen several kernel and nvidia driver updates, but the result stays the same: using nvidia_drm.modeset=1 the screen goes black immediately, using nvidia_drm.modeset=0 I get the splash screen at low resolution for maybe 10 seconds before the screen goes black. Current versions

(~) 3_> dnf list --installed \*nvidia\*
Installed packages
akmod-nvidia.x86_64                        3:565.77-1.fc41 rpmfusion-nonfree-nvidia-driver
kmod-nvidia-6.11.11-300.fc41.x86_64.x86_64 3:565.77-1.fc41 @commandline
kmod-nvidia-6.12.4-200.fc41.x86_64.x86_64  3:565.77-1.fc41 @commandline
kmod-nvidia-6.12.5-200.fc41.x86_64.x86_64  3:565.77-1.fc41 @commandline
nvidia-gpu-firmware.noarch                 20241210-1.fc41 updates
nvidia-modprobe.x86_64                     3:565.77-1.fc41 rpmfusion-nonfree-nvidia-driver
nvidia-settings.x86_64                     3:565.77-1.fc41 rpmfusion-nonfree-nvidia-driver
xorg-x11-drv-nvidia.x86_64                 3:565.77-3.fc41 rpmfusion-nonfree-updates
xorg-x11-drv-nvidia-cuda-libs.x86_64       3:565.77-3.fc41 rpmfusion-nonfree-updates
xorg-x11-drv-nvidia-kmodsrc.x86_64         3:565.77-3.fc41 rpmfusion-nonfree-updates
xorg-x11-drv-nvidia-libs.i686              3:565.77-3.fc41 rpmfusion-nonfree-updates
xorg-x11-drv-nvidia-libs.x86_64            3:565.77-3.fc41 rpmfusion-nonfree-updates
xorg-x11-drv-nvidia-power.x86_64           3:565.77-3.fc41 rpmfusion-nonfree-updates
xorg-x11-drv-nvidia-xorg-libs.x86_64       3:565.77-3.fc41 rpmfusion-nonfree-updates
(~) 4_> uname -a
Linux janw 6.12.5-200.fc41.x86_64 #1 SMP PREEMPT_DYNAMIC Sun Dec 15 16:48:23 UTC 2024 x86_64 GNU/Linux

I did some careful analysis of the journalctl logs from the boot session. It cannot start the nvidia-powerd, but that seems normal and innocent. All seems innocent until it says (using nvidia_drm.modeset=1)

Dec 21 15:36:33 fedora systemd[1]: nvidia-fallback.service - Fallback to nouveau as nvidia did not load was skipped because of an unmet condition check (ConditionPathExists=!/sys/module/nvidia).

Some nvidia related lines before are here. I cannot find a message claiming that the load failed, let alone why :frowning:

Dec 21 15:36:29 fedora kernel: nvidia-modeset: Loading NVIDIA Kernel Mode Setting Driver for UNIX platforms  565.77  Wed Nov 27 22:53:48 UTC 2024
...
Dec 21 15:36:32 fedora kernel: nvidia 0000:09:00.0: vgaarb: deactivate vga console
Dec 21 15:36:32 fedora kernel: fbcon: nvidia-drmdrmfb (fb0) is primary device
Dec 21 15:36:32 fedora kernel: nvidia 0000:09:00.0: [drm] fb0: nvidia-drmdrmfb frame buffer device

I can remote login. Is there anything I should examine that may help?

That said the fallback was skipped and is normal when nvidia loads.

Powerd is only supported on certain cards so failure is quite common and that is an information message, not an error.

I think that you might need the environment variable set.
Edit the file /etc/environment and add a line that reads GSK_RENDERER=ngl then reboot. If it is the cause, as it appears for some was the case, then it should boot normally. Potentially if you are using an older nvidia card it may need to be gl instead of ngl.

I see. I misread. It is telling me the fallback is not tried because /sys/module/nvidia is present (I think). Ok.

Thanks. Tried without, with ngl and with gl. It all does the same with a small difference: without I do get the splash screen briefly. These are N=1 tests though.

I see some discussion that switching back to Xorg might resolve this. I donā€™t like these work-arounds, but if necessary ā€¦ Could it work?

Hello,

Iā€™m experiencing a similar issue with an RTX 3000 series card. Based on what Iā€™ve observed so far:

  • When I connect my Samsung monitor via HDMI, it results in a black screen. However, when I use DisplayPort, it works perfectly.
  • When I connect my MSI monitor via HDMI, it only shows two refresh rate profiles, none of which include 144Hz. On the other hand, connecting via displayPort shows all the refresh rate profiles, including 144Hz.
  • Iā€™ve also tried different HDMI interface cables, but the results remain the same.

From my understanding, the problem can be related to NVIDIA drivers and Wayland not fully supporting EDID in all cases.

You might want to try using displayPort or switching to Xorg to potentially solve this issue. However, I havenā€™t tested the Xorg solution myself, nor have I tried different kernel versions. For now, Iā€™m just using my MSI monitor via displayPort.

Iā€™m sure skilled enthusiasts might have a solution for this, but as a newcomer to Linux and Fedora, Iā€™m still finding my way around. :sweat_smile:

āžœ  ~ inxi -Fzxx
System:
  Kernel: 6.12.5-200.fc41.x86_64 arch: x86_64 bits: 64 compiler: gcc
    v: 2.43.1-4.fc41
  Desktop: GNOME v: 47.2 tk: GTK v: 3.24.43 wm: gnome-shell dm: GDM
    Distro: Fedora Linux 41 (Workstation Edition)

Graphics:
  Device-1: NVIDIA GA102 [GeForce RTX 3080] vendor: Micro-Star MSI
    driver: nvidia v: 565.77 arch: Ampere pcie: speed: 5 GT/s lanes: 16 ports:
    active: none off: DP-2,HDMI-A-1 empty: DP-1,DP-3 bus-ID: 01:00.0
    chip-ID: 10de:2206
  Display: wayland server: X.org v: 1.21.1.14 with: Xwayland v: 24.1.4
    compositor: gnome-shell driver: gpu: nvidia,nvidia-nvswitch display-ID: 0
  Monitor-1: DP-2 model: MSI MAG342CQR res: 3440x1440 dpi: 111
    diag: 864mm (34")
  Monitor-2: HDMI-A-1 model: Samsung res: 3840x2160 dpi: 88
    diag: 1271mm (50.1")
  API: EGL v: 1.5 platforms: device: 0 drv: nvidia device: 2 drv: swrast
    gbm: drv: nvidia surfaceless: drv: nvidia wayland: drv: nvidia x11:
    drv: nvidia inactive: device-1
  API: OpenGL v: 4.6.0 compat-v: 4.5 vendor: nvidia mesa v: 565.77
    glx-v: 1.4 direct-render: yes renderer: NVIDIA GeForce RTX 3080/PCIe/SSE2
    display-ID: :0.0
  API: Vulkan v: 1.3.296 surfaces: xcb,xlib,wayland device: 0
    type: discrete-gpu driver: N/A device-ID: 10de:2206 device: 1 type: cpu
    driver: N/A device-ID: 10005:0000

1 Like

It never came to my mind that using displayport rather than hdmi could have any effect, but it does. It now boots fine :slight_smile: Thanks!

Youā€™re welcome! I also did a little research on EDID and found some interesting stuff, but I didnā€™t dive too deep into it. DisplayPort seems like a good solution for me too right now :slightly_smiling_face:ā€¦