My fedora 31 system running an NVIDIA GPU failed recently after partially running for several months. I was unable to find the problem source - it occurred when I rebooted and may have been triggered by a system update. Thanks for any help, I know this is messy.
My installation had used the rpmfusion howto and included cuda11.0. So I tried to uninstall nvidia and start over, again using the rpmfusion howto.
I find the GPU
bash-5.0# /sbin/lspci | grep -e VGA
00:02.0 VGA compatible controller: Intel Corporation UHD Graphics 620 (rev 07)
09:00.0 VGA compatible controller: NVIDIA Corporation GP107 [GeForce GTX 1050] (rev a1)
There is some evidence of the nvidia installation.
modinfo -F version nvidia
455.38
The kernel-devel modules are installed.
I disabled my epel repo since it seemed to have some conflicts with rpmfusion. My working repolist is
repo id repo name
adobe-linux-x86_64 Adobe Systems Incorporated
cernbox CERNBOX2.0 client repository (EOS backend)
copr:copr.fedorainfracloud.org:abn:throttled Copr repo for throttled owned by abn
cuda cuda
eos-fc-30 fc-30 EOS release from EOS project
fedora Fedora 31 - x86_64
fedora-modular Fedora Modular 31 - x86_64
google-chrome google-chrome
google-chrome-64 google-chrome - 64-bit
google-chrome-unstable google-chrome-unstable
rpmfusion-free RPM Fusion for Fedora 31 - Free
rpmfusion-free-updates RPM Fusion for Fedora 31 - Free - Updates
rpmfusion-nonfree-nvidia-driver RPM Fusion for Fedora 31 - Nonfree - NVIDIA Driver
skype-stable skype (stable)
updates Fedora 31 - x86_64 - Updates
updates-modular Fedora Modular 31 - x86_64 - Updates
I can no longer install the akmod for nvidia.
dnf install akmod-nvidia
Last metadata expiration check: 1:42:07 ago on Sun 22 Nov 2020 09:21:01 AM PST.
Error:
Problem: package nvidia-kmod-common-3:455.45.01-1.el8.noarch requires nvidia-driver = 3:455.45.01, but none of the providers can be installed
- package akmod-nvidia-3:455.38-1.fc31.x86_64 requires nvidia-kmod-common >= 3:455.38, but none of the providers can be installed
- package nvidia-driver-3:455.45.01-1.el8.x86_64 requires dnf-plugin-nvidia, but none of the providers can be installed
- conflicting requests
- package dnf-plugin-nvidia-1.1-1.el8.noarch is filtered out by modular filtering
- nothing provides python(abi) = 3.6 needed by dnf-plugin-nvidia-1.1-1.el8.noarch
- package dnf-plugin-nvidia-1.6-1.el8.noarch is filtered out by modular filtering
- nothing provides python(abi) = 3.6 needed by dnf-plugin-nvidia-1.6-1.el8.noarch
- package dnf-plugin-nvidia-1.9-1.el8.noarch is filtered out by modular filtering
- nothing provides python(abi) = 3.6 needed by dnf-plugin-nvidia-1.9-1.el8.noarch
- nothing provides python(abi) = 3.6 needed by dnf-plugin-nvidia-2.0-1.el8.noarch
- package xorg-x11-drv-nvidia-3:455.38-1.fc31.x86_64 is filtered out by modular filtering
A possible clue for why Iâve lost contact with the GTX1050 gpu is a message in /var/log/messages ( which only runs on shutdown)
Nov 21 18:55:48 lenolin systemd[1]: Stopped Create Volatile Files and Directories.
Nov 21 18:55:49 lenolin dracut-initramfs-restore[2448]: /usr/sbin/setfiles: conflicting specifications for /run/initramfs/etc/modprobe.d/nvidia-installer-disable-nouveau.conf and /run/initramfs/usr/lib/modprobe.d/nvidia-installer-disable-nouveau.conf, using system_u:object_r:lib_t:s0.
Nov 21 18:55:49 lenolin systemd[1]: dracut-shutdown.service: Succeeded.
Nov 21 18:55:49 lenolin systemd[1]: Stopped Restore /run/initramfs on shutdown.
Anything to be gained by upgrading to fed33?