Hello everyone! I’ll describe the issue right away.
There is Fedora 39 installed on a PC and I recently installed the latest kernel, 6.8.11. After the installation is complete I reboot and the kernel doesn’t load, basically after the Grub screen the monitor shows only the loading wording and also the disk activity led gives no signs. I reinstall the kernel but the problem persists. Doing some research I don’t find much.
Then I boot the latest working kernel and run the command sudo dmesg | grep -i failed and get this output:
[ 7.035696] nvidia: module verification failed: signature and/or required key missing - tainting kernel
[ 12.617748] BTRFS warning (device nvme0n1p2): csum failed root 256 ino 6391800 off 36618240 csum 0xe8ad850d expected csum 0x75be784b mirror 1
[ 12.617772] BTRFS warning (device nvme0n1p2): csum failed root 256 ino 6391800 off 36622336 csum 0x703efc6e expected csum 0xa68f5e81 mirror 1
[ 12.617781] BTRFS warning (device nvme0n1p2): csum failed root 256 ino 6391800 off 36626432 csum 0xb202d783 expected csum 0x9d56c38b mirror 1
[ 12.617789] BTRFS warning (device nvme0n1p2): csum failed root 256 ino 6391800 off 36630528 csum 0xe5548687 expected csum 0x3aaa0e25 mirror 1
[ 46.505396] BTRFS warning (device nvme0n1p2): csum failed root 256 ino 6391800 off 36700160 csum 0xe8ad850d expected csum 0x75be784b mirror 1
[ 46.505409] BTRFS warning (device nvme0n1p2): csum failed root 256 ino 6391800 off 36704256 csum 0x703efc6e expected csum 0xa68f5e81 mirror 1
[ 46.505416] BTRFS warning (device nvme0n1p2): csum failed root 256 ino 6391800 off 36708352 csum 0xb202d783 expected csum 0x9d56c38b mirror 1
[ 46.505421] BTRFS warning (device nvme0n1p2): csum failed root 256 ino 6391800 off 36712448 csum 0xe5548687 expected csum 0x3aaa0e25 mirror 1
[ 46.505673] BTRFS warning (device nvme0n1p2): csum failed root 256 ino 6391800 off 36700160 csum 0xe8ad850d expected csum 0x75be784b mirror 1
[ 46.505684] BTRFS warning (device nvme0n1p2): csum failed root 256 ino 6391800 off 36704256 csum 0x703efc6e expected csum 0xa68f5e81 mirror 1
[ 46.505691] BTRFS warning (device nvme0n1p2): csum failed root 256 ino 6391800 off 36708352 csum 0xb202d783 expected csum 0x9d56c38b mirror 1
[ 46.505697] BTRFS warning (device nvme0n1p2): csum failed root 256 ino 6391800 off 36712448 csum 0xe5548687 expected csum 0x3aaa0e25 mirror 1
[ 46.505936] BTRFS warning (device nvme0n1p2): csum failed root 256 ino 6391800 off 36700160 csum 0xe8ad850d expected csum 0x75be784b mirror 1
[ 46.505943] BTRFS warning (device nvme0n1p2): csum failed root 256 ino 6391800 off 36704256 csum 0x703efc6e expected csum 0xa68f5e81 mirror 1
From this I deduce that I have problems with the disk or the Btrfs filesystem. I do some more research and run some more terminal commands:
sudo btrfs device stats /
Output:
[/dev/nvme0n1p2].write_io_errs 0
[/dev/nvme0n1p2].read_io_errs 0
[/dev/nvme0n1p2].flush_io_errs 0
[/dev/nvme0n1p2].corruption_errs 20593
[/dev/nvme0n1p2].generation_errs 0
sudo btrfs scrub start -B /
Output:
Starting scrub on devid 1
scrub done for fa7f4375-9aca-4c2c-a122-8a0abaa53c18
Scrub started: Sun Jun 2 11:50:42 2024
Status: finished
Duration: 0:02:01
Total to scrub: 39.30GiB
Rate: 332.61MiB/s
Error summary: csum=253252
Corrected: 253232
Uncorrectable: 20
Unverified: 0
ERROR: there are 1 uncorrectable errors
- From live Fedora
sudo btrfs check /dev/nvme0n1p2
Output:
Opening filesystem to check...
Checking filesystem on /dev/nvme0n1p2
UUID: fa7f4375-9aca-4c2c-a122-8a0abaa53c18
[1/7] checking root items
[2/7] checking extents
[3/7] checking free space tree
[4/7] checking fs roots
[5/7] checking only csums items (without verifying data)
[6/7] checking root refs
[7/7] checking quota groups skipped (not enabled on this FS)
found 39998722048 bytes used, no error found
total csum bytes: 36739452
total tree bytes: 2296004608
total fs tree bytes: 2134851584
total extent tree bytes: 104857600
btree space waste bytes: 483752904
file data blocks allocated: 234120581120
referenced 88654282752
- From live Fedora
sudo btrfs check -p --readonly --check-data-csum /dev/nvme0n1p2
Output:
Opening filesystem to check...
Checking filesystem on /dev/nvme0n1p2
UUID: fa7f4375-9aca-4c2c-a122-8a0abaa53c18
[1/7] checking root items (0:00:01 elapsed, 937917 items checked)
[2/7] checking extents (0:00:06 elapsed, 131284 items checked)
[3/7] checking free space tree (0:00:00 elapsed, 53 items checked)
[4/7] checking fs roots (0:00:07 elapsed, 120988 items checked)
mirror 1 bytenr 12749418496 csum 0xe8ad850d expected csum 0x75be784b996 items checked)
mirror 1 bytenr 12749422592 csum 0x703efc6e expected csum 0xa68f5e81
mirror 1 bytenr 12749426688 csum 0xb202d783 expected csum 0x9d56c38b
mirror 1 bytenr 12749430784 csum 0xe5548687 expected csum 0x3aaa0e25
mirror 1 bytenr 20133556224 csum 0x9be6b08c expected csum 0xa1912793623 items checked)
mirror 1 bytenr 20133560320 csum 0x1631ae43 expected csum 0x45bad2e7
mirror 1 bytenr 20133564416 csum 0xa1567b66 expected csum 0x9d52b848
mirror 1 bytenr 20133568512 csum 0xd199cb0a expected csum 0x5d2a2c8c
mirror 1 bytenr 20133572608 csum 0x0166da36 expected csum 0x6cccacdb
mirror 1 bytenr 20133576704 csum 0xec5680d3 expected csum 0x8b348092
mirror 1 bytenr 20133580800 csum 0x22c9df3a expected csum 0x98ef5185
mirror 1 bytenr 20133584896 csum 0x7eb0913e expected csum 0xa4f57dfa
mirror 1 bytenr 27345760256 csum 0xf38bbef5 expected csum 0x055f2878553 items checked)
mirror 1 bytenr 27345764352 csum 0xe5fb6267 expected csum 0x6ade60b2
mirror 1 bytenr 27345768448 csum 0x1b64811b expected csum 0xf5aac402
mirror 1 bytenr 27345772544 csum 0x26c53775 expected csum 0x9a100b20
mirror 1 bytenr 27345776640 csum 0x6d9fba29 expected csum 0xa02c129c
mirror 1 bytenr 27345780736 csum 0xa256887e expected csum 0x94099a0a
mirror 1 bytenr 27345784832 csum 0x3d538e26 expected csum 0x8e7a2967
mirror 1 bytenr 27345788928 csum 0xfa0c1ed5 expected csum 0xb2bb6f52
[5/7] checking csums against data (0:01:44 elapsed, 265872 items checked)
ERROR: errors found in csum tree
[6/7] checking root refs (0:00:00 elapsed, 11 items checked)
[7/7] checking quota groups skipped (not enabled on this FS)
found 39648686080 bytes used, error(s) found
total csum bytes: 36539540
total tree bytes: 2150678528
total fs tree bytes: 1989967872
total extent tree bytes: 104415232
btree space waste bytes: 454815130
file data blocks allocated: 224884305920
referenced 86042050560
From these outputs I think there is some problem with the Btrfs filesystem on the NVMe.
Are these problems serious? Can they compromise the data? Do they indicate that the NVMe is corrupted? Could they be the cause of the kernel not booting?
I also created on the same disk a new Btrfs partition of the same size and run the same commands to compare the outputs and find the same errors. Apparently no errors are being reported, so perhaps I can rule out possible NVMe hardware problems.
Could it just be related to the single partition on which Fedora is installed?
Finally, I realized that I occasionally experience boot problems even with the 6.8.10 kernel running: basically, once I boot the system crashes before GDM appears and I can only interact with the tty shell. I have to reboot in order to access the system with the Gnome shell working.
Can anyone help me solve it?