Hardware error notification with freshly installed Fedora 36 KDE on T14s Gen 3 AMD

Hi, I got every 5 min or so a error message (with loud beep) in the notification area, any idea what can happen ? I have run a comprehensive memory test from the BIOS (5 hours…) and all is green.

My system is

Operating System: Fedora Linux 36
KDE Plasma Version: 5.25.3
KDE Frameworks Version: 5.96.0
Qt Version: 5.15.3
Kernel Version: 5.18.11-200.fc36.x86_64 (64-bit)
Graphics Platform: Wayland
Processors: 16 × AMD Ryzen 7 PRO 6850U with Radeon Graphics
Memory: 30.0 GiB of RAM
Graphics Processor: AMD YELLOW_CARP
Manufacturer: LENOVO
Product Name: 21CQ000GUS
System Version: ThinkPad T14s Gen 3

Message is

Message from syslogd@fedora at Jul 16 15:14:39 ...
kernel:[Hardware Error]: CPU:0 (19:44:1) MC15_STATUS[Over|CE|MiscV|AddrV|-|-|SyndV|CECC|-|-|-]: 0xdc204000000c011b
Message from syslogd@fedora at Jul 16 15:14:39 ...
kernel:[Hardware Error]: Error Addr: 0x00000001fc880040
Message from syslogd@fedora at Jul 16 15:14:39 ...
kernel:[Hardware Error]: IPID: 0x0000009600050f00, Syndrome: 0x000001ff0a240701
Message from syslogd@fedora at Jul 16 15:14:39 ...
kernel:[Hardware Error]: Unified Memory Controller Ext. Error Code: 12
Message from syslogd@fedora at Jul 16 15:14:39 ...
kernel:[Hardware Error]: cache level: L3/GEN, tx: GEN, mem-tx: RD
Message from syslogd@fedora at Jul 16 15:14:39 ...
kernel:[Hardware Error]: Corrected error, no action required.
Message from syslogd@fedora at Jul 16 15:14:39 ...
kernel:[Hardware Error]: CPU:0 (19:44:1) MC16_STATUS[Over|CE|MiscV|AddrV|-|-|SyndV|CECC|-|-|-]: 0xdc204000000c011b
Message from syslogd@fedora at Jul 16 15:14:39 ...
kernel:[Hardware Error]: Error Addr: 0x00000001fef81240
Message from syslogd@fedora at Jul 16 15:14:39 ...
kernel:[Hardware Error]: IPID: 0x0000009600150f00, Syndrome: 0x000001ff0a240701
Message from syslogd@fedora at Jul 16 15:14:39 ...
kernel:[Hardware Error]: Unified Memory Controller Ext. Error Code: 12
Message from syslogd@fedora at Jul 16 15:14:39 ...
kernel:[Hardware Error]: cache level: L3/GEN, tx: GEN, mem-tx: RD
Message from syslogd@fedora at Jul 16 15:14:39 ...
kernel:[Hardware Error]: Corrected error, no action required.
Message from syslogd@fedora at Jul 16 15:14:39 ...
kernel:[Hardware Error]: CPU:0 (19:44:1) MC17_STATUS[Over|CE|MiscV|AddrV|-|-|SyndV|CECC|-|-|-]: 0xdc204000000c011b
Message from syslogd@fedora at Jul 16 15:14:39 ...
kernel:[Hardware Error]: Error Addr: 0x00000001ff280040
Message from syslogd@fedora at Jul 16 15:14:39 ...
kernel:[Hardware Error]: IPID: 0x0000009600250f00, Syndrome: 0x000001ff0a240701
Message from syslogd@fedora at Jul 16 15:14:39 ...
kernel:[Hardware Error]: Unified Memory Controller Ext. Error Code: 12
Message from syslogd@fedora at Jul 16 15:14:39 ...
kernel:[Hardware Error]: cache level: L3/GEN, tx: GEN, mem-tx: RD
Message from syslogd@fedora at Jul 16 15:14:39 ...
kernel:[Hardware Error]: Corrected error, no action required.
Message from syslogd@fedora at Jul 16 15:14:39 ...
kernel:[Hardware Error]: CPU:0 (19:44:1) MC18_STATUS[Over|CE|MiscV|AddrV|-|-|SyndV|CECC|-|-|-]: 0xdc204000000c011b
Message from syslogd@fedora at Jul 16 15:14:39 ...
kernel:[Hardware Error]: Error Addr: 0x00000001efcfe380
Message from syslogd@fedora at Jul 16 15:14:39 ...
kernel:[Hardware Error]: IPID: 0x0000009600350f00, Syndrome: 0x000001ff0a240700
Message from syslogd@fedora at Jul 16 15:14:39 ...
kernel:[Hardware Error]: Unified Memory Controller Ext. Error Code: 12
Message from syslogd@fedora at Jul 16 15:14:39 ...
kernel:[Hardware Error]: cache level: L3/GEN, tx: GEN, mem-tx: RD

Those appear to be memory or cpu errors – L3 cache

What is the cpu temp?
Have these errors appeared before?
please post the output of inxi -Fzx inside the </> Preformatted text tags from the toolbar above your text entry screen.
Please also run memtest86 on the system to narrow down the error location.

A search for the Syndrome: portion of the error shows several entries about the lenovo thinkpad t14 and z16 series with similar errors.
English Community-Lenovo Community is one of those as is this one https://www.reddit.com/r/thinkpad/comments/vsqlb4/thinkpad_z16_and_linux/

thx very much, you have probably nailed it ! I read all the posts you have referenced. Below is the inxi output.
The problem occurs from the first time I installed Fedora 36 KDE.
How can I check the CPU temp ? output of the fan is cold.
Cant install memtest86 using sudo dnf install memtest86…

This morning I installed latest patches from Fedora and for a while (1 hour for sure) , I did not get back these errors. However, later the machine shutdown failed (stucked), and restarting the t14s leads me to the display of the background image, but no KDE menu… I had to forced shutdown (ctrl-alt tab F2, logging as root and typing “reboot” did not work !!), and then it was OK, but the deaded error reappeared…

So I face a very difficult problem here : should I return the T14s ?? Either

  • I wait for a working patch for firmware, but it can be very long (month, year ???)
  • i return the laptop, but it was otherwise perfect !!! Superb keyboard, screen adequate for me, light, etc. Intel version of t14s are almost 2x the price and the 12th gen are known to be ridiculously hot (toaster…).

What would you do ? I thought about System76 lemur pro, but very expensive, average keyboard with ridiculously small arrow key, POP OS only ( I hate Gnome) and the same *itty Intel 12th gen…

System:
  Kernel: 5.18.11-200.fc36.x86_64 arch: x86_64 bits: 64 compiler: gcc
    v: 2.37-27.fc36 Desktop: KDE Plasma v: 5.25.3
    Distro: Fedora release 36 (Thirty Six)
Machine:
  Type: Laptop System: LENOVO product: 21CQ000GUS v: ThinkPad T14s Gen 3
    serial: <superuser required>
  Mobo: LENOVO model: 21CQ000GUS v: ThinkPad serial: <superuser required>
    UEFI: LENOVO v: R22ET46W (1.16 ) date: 05/28/2022
Battery:
  ID-1: BAT0 charge: 58.4 Wh (100.0%) condition: 58.4/57.0 Wh (102.5%)
    volts: 17.4 min: 15.4 model: SMP LNV-5B10W51875�� status: full
CPU:
  Info: 8-core model: AMD Ryzen 7 PRO 6850U with Radeon Graphics bits: 64
    type: MT MCP arch: Zen 3+ rev: 1 cache: L1: 512 KiB L2: 4 MiB L3: 16 MiB
  Speed (MHz): avg: 1371 high: 2670 min/max: 400/4768 boost: enabled cores:
    1: 1186 2: 1186 3: 1186 4: 1185 5: 2670 6: 2669 7: 1186 8: 1186 9: 1186
    10: 1186 11: 1186 12: 1186 13: 1186 14: 1186 15: 1186 16: 1186
    bogomips: 86240
  Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm
Graphics:
  Device-1: AMD Rembrandt [Radeon 680M] vendor: Lenovo driver: amdgpu
    v: kernel arch: RDNA 2 bus-ID: 33:00.0
  Device-2: IMC Networks Integrated Camera type: USB driver: uvcvideo
    bus-ID: 5-1:2
  Display: wayland server: X.org v: 1.20.14 with: Xwayland v: 22.1.3
    compositor: kwin_wayland driver: X: loaded: amdgpu
    unloaded: fbdev,modesetting,vesa gpu: amdgpu resolution: 1920x1200
  OpenGL:
    renderer: AMD YELLOW_CARP (LLVM 14.0.0 DRM 3.46 5.18.11-200.fc36.x86_64)
    v: 4.6 Mesa 22.1.3 direct render: Yes
Audio:
  Device-1: AMD Rembrandt Radeon High Definition Audio vendor: Lenovo
    driver: snd_hda_intel v: kernel bus-ID: 33:00.1
  Device-2: AMD ACP/ACP3X/ACP6x Audio Coprocessor vendor: Lenovo
    driver: snd_pci_acp6x v: kernel bus-ID: 33:00.5
  Device-3: AMD Family 17h/19h HD Audio vendor: Lenovo
    driver: snd_hda_intel v: kernel bus-ID: 33:00.6
  Sound Server-1: ALSA v: k5.18.11-200.fc36.x86_64 running: yes
  Sound Server-2: PulseAudio v: 15.0 running: no
  Sound Server-3: PipeWire v: 0.3.55 running: yes
Network:
  Device-1: Qualcomm QCNFA765 Wireless Network Adapter vendor: Lenovo
    driver: ath11k_pci v: kernel bus-ID: 01:00.0
  IF: wlp1s0 state: up mac: <filter>
Bluetooth:
  Device-1: USI type: USB driver: btusb v: 0.8 bus-ID: 1-3.1:3
  Report: rfkill ID: hci0 rfk-id: 1 state: up address: see --recommends
Drives:
  Local Storage: total: 1.86 TiB used: 6.5 GiB (0.3%)
  ID-1: /dev/nvme0n1 vendor: Samsung model: MZVL22T0HBLB-00BL7
    size: 1.86 TiB temp: 40.9 C
Partition:
  ID-1: / size: 1.86 TiB used: 6.26 GiB (0.3%) fs: btrfs dev: /dev/dm-0
    mapped: luks-af1699eb-e51e-4c83-916e-5b34c6f998ac
  ID-2: /boot size: 973.4 MiB used: 230.7 MiB (23.7%) fs: ext4
    dev: /dev/nvme0n1p2
  ID-3: /boot/efi size: 598.8 MiB used: 14 MiB (2.3%) fs: vfat
    dev: /dev/nvme0n1p1
  ID-4: /home size: 1.86 TiB used: 6.26 GiB (0.3%) fs: btrfs dev: /dev/dm-0
    mapped: luks-af1699eb-e51e-4c83-916e-5b34c6f998ac
Swap:
  ID-1: swap-1 type: zram size: 8 GiB used: 0 KiB (0.0%) dev: /dev/zram0
Sensors:
  System Temperatures: cpu: 43.0 C mobo: N/A gpu: amdgpu temp: 43.0 C
  Fan Speeds (RPM): cpu: 65535 fan-1: 0 fan-2:
Info:
  Processes: 397 Uptime: 2m Memory: 30.02 GiB used: 1.84 GiB (6.1%)
  Init: systemd target: graphical (5) Compilers: gcc: 12.1.1 Packages: N/A
  note: see --pkg Shell: Bash v: 5.1.16 inxi: 3.3.19

Lenovo is aware of this issue and they will fix it in a future BIOS update. English Community-Lenovo Community

I have the same issue with a different Lenovo AMD 6000 laptop (Yoga 7 14ARB7). Apparently this affects a lot of Lenovo laptops with specific Micron DDR5 RAM.

1 Like

excellent news ! thx !