I have been getting the following hardware errors. I asked GPT-OSS, which worried me even more. It said it could be hardware fault with the CPU, RAM, or chipset! Given the current hardware prices, needing to replace hardware sounds scary. Can someone help me understand the errors and figure out what is actually wrong?
Message from syslogd@systemname at May 23 21:17:25 ...
kernel:[Hardware Error]: System Fatal error.
Message from syslogd@systemname at May 23 21:17:25 ...
kernel:[Hardware Error]: CPU:1 (19:21:2) MC12_STATUS[Over|UE|MiscV|AddrV|PCC|SyndV|UECC|Deferred|Poison|Scrub]: 0xffffffffffffffff
Message from syslogd@systemname at May 23 21:17:25 ...
kernel:[Hardware Error]: Error Addr: 0x0000000000000000
Message from syslogd@systemname at May 23 21:17:25 ...
kernel:[Hardware Error]: IPID: 0x0000000000000000, Syndrome: 0x0000000000000000
Message from syslogd@systemname at May 23 21:17:25 ...
kernel:[Hardware Error]: Bank 12 is reserved.
Message from syslogd@systemname at May 23 21:17:25 ...
kernel:[Hardware Error]: cache level: L3/GEN, tx: RESV
Message from syslogd@systemname at May 26 08:10:54 ...
kernel:[Hardware Error]: System Fatal error.
Message from syslogd@systemname at May 26 08:10:54 ...
kernel:[Hardware Error]: CPU:1 (19:21:2) MC13_STATUS[Over|UE|MiscV|AddrV|PCC|SyndV|UECC|Deferred|Poison|Scrub]: 0xffffffffa6b2950b
Message from syslogd@systemname at May 26 08:10:54 ...
kernel:[Hardware Error]: Error Addr: 0x0000000000000000
Message from syslogd@systemname at May 26 08:10:54 ...
kernel:[Hardware Error]: IPID: 0x0000000000000000, Syndrome: 0x0000000000000000
Message from syslogd@systemname at May 26 08:10:54 ...
kernel:[Hardware Error]: Bank 13 is reserved.
Message from syslogd@systemname at May 26 08:10:54 ...
kernel:[Hardware Error]: cache level: L3/GEN, tx: GEN
Message from syslogd@systemname at May 26 18:38:57 ...
kernel:[Hardware Error]: System Fatal error.
Message from syslogd@systemname at May 26 18:38:57 ...
kernel:[Hardware Error]: CPU:1 (19:21:2) MC22_STATUS[Over|UE|MiscV|AddrV|PCC|SyndV|UECC|Deferred|Poison|Scrub]: 0xffffffffa4e13ca0
Message from syslogd@systemname at May 26 18:38:57 ...
kernel:[Hardware Error]: Error Addr: 0x0000000000000000
Message from syslogd@systemname at May 26 18:38:57 ...
kernel:[Hardware Error]: IPID: 0x0000000000000000, Syndrome: 0x0000000000000000
Message from syslogd@systemname at May 26 18:38:57 ...
kernel:[Hardware Error]: Bank 22 is reserved.
Message from syslogd@systemname at May 26 18:38:57 ...
kernel:[Hardware Error]: cache level: RESV, tx: INSN
Message from syslogd@systemname at Jun 3 23:28:58 ...
kernel:[Hardware Error]: System Fatal error.
Message from syslogd@systemname at Jun 3 23:28:58 ...
kernel:[Hardware Error]: CPU:1 (19:21:2) MC19_STATUS[Over|UE|MiscV|AddrV|PCC|SyndV|UECC|Deferred|Poison|Scrub]: 0xffffffff88d883e0
Message from syslogd@systemname at Jun 3 23:28:58 ...
kernel:[Hardware Error]: Error Addr: 0x0000000000000000
Message from syslogd@systemname at Jun 3 23:28:58 ...
kernel:[Hardware Error]: IPID: 0x0000000000000000, Syndrome: 0x0000000000000000
Message from syslogd@systemname at Jun 3 23:28:58 ...
kernel:[Hardware Error]: Bank 19 is reserved.
Message from syslogd@systemname at Jun 3 23:28:58 ...
kernel:[Hardware Error]: cache level: RESV, tx: INSN
The CPU is brand new (~2 months), everything else is a few yrs old. I’ve never had any kind of hardware/stability issues. My uptime is also generally pretty high.
My system info:
# inxi -MmCG
Machine:
Type: Desktop System: ASUS product: N/A v: N/A serial: N/A
Mobo: ASUSTeK model: TUF GAMING B550M-PLUS (WI-FI) v: Rev X.0x
serial: XXXXXXXX Firmware: UEFI vendor: American Megatrends v: 2806
date: 10/27/2022
Memory:
System RAM: total: 32 GiB available: 31.22 GiB used: 22.52 GiB (72.1%)
Array-1: capacity: 128 GiB slots: 4 modules: 4 EC: None
Device-1: DIMM_A1 type: DDR4 size: 8 GiB speed: 3200 MT/s
Device-2: DIMM_A2 type: DDR4 size: 8 GiB speed: 3200 MT/s
Device-3: DIMM_B1 type: DDR4 size: 8 GiB speed: 3200 MT/s
Device-4: DIMM_B2 type: DDR4 size: 8 GiB speed: 3200 MT/s
CPU:
Info: 8-core model: AMD Ryzen 7 5800X bits: 64 type: MT MCP cache: L2: 4 MiB
Speed (MHz): avg: 3882 min/max: 556/4854 cores: 1: 3882 2: 3882 3: 3882
4: 3882 5: 3882 6: 3882 7: 3882 8: 3882 9: 3882 10: 3882 11: 3882 12: 3882
13: 3882 14: 3882 15: 3882 16: 3882
Graphics:
Device-1: Intel DG2 [Arc A750] driver: i915 v: kernel
Display: x11 server: X.Org v: 21.1.22 with: Xwayland v: 24.1.11 driver: X:
loaded: modesetting dri: iris gpu: i915 resolution: 1: 1920x1080~60Hz
2: 2560x1440~60Hz
API: OpenGL v: 4.6 vendor: intel mesa v: 26.0.6 renderer: Mesa Intel Arc
A750 Graphics (DG2)
API: Vulkan v: 1.4.341 drivers: intel,llvmpipe surfaces: N/A
API: EGL Message: EGL data requires eglinfo. Check --recommends.
Info: Tools: api: glxinfo,vulkaninfo de: xfce4-display-settings
gpu: corectrl, gputop, intel_gpu_top, lsgpu x11: xdriinfo, xdpyinfo, xprop,
xrandr