Checking logs is always good advice, however if the machine is freezing so completely that the filesystem is corrupted, I wouldn’t be surprised if there’s nothing useful in the logs — whatever message might be output at the time of the freeze probably didn’t get a chance to get written to disk.
It sounds very much like a hardware issue, quite possibly overheating. Several Amazon users have reported overheating issues with these models, on Fedora specifically.
One thing I’d suggest, if you haven’t already, is to install and configure the lm_sensors monitoring package, along with its logging daemon.
# 1. Install the packages
$ sudo dnf install lm_sensors lm_sensors-sensord
# 2. Interactively configure the necessary drivers
$ sudo sensors-detect
# 3. Enable and start the logging daemon
$ sudo systemctl enable sensord
$ sudo systemctl start sensord
Before step 3, you might want to edit /etc/sysconfig/sensord
and lower the LOG_INTERVAL
from the default 20m
to something like 5m
(or even 2m
or 1m
, temporarily), to have a better chance at capturing a picture of the hardware state shortly before a freeze.
After step 2, you’ll know the sensors are configured correctly if the sensors
command outputs at least somewhat useful data. What you actually get can vary wildly. On my two machines, I get:
$ sensors
coretemp-isa-0000
Adapter: ISA adapter
Package id 0: +38.0°C (high = +80.0°C, crit = +99.0°C)
Core 0: +38.0°C (high = +80.0°C, crit = +99.0°C)
Core 1: +36.0°C (high = +80.0°C, crit = +99.0°C)
Core 2: +34.0°C (high = +80.0°C, crit = +99.0°C)
Core 3: +37.0°C (high = +80.0°C, crit = +99.0°C)
# vs
$ sensors
k8temp-pci-00c3
Adapter: PCI adapter
Core0 Temp: +27.0°C
Core0 Temp: +19.0°C
Core1 Temp: +36.0°C
Core1 Temp: +23.0°C
acpitz-acpi-0
Adapter: ACPI interface
temp1: +13.0°C (crit = +70.0°C)
it8716-isa-0290
Adapter: ISA adapter
in0: +1.10 V (min = +0.00 V, max = +0.00 V) ALARM
in1: +2.50 V (min = +0.00 V, max = +4.08 V)
in2: +1.79 V (min = +0.00 V, max = +4.08 V)
in3: +3.36 V (min = +0.00 V, max = +4.08 V)
in4: +3.02 V (min = +0.00 V, max = +4.08 V)
in5: +1.17 V (min = +0.00 V, max = +4.08 V)
in6: +2.93 V (min = +0.00 V, max = +4.08 V)
in7: +3.02 V (min = +0.00 V, max = +2.03 V) ALARM
Vbat: +2.05 V
fan1: 1824 RPM (min = 0 RPM)
fan2: 883 RPM (min = 0 RPM)
temp1: +13.0°C (low = +127.0°C, high = +65.0°C) sensor = thermal diode
temp2: +34.0°C (low = +127.0°C, high = +127.0°C) sensor = thermistor
temp3: +25.0°C (low = +127.0°C, high = +127.0°C) sensor = thermistor
intrusion0: ALARM
The former machine has an unsupported hardware monitoring chip, so all I get is the CPU thermal monitoring. The latter machine uses an it87 hardware sensor, so I get a wealth of readouts (even if the ranges aren’t properly configured to monitor them).
If the machine is overheating, you may be able to install and use the cpupower
command to increase the CPU throttling. Or, there may be options in the BIOS to adjust how reactive the system is to thermal load.