Checking logs is always good advice, however if the machine is freezing so completely that the filesystem is corrupted, I wouldn’t be surprised if there’s nothing useful in the logs — whatever message might be output at the time of the freeze probably didn’t get a chance to get written to disk.
It sounds very much like a hardware issue, quite possibly overheating. Several Amazon users have reported overheating issues with these models, on Fedora specifically.
One thing I’d suggest, if you haven’t already, is to install and configure the lm_sensors monitoring package, along with its logging daemon.
# 1. Install the packages
$ sudo dnf install lm_sensors lm_sensors-sensord
# 2. Interactively configure the necessary drivers
$ sudo sensors-detect
# 3. Enable and start the logging daemon
$ sudo systemctl enable sensord
$ sudo systemctl start sensord
Before step 3, you might want to edit
/etc/sysconfig/sensord and lower the
LOG_INTERVAL from the default
20m to something like
5m (or even
1m, temporarily), to have a better chance at capturing a picture of the hardware state shortly before a freeze.
After step 2, you’ll know the sensors are configured correctly if the
sensors command outputs at least somewhat useful data. What you actually get can vary wildly. On my two machines, I get:
Adapter: ISA adapter
Package id 0: +38.0°C (high = +80.0°C, crit = +99.0°C)
Core 0: +38.0°C (high = +80.0°C, crit = +99.0°C)
Core 1: +36.0°C (high = +80.0°C, crit = +99.0°C)
Core 2: +34.0°C (high = +80.0°C, crit = +99.0°C)
Core 3: +37.0°C (high = +80.0°C, crit = +99.0°C)
Adapter: PCI adapter
Core0 Temp: +27.0°C
Core0 Temp: +19.0°C
Core1 Temp: +36.0°C
Core1 Temp: +23.0°C
Adapter: ACPI interface
temp1: +13.0°C (crit = +70.0°C)
Adapter: ISA adapter
in0: +1.10 V (min = +0.00 V, max = +0.00 V) ALARM
in1: +2.50 V (min = +0.00 V, max = +4.08 V)
in2: +1.79 V (min = +0.00 V, max = +4.08 V)
in3: +3.36 V (min = +0.00 V, max = +4.08 V)
in4: +3.02 V (min = +0.00 V, max = +4.08 V)
in5: +1.17 V (min = +0.00 V, max = +4.08 V)
in6: +2.93 V (min = +0.00 V, max = +4.08 V)
in7: +3.02 V (min = +0.00 V, max = +2.03 V) ALARM
Vbat: +2.05 V
fan1: 1824 RPM (min = 0 RPM)
fan2: 883 RPM (min = 0 RPM)
temp1: +13.0°C (low = +127.0°C, high = +65.0°C) sensor = thermal diode
temp2: +34.0°C (low = +127.0°C, high = +127.0°C) sensor = thermistor
temp3: +25.0°C (low = +127.0°C, high = +127.0°C) sensor = thermistor
The former machine has an unsupported hardware monitoring chip, so all I get is the CPU thermal monitoring. The latter machine uses an it87 hardware sensor, so I get a wealth of readouts (even if the ranges aren’t properly configured to monitor them).
If the machine is overheating, you may be able to install and use the
cpupower command to increase the CPU throttling. Or, there may be options in the BIOS to adjust how reactive the system is to thermal load.