Have just swapped the SATA cable with a brand new one, and used a different port on the motherboard side. A few minutes into the backup job, dmesg
shows the same error as before:
[ 311.481330] ata3.00: sense data available but port frozen
[ 311.481340] ata3.00: exception Emask 0x11 SAct 0x10000 SErr 0x6c0100 action 0x6 frozen
[ 311.481344] ata3.00: irq_stat 0x48000008, interface fatal error
[ 311.481347] ata3: SError: { UnrecovData CommWake 10B8B BadCRC Handshk }
[ 311.481351] ata3.00: failed command: READ FPDMA QUEUED
[ 311.481353] ata3.00: cmd 60/00:80:e8:c5:30/01:00:30:00:00/40 tag 16 ncq dma 131072 in
res 43/84:01:00:00:00/00:00:00:00:00/00 Emask 0x10 (ATA bus error)
[ 311.481360] ata3.00: status: { DRDY SENSE ERR }
[ 311.481362] ata3.00: error: { ICRC ABRT }
[ 311.481371] ata3: hard resetting link
[ 311.955593] ata3: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
[ 311.959495] ata3.00: configured for UDMA/133
[ 311.969672] sd 2:0:0:0: [sda] tag#16 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=0s
[ 311.969681] sd 2:0:0:0: [sda] tag#16 Sense Key : Aborted Command [current]
[ 311.969687] sd 2:0:0:0: [sda] tag#16 Add. Sense: Scsi parity error
[ 311.969692] sd 2:0:0:0: [sda] tag#16 CDB: Read(16) 88 00 00 00 00 00 30 30 c5 e8 00 00 01 00 00 00
[ 311.969695] I/O error, dev sda, sector 808502760 op 0x0:(READ) flags 0x80700 phys_seg 2 prio class 2
[ 311.969717] ata3: EH complete
This is the output of sudo smartctl -x /dev/sda
:
smartctl 7.4 2023-08-01 r5530 [x86_64-linux-6.11.7-200.fc40.x86_64] (local build)
Copyright (C) 2002-23, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Device Model: ST4000VX016-3CV104
Serial Number: WW625N7N
LU WWN Device Id: 5 000c50 0f2872f34
Firmware Version: CV10
User Capacity: 4.000.787.030.016 bytes [4,00 TB]
Sector Sizes: 512 bytes logical, 4096 bytes physical
Rotation Rate: 5400 rpm
Form Factor: 3.5 inches
Device is: Not in smartctl database 7.3/5528
ATA Version is: ACS-3 T13/2161-D revision 5
SATA Version is: SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is: Sun Nov 17 16:59:46 2024 CET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is: Unavailable
APM feature is: Unavailable
Rd look-ahead is: Enabled
Write cache is: Enabled
DSN feature is: Unavailable
ATA Security is: Disabled, NOT FROZEN [SEC1]
Wt Cache Reorder: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status: (0x00) Offline data collection activity
was never started.
Auto Offline Data Collection: Disabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: ( 0) seconds.
Offline data collection
capabilities: (0x73) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Suspend Offline collection upon new
command.
No Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 1) minutes.
Extended self-test routine
recommended polling time: ( 457) minutes.
Conveyance self-test routine
recommended polling time: ( 2) minutes.
SCT capabilities: (0x70bd) SCT Status supported.
SCT Error Recovery Control supported.
SCT Feature Control supported.
SCT Data Table supported.
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAGS VALUE WORST THRESH FAIL RAW_VALUE
1 Raw_Read_Error_Rate POSR-- 082 064 006 - 141584184
3 Spin_Up_Time PO---- 096 095 000 - 0
4 Start_Stop_Count -O--CK 100 100 020 - 74
5 Reallocated_Sector_Ct PO--CK 100 100 010 - 0
7 Seek_Error_Rate POSR-- 070 060 045 - 11015804
9 Power_On_Hours -O--CK 100 100 000 - 724
10 Spin_Retry_Count PO--C- 100 100 097 - 0
12 Power_Cycle_Count -O--CK 100 100 020 - 67
183 Runtime_Bad_Block -O--CK 096 096 000 - 4
184 End-to-End_Error -O--CK 100 100 099 - 0
187 Reported_Uncorrect -O--CK 100 100 000 - 0
188 Command_Timeout -O--CK 100 098 000 - 42950393879
189 High_Fly_Writes -O-RCK 100 100 000 - 0
190 Airflow_Temperature_Cel -O---K 068 061 040 - 32 (Min/Max 18/32)
191 G-Sense_Error_Rate -O--CK 100 100 000 - 0
192 Power-Off_Retract_Count -O--CK 100 100 000 - 8
193 Load_Cycle_Count -O--CK 100 100 000 - 186
194 Temperature_Celsius -O---K 032 040 000 - 32 (0 18 0 0 0)
195 Hardware_ECC_Recovered -O-RC- 082 064 000 - 141584184
197 Current_Pending_Sector -O--C- 100 100 000 - 0
198 Offline_Uncorrectable ----C- 100 100 000 - 0
199 UDMA_CRC_Error_Count -OSRCK 200 200 000 - 68
240 Head_Flying_Hours ------ 100 253 000 - 662 (209 111 0)
241 Total_LBAs_Written ------ 100 253 000 - 19250142579
242 Total_LBAs_Read ------ 100 253 000 - 91161825597
||||||_ K auto-keep
|||||__ C event count
||||___ R error rate
|||____ S speed/performance
||_____ O updated online
|______ P prefailure warning
General Purpose Log Directory Version 1
SMART Log Directory Version 1 [multi-sector log support]
Address Access R/W Size Description
0x00 GPL,SL R/O 1 Log Directory
0x01 SL R/O 1 Summary SMART error log
0x02 SL R/O 5 Comprehensive SMART error log
0x03 GPL R/O 5 Ext. Comprehensive SMART error log
0x04 GPL,SL R/O 8 Device Statistics log
0x06 SL R/O 1 SMART self-test log
0x07 GPL R/O 1 Extended self-test log
0x08 GPL R/O 2 Power Conditions log
0x09 SL R/W 1 Selective self-test log
0x0c GPL R/O 2048 Pending Defects log
0x10 GPL R/O 1 NCQ Command Error log
0x11 GPL R/O 1 SATA Phy Event Counters log
0x21 GPL R/O 1 Write stream error log
0x22 GPL R/O 1 Read stream error log
0x24 GPL R/O 512 Current Device Internal Status Data log
0x30 GPL,SL R/O 9 IDENTIFY DEVICE data log
0x80-0x9f GPL,SL R/W 16 Host vendor specific log
0xa1 GPL,SL VS 24 Device vendor specific log
0xa2 GPL VS 8160 Device vendor specific log
0xa6 GPL VS 192 Device vendor specific log
0xa8-0xa9 GPL,SL VS 136 Device vendor specific log
0xab GPL VS 1 Device vendor specific log
0xb0 GPL VS 9048 Device vendor specific log
0xbe-0xbf GPL VS 65535 Device vendor specific log
0xc0 GPL,SL VS 1 Device vendor specific log
0xc1 GPL,SL VS 16 Device vendor specific log
0xc3 GPL,SL VS 8 Device vendor specific log
0xc4 GPL,SL VS 24 Device vendor specific log
0xd1 GPL VS 264 Device vendor specific log
0xd3 GPL VS 1920 Device vendor specific log
0xe0 GPL,SL R/W 1 SCT Command/Status
0xe1 GPL,SL R/W 1 SCT Data Transfer
SMART Extended Comprehensive Error Log Version: 1 (5 sectors)
No Errors Logged
SMART Extended Self-test Log Version: 1 (1 sectors)
No self-tests have been logged. [To run self-tests, use: smartctl -t]
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
SCT Status Version: 3
SCT Version (vendor specific): 522 (0x020a)
Device State: Active (0)
Current Temperature: 32 Celsius
Power Cycle Min/Max Temperature: 18/32 Celsius
Lifetime Min/Max Temperature: 18/39 Celsius
Under/Over Temperature Limit Count: 0/0
SCT Temperature History Version: 2
Temperature Sampling Period: 3 minutes
Temperature Logging Interval: 94 minutes
Min/Max recommended Temperature: 1/61 Celsius
Min/Max Temperature Limit: 2/60 Celsius
Temperature History Size (Index): 128 (70)
Index Estimated Time Temperature Celsius
71 2024-11-09 09:40 33 **************
72 2024-11-09 11:14 ? -
73 2024-11-09 12:48 20 *
74 2024-11-09 14:22 ? -
75 2024-11-09 15:56 28 *********
76 2024-11-09 17:30 ? -
77 2024-11-09 19:04 20 *
78 2024-11-09 20:38 ? -
79 2024-11-09 22:12 19 -
80 2024-11-09 23:46 34 ***************
81 2024-11-10 01:20 35 ****************
... ..( 2 skipped). .. ****************
84 2024-11-10 06:02 35 ****************
85 2024-11-10 07:36 34 ***************
86 2024-11-10 09:10 34 ***************
87 2024-11-10 10:44 ? -
88 2024-11-10 12:18 19 -
89 2024-11-10 13:52 33 **************
90 2024-11-10 15:26 34 ***************
91 2024-11-10 17:00 33 **************
92 2024-11-10 18:34 33 **************
93 2024-11-10 20:08 34 ***************
94 2024-11-10 21:42 33 **************
95 2024-11-10 23:16 33 **************
96 2024-11-11 00:50 33 **************
97 2024-11-11 02:24 ? -
98 2024-11-11 03:58 20 *
99 2024-11-11 05:32 33 **************
100 2024-11-11 07:06 34 ***************
... ..( 3 skipped). .. ***************
104 2024-11-11 13:22 34 ***************
105 2024-11-11 14:56 36 *****************
106 2024-11-11 16:30 34 ***************
107 2024-11-11 18:04 33 **************
108 2024-11-11 19:38 ? -
109 2024-11-11 21:12 20 *
110 2024-11-11 22:46 34 ***************
111 2024-11-12 00:20 34 ***************
112 2024-11-12 01:54 34 ***************
113 2024-11-12 03:28 33 **************
114 2024-11-12 05:02 33 **************
115 2024-11-12 06:36 ? -
116 2024-11-12 08:10 20 *
117 2024-11-12 09:44 ? -
118 2024-11-12 11:18 19 -
119 2024-11-12 12:52 ? -
120 2024-11-12 14:26 20 *
121 2024-11-12 16:00 ? -
122 2024-11-12 17:34 19 -
123 2024-11-12 19:08 32 *************
124 2024-11-12 20:42 32 *************
125 2024-11-12 22:16 33 **************
126 2024-11-12 23:50 32 *************
127 2024-11-13 01:24 32 *************
0 2024-11-13 02:58 32 *************
1 2024-11-13 04:32 31 ************
2 2024-11-13 06:06 31 ************
3 2024-11-13 07:40 ? -
4 2024-11-13 09:14 19 -
5 2024-11-13 10:48 ? -
6 2024-11-13 12:22 19 -
7 2024-11-13 13:56 ? -
8 2024-11-13 15:30 19 -
9 2024-11-13 17:04 ? -
10 2024-11-13 18:38 19 -
11 2024-11-13 20:12 ? -
12 2024-11-13 21:46 19 -
13 2024-11-13 23:20 ? -
14 2024-11-14 00:54 20 *
15 2024-11-14 02:28 ? -
16 2024-11-14 04:02 19 -
17 2024-11-14 05:36 ? -
18 2024-11-14 07:10 19 -
19 2024-11-14 08:44 ? -
20 2024-11-14 10:18 20 *
21 2024-11-14 11:52 ? -
22 2024-11-14 13:26 19 -
23 2024-11-14 15:00 ? -
24 2024-11-14 16:34 19 -
25 2024-11-14 18:08 ? -
26 2024-11-14 19:42 18 -
27 2024-11-14 21:16 ? -
28 2024-11-14 22:50 18 -
29 2024-11-15 00:24 ? -
30 2024-11-15 01:58 18 -
31 2024-11-15 03:32 ? -
32 2024-11-15 05:06 26 *******
33 2024-11-15 06:40 ? -
34 2024-11-15 08:14 27 ********
35 2024-11-15 09:48 ? -
36 2024-11-15 11:22 30 ***********
37 2024-11-15 12:56 ? -
38 2024-11-15 14:30 19 -
39 2024-11-15 16:04 ? -
40 2024-11-15 17:38 19 -
41 2024-11-15 19:12 ? -
42 2024-11-15 20:46 18 -
43 2024-11-15 22:20 ? -
44 2024-11-15 23:54 18 -
45 2024-11-16 01:28 ? -
46 2024-11-16 03:02 19 -
47 2024-11-16 04:36 ? -
48 2024-11-16 06:10 18 -
49 2024-11-16 07:44 ? -
50 2024-11-16 09:18 18 -
51 2024-11-16 10:52 ? -
52 2024-11-16 12:26 18 -
53 2024-11-16 14:00 ? -
54 2024-11-16 15:34 19 -
55 2024-11-16 17:08 ? -
56 2024-11-16 18:42 32 *************
57 2024-11-16 20:16 ? -
58 2024-11-16 21:50 30 ***********
59 2024-11-16 23:24 ? -
60 2024-11-17 00:58 19 -
61 2024-11-17 02:32 ? -
62 2024-11-17 04:06 18 -
63 2024-11-17 05:40 ? -
64 2024-11-17 07:14 18 -
65 2024-11-17 08:48 32 *************
66 2024-11-17 10:22 32 *************
67 2024-11-17 11:56 31 ************
68 2024-11-17 13:30 31 ************
69 2024-11-17 15:04 32 *************
70 2024-11-17 16:38 32 *************
SCT Error Recovery Control:
Read: Disabled
Write: Disabled
Device Statistics (GP Log 0x04)
Page Offset Size Value Flags Description
0x01 ===== = = === == General Statistics (rev 1) ==
0x01 0x008 4 67 --- Lifetime Power-On Resets
0x01 0x010 4 724 --- Power-on Hours
0x01 0x018 6 19250454123 --- Logical Sectors Written
0x01 0x020 6 69159153 --- Number of Write Commands
0x01 0x028 6 91161852212 --- Logical Sectors Read
0x01 0x030 6 46542447 --- Number of Read Commands
0x01 0x038 6 - --- Date and Time TimeStamp
0x03 ===== = = === == Rotating Media Statistics (rev 1) ==
0x03 0x008 4 714 --- Spindle Motor Power-on Hours
0x03 0x010 4 662 --- Head Flying Hours
0x03 0x018 4 186 --- Head Load Events
0x03 0x020 4 0 --- Number of Reallocated Logical Sectors
0x03 0x028 4 0 --- Read Recovery Attempts
0x03 0x030 4 0 --- Number of Mechanical Start Failures
0x03 0x038 4 0 --- Number of Realloc. Candidate Logical Sectors
0x03 0x040 4 8 --- Number of High Priority Unload Events
0x04 ===== = = === == General Errors Statistics (rev 1) ==
0x04 0x008 4 0 --- Number of Reported Uncorrectable Errors
0x04 0x010 4 23 --- Resets Between Cmd Acceptance and Completion
0x05 ===== = = === == Temperature Statistics (rev 1) ==
0x05 0x008 1 32 --- Current Temperature
0x05 0x010 1 30 --- Average Short Term Temperature
0x05 0x018 1 31 --- Average Long Term Temperature
0x05 0x020 1 39 --- Highest Temperature
0x05 0x028 1 19 --- Lowest Temperature
0x05 0x030 1 34 --- Highest Average Short Term Temperature
0x05 0x038 1 30 --- Lowest Average Short Term Temperature
0x05 0x040 1 31 --- Highest Average Long Term Temperature
0x05 0x048 1 30 --- Lowest Average Long Term Temperature
0x05 0x050 4 0 --- Time in Over-Temperature
0x05 0x058 1 70 --- Specified Maximum Operating Temperature
0x05 0x060 4 0 --- Time in Under-Temperature
0x05 0x068 1 0 --- Specified Minimum Operating Temperature
0x06 ===== = = === == Transport Statistics (rev 1) ==
0x06 0x008 4 216 --- Number of Hardware Resets
0x06 0x010 4 120 --- Number of ASR Events
0x06 0x018 4 68 --- Number of Interface CRC Errors
|||_ C monitored condition met
||__ D supports DSN
|___ N normalized value
Pending Defects log (GP Log 0x0c)
No Defects Logged
SATA Phy Event Counters (GP Log 0x11)
ID Size Value Description
0x000a 2 5 Device-to-host register FISes sent due to a COMRESET
0x0001 2 3 Command failed due to ICRC error
0x0003 2 3 R_ERR response for device-to-host data FIS
0x0004 2 0 R_ERR response for host-to-device data FIS
0x0006 2 1 R_ERR response for device-to-host non-data FIS
0x0007 2 0 R_ERR response for host-to-device non-data FIS
Seagate FARM log (GP Log 0xa6) supported [try: -l farm]
Update: Having read up on the use of smartctl
, I ran several short tests; no issues. Running a conveyance SMART test gets me a “Connection timed out” every time. See this output after three tries:
smartctl 7.4 2023-08-01 r5530 [x86_64-linux-6.11.7-200.fc40.x86_64] (local build)
Copyright (C) 2002-23, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF READ SMART DATA SECTION ===
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Conveyance captive Interrupted (host reset) 50% 725 -
# 2 Conveyance captive Interrupted (host reset) 50% 725 -
# 3 Conveyance captive Interrupted (host reset) 50% 725 -
# 4 Short captive Completed without error 00% 725 -
# 5 Short offline Completed without error 00% 725 -
Meanwhile in dmesg
and journalctl
, I get these two lines:
[ 185.762588] ata2: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
[ 185.800229] ata2.00: configured for UDMA/133
Yes, I’ve switched SATA ports again, but that should not be a problem.
Can the issue be seen in this output? Can I use this in my communication with the computer store’s people? What should I tell or show them?
Thanks,
FWieP