Hard Drive Issues
Richard McIntyre
rem at tco2.thecompanyonline.com
Fri Oct 13 12:04:07 PDT 2006
David Kelly wrote:
>On Thu, Oct 12, 2006 at 06:54:53PM +0100, Spiros Papadopoulos wrote:
>
>
>>Since as you say everything is working, maybe it is a good idea to
>>take a look and run the fsck command at least it may give you some
>>more information, which you can post in order to get better answers
>>
>>
>
>That too, but first I'd start with sysutils/smartmontools and see what
>the drive and its built-in log says.
>
>
>
I'm having a similar problem,
Oct 13 03:01:31 tco1 kernel: ad2: FAILURE - READ_DMA
status=51<READY,DSC,ERROR> error=40<UNCORRECTABLE> LBA=181778119
Oct 13 07:11:15 tco1 kernel: ad2: FAILURE - READ_DMA
status=51<READY,DSC,ERROR> error=40<UNCORRECTABLE> LBA=181778119
I'm assuming that particular sector on the drive is dying, I have backed
everything up on the drive, can anyone give me more information, should
the drive simply be replaced or is it possible that this is simply a TOC
error and could be corrected by newfs to the drive?
I'm guessing it will need to be replaced, output of smartctl is below....
Thanks
~Richard
uname -a
>>FreeBSD 5.3-RELEASE FreeBSD 5.3-RELEASE #0: Mon May 2 22:32:50 EDT
2005
>>root at tco1:/usr/src/sys/i386/compile/TCO1.2005.05.02.001 i386
My output of smartmontools is:
smartctl -a -s on /dev/ad2
smartctl version 5.36 [i386-portbld-freebsd5.3] Copyright (C) 2002-6
Bruce Allen
Home page is http://smartmontools.sourceforge.net/
=== START OF INFORMATION SECTION ===
Model Family: Seagate Barracuda 7200.7 and 7200.7 Plus family
Device Model: ST3200822A
Serial Number: 5LJ0LW2T
Firmware Version: 3.01
User Capacity: 200,049,647,616 bytes
Device is: In smartctl database [for details use: -P show]
ATA Version is: 6
ATA Standard is: ATA/ATAPI-6 T13 1410D revision 2
Local Time is: Fri Oct 13 14:56:23 2006 EDT
SMART support is: Available - device has SMART capability.
SMART support is: Disabled
=== START OF ENABLE/DISABLE COMMANDS SECTION ===
SMART Enabled.
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status: (0x82) Offline data collection activity
was completed without error.
Auto Offline Data Collection:
Enabled.
Self-test execution status: ( 0) The previous self-test routine
completed
without error or no self-test
has ever
been run.
Total time to complete Offline
data collection: ( 430) seconds.
Offline data collection
capabilities: (0x5b) SMART execute Offline immediate.
Auto Offline data collection
on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
No Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
No General Purpose Logging support.
Short self-test routine
recommended polling time: ( 1) minutes.
Extended self-test routine
recommended polling time: ( 111) minutes.
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE
UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000f 051 048 006 Pre-fail
Always - 22488920
3 Spin_Up_Time 0x0003 097 097 000 Pre-fail
Always - 0
4 Start_Stop_Count 0x0032 100 100 020 Old_age
Always - 21
5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail
Always - 1
7 Seek_Error_Rate 0x000f 084 060 030 Pre-fail
Always - 328020832
9 Power_On_Hours 0x0032 082 082 000 Old_age
Always - 16043
10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail
Always - 0
12 Power_Cycle_Count 0x0032 100 100 020 Old_age
Always - 22
194 Temperature_Celsius 0x0022 030 040 000 Old_age
Always - 30
195 Hardware_ECC_Recovered 0x001a 051 048 000 Old_age
Always - 22488920
197 Current_Pending_Sector 0x0012 100 100 000 Old_age
Always - 1
198 Offline_Uncorrectable 0x0010 100 100 000 Old_age
Offline - 1
199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age
Always - 0
200 Multi_Zone_Error_Rate 0x0000 100 253 000 Old_age
Offline - 0
202 TA_Increase_Count 0x0032 051 204 000 Old_age
Always - 49
SMART Error Log Version: 1
ATA Error Count: 7742 (device log contains only the most recent five errors)
CR = Command Register [HEX]
FR = Features Register [HEX]
SC = Sector Count Register [HEX]
SN = Sector Number Register [HEX]
CL = Cylinder Low Register [HEX]
CH = Cylinder High Register [HEX]
DH = Device/Head Register [HEX]
DC = Device Command Register [HEX]
ER = Error register [HEX]
ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.
Error 7742 occurred at disk power-on lifetime: 16036 hours (668 days + 4
hours)
When the command that caused the error occurred, the device was active
or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 04 c7 b6 d5 ea Error: UNC 4 sectors at LBA = 0x0ad5b6c7 = 181778119
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 04 c7 b6 d5 ea 00 15:22:37.737 READ DMA
c8 00 04 9b b4 e1 ea 00 15:22:37.493 READ DMA
c8 00 04 97 b4 e1 ea 00 15:22:37.251 READ DMA
c8 00 04 a7 b4 e1 ea 00 15:22:37.002 READ DMA
c8 00 04 a3 b4 e1 ea 00 15:22:36.761 READ DMA
Error 7741 occurred at disk power-on lifetime: 16032 hours (668 days + 0
hours)
When the command that caused the error occurred, the device was active
or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 04 c7 b6 d5 ea Error: UNC 4 sectors at LBA = 0x0ad5b6c7 = 181778119
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 04 c7 b6 d5 ea 00 11:08:40.154 READ DMA
35 00 20 df ff 2b 40 00 11:08:40.145 WRITE DMA EXT
35 00 20 1f d5 16 40 00 11:08:44.953 WRITE DMA EXT
ca 00 20 3f c0 92 ef 00 11:08:40.258 WRITE DMA
ca 00 20 df 85 81 ef 00 11:08:40.250 WRITE DMA
Error 7740 occurred at disk power-on lifetime: 16012 hours (667 days + 4
hours)
When the command that caused the error occurred, the device was active
or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 04 c7 b6 d5 ea Error: UNC 4 sectors at LBA = 0x0ad5b6c7 = 181778119
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 04 c7 b6 d5 ea 00 15:49:49.473 READ DMA
c8 00 04 9b b4 e1 ea 00 15:49:49.220 READ DMA
c8 00 04 97 b4 e1 ea 00 15:49:52.420 READ DMA
c8 00 04 a7 b4 e1 ea 00 15:49:52.175 READ DMA
c8 00 04 a3 b4 e1 ea 00 15:49:51.929 READ DMA
Error 7739 occurred at disk power-on lifetime: 16008 hours (667 days + 0
hours)
When the command that caused the error occurred, the device was active
or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 04 c7 b6 d5 ea Error: UNC 4 sectors at LBA = 0x0ad5b6c7 = 181778119
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 04 c7 b6 d5 ea 00 11:35:56.771 READ DMA
35 00 20 bf e7 39 40 00 11:35:56.765 WRITE DMA EXT
35 00 20 7f 6b 2e 40 00 11:35:56.749 WRITE DMA EXT
35 00 20 3f 0d c7 40 00 11:35:56.740 WRITE DMA EXT
35 00 20 1f 4f c1 40 00 11:35:56.732 WRITE DMA EXT
Error 7738 occurred at disk power-on lifetime: 15989 hours (666 days + 5
hours)
When the command that caused the error occurred, the device was active
or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 04 c7 b6 d5 ea Error: UNC 4 sectors at LBA = 0x0ad5b6c7 = 181778119
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 04 c7 b6 d5 ea 00 16:16:27.719 READ DMA
c8 00 04 9b b4 e1 ea 00 16:16:27.468 READ DMA
c8 00 04 97 b4 e1 ea 00 16:16:30.682 READ DMA
c8 00 04 a7 b4 e1 ea 00 16:16:30.440 READ DMA
c8 00 04 a3 b4 e1 ea 00 16:16:30.174 READ DMA
SMART Self-test log structure revision number 1
No self-tests have been logged. [To run self-tests, use: smartctl -t]
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
More information about the freebsd-questions
mailing list