Hard Drive Issues

Richard McIntyre rem at tco2.thecompanyonline.com
Fri Oct 13 12:04:07 PDT 2006


David Kelly wrote:

>On Thu, Oct 12, 2006 at 06:54:53PM +0100, Spiros Papadopoulos wrote:
>  
>
>>Since as you say everything is working, maybe it is a good idea to
>>take a look and run the fsck command at least it may give you some
>>more information, which you can post in order to get better answers
>>    
>>
>
>That too, but first I'd start with sysutils/smartmontools and see what
>the drive and its built-in log says.
>
>  
>




I'm having a similar problem,
Oct 13 03:01:31 tco1 kernel: ad2: FAILURE - READ_DMA 
status=51<READY,DSC,ERROR> error=40<UNCORRECTABLE> LBA=181778119
Oct 13 07:11:15 tco1 kernel: ad2: FAILURE - READ_DMA 
status=51<READY,DSC,ERROR> error=40<UNCORRECTABLE> LBA=181778119

I'm assuming that particular sector on the drive is dying, I have backed 
everything up on the drive, can anyone give me more information, should 
the drive simply be replaced or is it possible that this is simply a TOC 
error and could be corrected by newfs to the drive?

I'm guessing it will need to be replaced, output of smartctl is below....

Thanks
~Richard

uname -a
 >>FreeBSD 5.3-RELEASE FreeBSD 5.3-RELEASE #0: Mon May  2 22:32:50 EDT 
2005    
 >>root at tco1:/usr/src/sys/i386/compile/TCO1.2005.05.02.001  i386


My output of smartmontools is:
smartctl -a -s on /dev/ad2
smartctl version 5.36 [i386-portbld-freebsd5.3] Copyright (C) 2002-6 
Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF INFORMATION SECTION ===
Model Family:     Seagate Barracuda 7200.7 and 7200.7 Plus family
Device Model:     ST3200822A
Serial Number:    5LJ0LW2T
Firmware Version: 3.01
User Capacity:    200,049,647,616 bytes
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   6
ATA Standard is:  ATA/ATAPI-6 T13 1410D revision 2
Local Time is:    Fri Oct 13 14:56:23 2006 EDT
SMART support is: Available - device has SMART capability.
SMART support is: Disabled

=== START OF ENABLE/DISABLE COMMANDS SECTION ===
SMART Enabled.

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: 
Enabled.
Self-test execution status:      (   0) The previous self-test routine 
completed
                                        without error or no self-test 
has ever
                                        been run.
Total time to complete Offline
data collection:                 ( 430) seconds.
Offline data collection
capabilities:                    (0x5b) SMART execute Offline immediate.
                                        Auto Offline data collection 
on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        No General Purpose Logging support.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        ( 111) minutes.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      
UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   051   048   006    Pre-fail  
Always       -       22488920
  3 Spin_Up_Time            0x0003   097   097   000    Pre-fail  
Always       -       0
  4 Start_Stop_Count        0x0032   100   100   020    Old_age   
Always       -       21
  5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail  
Always       -       1
  7 Seek_Error_Rate         0x000f   084   060   030    Pre-fail  
Always       -       328020832
  9 Power_On_Hours          0x0032   082   082   000    Old_age   
Always       -       16043
 10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  
Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   020    Old_age   
Always       -       22
194 Temperature_Celsius     0x0022   030   040   000    Old_age   
Always       -       30
195 Hardware_ECC_Recovered  0x001a   051   048   000    Old_age   
Always       -       22488920
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   
Always       -       1
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   
Offline      -       1
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   
Always       -       0
200 Multi_Zone_Error_Rate   0x0000   100   253   000    Old_age   
Offline      -       0
202 TA_Increase_Count       0x0032   051   204   000    Old_age   
Always       -       49

SMART Error Log Version: 1
ATA Error Count: 7742 (device log contains only the most recent five errors)
        CR = Command Register [HEX]
        FR = Features Register [HEX]
        SC = Sector Count Register [HEX]
        SN = Sector Number Register [HEX]
        CL = Cylinder Low Register [HEX]
        CH = Cylinder High Register [HEX]
        DH = Device/Head Register [HEX]
        DC = Device Command Register [HEX]
        ER = Error register [HEX]
        ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 7742 occurred at disk power-on lifetime: 16036 hours (668 days + 4 
hours)
  When the command that caused the error occurred, the device was active 
or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 04 c7 b6 d5 ea  Error: UNC 4 sectors at LBA = 0x0ad5b6c7 = 181778119

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 04 c7 b6 d5 ea 00      15:22:37.737  READ DMA
  c8 00 04 9b b4 e1 ea 00      15:22:37.493  READ DMA
  c8 00 04 97 b4 e1 ea 00      15:22:37.251  READ DMA
  c8 00 04 a7 b4 e1 ea 00      15:22:37.002  READ DMA
  c8 00 04 a3 b4 e1 ea 00      15:22:36.761  READ DMA

Error 7741 occurred at disk power-on lifetime: 16032 hours (668 days + 0 
hours)
  When the command that caused the error occurred, the device was active 
or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 04 c7 b6 d5 ea  Error: UNC 4 sectors at LBA = 0x0ad5b6c7 = 181778119

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 04 c7 b6 d5 ea 00      11:08:40.154  READ DMA
  35 00 20 df ff 2b 40 00      11:08:40.145  WRITE DMA EXT
  35 00 20 1f d5 16 40 00      11:08:44.953  WRITE DMA EXT
  ca 00 20 3f c0 92 ef 00      11:08:40.258  WRITE DMA
  ca 00 20 df 85 81 ef 00      11:08:40.250  WRITE DMA

Error 7740 occurred at disk power-on lifetime: 16012 hours (667 days + 4 
hours)
  When the command that caused the error occurred, the device was active 
or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 04 c7 b6 d5 ea  Error: UNC 4 sectors at LBA = 0x0ad5b6c7 = 181778119

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 04 c7 b6 d5 ea 00      15:49:49.473  READ DMA
  c8 00 04 9b b4 e1 ea 00      15:49:49.220  READ DMA
  c8 00 04 97 b4 e1 ea 00      15:49:52.420  READ DMA
  c8 00 04 a7 b4 e1 ea 00      15:49:52.175  READ DMA
  c8 00 04 a3 b4 e1 ea 00      15:49:51.929  READ DMA

Error 7739 occurred at disk power-on lifetime: 16008 hours (667 days + 0 
hours)
  When the command that caused the error occurred, the device was active 
or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 04 c7 b6 d5 ea  Error: UNC 4 sectors at LBA = 0x0ad5b6c7 = 181778119

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 04 c7 b6 d5 ea 00      11:35:56.771  READ DMA
  35 00 20 bf e7 39 40 00      11:35:56.765  WRITE DMA EXT
  35 00 20 7f 6b 2e 40 00      11:35:56.749  WRITE DMA EXT
  35 00 20 3f 0d c7 40 00      11:35:56.740  WRITE DMA EXT
  35 00 20 1f 4f c1 40 00      11:35:56.732  WRITE DMA EXT

Error 7738 occurred at disk power-on lifetime: 15989 hours (666 days + 5 
hours)
  When the command that caused the error occurred, the device was active 
or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  40 51 04 c7 b6 d5 ea  Error: UNC 4 sectors at LBA = 0x0ad5b6c7 = 181778119

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  c8 00 04 c7 b6 d5 ea 00      16:16:27.719  READ DMA
  c8 00 04 9b b4 e1 ea 00      16:16:27.468  READ DMA
  c8 00 04 97 b4 e1 ea 00      16:16:30.682  READ DMA
  c8 00 04 a7 b4 e1 ea 00      16:16:30.440  READ DMA
  c8 00 04 a3 b4 e1 ea 00      16:16:30.174  READ DMA

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]


SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.



More information about the freebsd-questions mailing list