[problem] aac0 does not respond
Vladimir Ermakov
samflanker at gmail.com
Tue Mar 24 03:13:40 PDT 2009
Hello, All
Describe my problem:
have volume RAID-10 (SAS-HDD x 6) on Adaptec RAID 5805
2 HHD of 6 have errors in smart data (damaged)
i am try read file /var/db/mysql/ibdata1 from this volume
system does not respond ( lost access to ssh ) after read 6GB data from
this file
and print debug messages on ttyv0
As to prevent the emergence of this problem?
As monitor the status of RAID-controller?
please, any solutions
/Vladimir Ermakov
==========================messages on
ttyv0==================================
Mar 22 20:20:12 df24 kernel: aac0: COMMAND 0xffffffff80859dd0 TIMEOUT
AFTER 50 SECONDS
Mar 22 20:20:12 df24 kernel: aac0: COMMAND 0xffffffff808599e0 TIMEOUT
AFTER 50 SECONDS
Mar 22 20:20:12 df24 kernel: aac0: COMMAND 0xffffffff808569c0 TIMEOUT
AFTER 50 SECONDS
Mar 22 20:20:32 df24 kernel: aac0: COMMAND 0xffffffff80859dd0 TIMEOUT
AFTER 70 SECONDS
Mar 22 20:20:32 df24 kernel: aac0: COMMAND 0xffffffff808599e0 TIMEOUT
AFTER 70 SECONDS
Mar 22 20:20:32 df24 kernel: aac0: COMMAND 0xffffffff808569c0 TIMEOUT
AFTER 70 SECONDS
Mar 22 20:20:52 df24 kernel: aac0: COMMAND 0xffffffff80859dd0 TIMEOUT
AFTER 90 SECONDS
Mar 22 20:20:52 df24 kernel: aac0: COMMAND 0xffffffff808599e0 TIMEOUT
AFTER 90 SECONDS
Mar 22 20:20:52 df24 kernel: aac0: COMMAND 0xffffffff808569c0 TIMEOUT
AFTER 90 SECONDS
Mar 22 20:21:12 df24 kernel: aac0: COMMAND 0xffffffff80859dd0 TIMEOUT
AFTER 111 SECONDS
Mar 22 20:21:12 df24 kernel: aac0: COMMAND 0xffffffff808599e0 TIMEOUT
AFTER 111 SECONDS
Mar 22 20:21:12 df24 kernel: aac0: COMMAND 0xffffffff808569c0 TIMEOUT
AFTER 111 SECONDS
===============================================================
# ls -halt /var/db/mysql/ibdata1
-rw-rw---- 1 88 88 256G Mar 22 23:23 /var/db/mysql/ibdata1
# tar -cf - /var/db/mysql/ibdata1 | pv -br > /dev/null
3.73GB [ 146MB/s]
# smartctl -a -d scsi /dev/pass4
smartctl version 5.38 [amd64-portbld-freebsd7.1] Copyright (C) 2002-8
Bruce Allen
Home page is http://smartmontools.sourceforge.net/
Device: FUJITSU MAX3147RC Version: 0104
Serial number: xxxxxxxxxxxxxxxxx
Device type: <31>
Transport protocol: SAS
Local Time is: Tue Mar 24 10:07:08 2009 CET
Device supports SMART and is Enabled
Temperature Warning Enabled
SMART Health Status: OK
Current Drive Temperature: 21 C
Drive Trip Temperature: 65 C
Manufactured in week 18 of year 2006
Recommended maximum start stop count: 10000 times
Current start stop count: 46 times
Error counter log:
Errors Corrected by Total Correction
Gigabytes Total
ECC rereads/ errors algorithm
processed uncorrected
fast | delayed rewrites corrected invocations [10^9
bytes] errors
read: 0 75782 1488 0 0
31950.874 1488
write: 0 567 0 0 0
12148.416 0
verify: 0 17642 960 0 0
10148.962 960
# uname -a
FreeBSD sys3 7.1-PRERELEASE FreeBSD 7.1-PRERELEASE #1: Mon Nov 3
18:39:49 UTC 2008 root at sys3:/usr/obj/usr/src/sys/SYS3 amd64
# pciconf -lvc
***
aac0 at pci0:10:0:0: class=0x010400 card=0x02b69005 chip=0x02859005
rev=0x09 hdr=0x00
vendor = 'Adaptec Inc'
device = 'AAC-RAID RAID Controller'
class = mass storage
subclass = RAID
cap 01[98] = powerspec 2 supports D0 D1 D3 current D0
cap 05[a0] = MSI supports 2 messages, 64 bit
cap 10[d0] = PCI-Express 1 endpoint
cap 03[90] = VPD
***
# dmesg | grep aac0
aac0: <Adaptec RAID 5805> mem 0xb8a00000-0xb8bfffff irq 16 at device 0.0
on pci10
aac0: Enabling 64-bit address support
aac0: Enable Raw I/O
aac0: Enable 64-bit array
aac0: New comm. interface enabled
aac0: [ITHREAD]
aac0: Adaptec 5805, aac driver 2.0.0-1
aacp0: <SCSI Passthrough Bus> on aac0
aacp1: <SCSI Passthrough Bus> on aac0
aacp2: <SCSI Passthrough Bus> on aac0
aacd0: <RAID 0/1> on aac0
More information about the freebsd-hackers
mailing list