SSD becomes detached 9.2
Jim Ohlstein
jim at ohlste.in
Sat Nov 23 05:08:33 UTC 2013
Hello,
I am setting up a new (remote) server. I initially installed 9.2 RC4
amd64 because that's what the data center put in the drive for me.
Shortly thereafter I downloaded 9.2-STABLE sources and compiled world
and a generic kernel. While doing so the system became unreachable by
SSH The SSH sessions appeared to connect but there was never any data
returned. I could telnet to port 22 but I could not log in from a
terminal. I could ping the server as well. I had the server rebooted.
I did install an updated kernel and world (9.2-STABLE amd64 r258426) and
it happened again just now.
The OS is installed on a 120 GB SSD with root on ZFS. There is also
another SSD for L2ARC and there are two 3TB SATA drives in a separate
ZFS mirror pool. All drives passed cursory testing with smartmontools.
CPU is an AMD-8120 (8 core Zambezi).
Very little is running on the server as it is not yet in production
(thankfully).
Here is the relevant part of dmesg:
ahcich0: Timeout on slot 31 port 0
ahcich0: is 00000008 cs 00000000 ss 00000000 rs f8000000 tfd 40 serr
00000000 cmd 00047f17
(ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 20 ff a6 40 01 00
00 00 00 00
(ada0:ahcich0:0:0:0): CAM status: Command timeout
(ada0:ahcich0:0:0:0): Retrying command
ahcich0: Timeout on slot 31 port 0
ahcich0: is 00000002 cs 00000000 ss 00000000 rs 80000000 tfd 50 serr
00000000 cmd 00047f17
(aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00
00 00 00
(aprobe0:ahcich0:0:0:0): CAM status: Command timeout
(aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked
ahcich0: Timeout on slot 31 port 0
ahcich0: is 00000002 cs 00000000 ss 00000000 rs 80000000 tfd 50 serr
00000000 cmd 00047f17
(aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00
00 00 00
(aprobe0:ahcich0:0:0:0): CAM status: Command timeout
(aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked
ada0 at ahcich0 bus 0 scbus0 target 0 lun 0
ada0: <Samsung SSD 840 EVO 120GB EXT0AB0Q> s/n S1D5NSAD915803Y detached
ahcich0: Timeout on slot 31 port 0
ahcich0: is 00000001 cs 00000000 ss 00000000 rs 80000000 tfd 50 serr
00000000 cmd 00047f17
(ada0:ahcich0:0:0:0): SETFEATURES ENABLE RCACHE. ACB: ef aa 00 00 00 40
00 00 00 00 00 00
(ada0:ahcich0:0:0:0): CAM status: Command timeout
(ada0:ahcich0:0:0:0): Error 5, Periph was invalidated
ahcich0: Timeout on slot 31 port 0
ahcich0: is 00000002 cs 00000000 ss 00000000 rs 80000000 tfd 50 serr
00000000 cmd 00047f17
(aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00
00 00 00
(aprobe0:ahcich0:0:0:0): CAM status: Command timeout
(aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked
ahcich0: Timeout on slot 31 port 0
ahcich0: is 00000002 cs 00000000 ss 00000000 rs 80000000 tfd 50 serr
00000000 cmd 00047f17
(aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00
00 00 00
(aprobe0:ahcich0:0:0:0): CAM status: Command timeout
(aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked
ahcich0: Timeout on slot 3 port 0
ahcich0: is 00000008 cs 00000000 ss 00000000 rs 8000000f tfd 40 serr
00000000 cmd 00046317
(ada0:ahcich0:0:0:0): DSM TRIM. ACB: 06 01 00 00 00 40 00 00 00 00 01 00
(ada0:ahcich0:0:0:0): CAM status: Unconditionally Re-queue Request
(ada0:ahcich0:0:0:0): Error 5, Periph was invalidated
(ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 20 ff a6 40 01 00
00 00 00 00
(ada0:ahcich0:0:0:0): CAM status: Command timeout
(ada0:ahcich0:0:0:0): Error 5, Periph was invalidated
(ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 38 ff a6 40 01 00
00 00 00 00
(ada0:ahcich0:0:0:0): CAM status: Unconditionally Re-queue Request
(ada0:ahcich0:0:0:0): Error 5, Periph was invalidated
(ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 88 ff a6 40 01 00
00 00 00 00
(ada0:ahcich0:0:0:0): CAM status: Unconditionally Re-queue Request
(ada0:ahcich0:0:0:0): Error 5, Periph was invalidated
(ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 10 ff a6 40 01 00
00 00 00 00
(ada0:ahcich0:0:0:0): CAM status: Unconditionally Re-queue Request
(ada0:ahcich0:0:0:0): Error 5, Periph was invalidated
(ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 30 ff a6 40 01 00
00 00 00 00
(ada0:ahcich0:0:0:0): CAM status: Unconditionally Re-queue Request
(ada0:ahcich0:0:0:0): Error 5, Periph was invalidated
(ada0:ahcich0:0:0:0): Periph destroyed
ahcich0: Timeout on slot 3 port 0
ahcich0: is 00000002 cs 00000000 ss 00000000 rs 00000008 tfd 50 serr
00000000 cmd 00046317
(aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00
00 00 00
(aprobe0:ahcich0:0:0:0): CAM status: Command timeout
(aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked
ahcich0: Timeout on slot 3 port 0
ahcich0: is 00000002 cs 00000000 ss 00000000 rs 00000008 tfd 50 serr
00000000 cmd 00046317
(aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00
00 00 00
(aprobe0:ahcich0:0:0:0): CAM status: Command timeout
(aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked
After this event, gpart show lists only ada1, ada2, and ada3. The boot
drive is ada0.
The entire dmesg can be seen at http://pastebin.com/RqR8LiSb.
--
Jim Ohlstein
More information about the freebsd-stable
mailing list