SSD becomes detached 9.2

Jim Ohlstein jim at ohlste.in
Sat Nov 23 05:08:33 UTC 2013


Hello,

I am setting up a new (remote) server. I initially installed 9.2 RC4 
amd64 because that's what the data center put in the drive for me. 
Shortly thereafter I downloaded 9.2-STABLE sources and compiled world 
and  a generic kernel. While doing so the system became unreachable by 
SSH The SSH sessions appeared to connect but there was never any data 
returned. I could telnet to port 22 but I could not log in from a 
terminal. I could ping the server as well. I had the server rebooted.

I did install an updated kernel and world (9.2-STABLE amd64 r258426) and 
it happened again just now.


The OS is installed on a 120 GB SSD with root on ZFS. There is also 
another SSD for L2ARC and there are two 3TB SATA drives in a separate 
ZFS mirror pool. All drives passed cursory testing with smartmontools.

CPU is an AMD-8120 (8 core Zambezi).

Very little is running on the server as it is not yet in production 
(thankfully).

Here is the relevant part of dmesg:

ahcich0: Timeout on slot 31 port 0
ahcich0: is 00000008 cs 00000000 ss 00000000 rs f8000000 tfd 40 serr 
00000000 cmd 00047f17
(ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 20 ff a6 40 01 00 
00 00 00 00
(ada0:ahcich0:0:0:0): CAM status: Command timeout
(ada0:ahcich0:0:0:0): Retrying command
ahcich0: Timeout on slot 31 port 0
ahcich0: is 00000002 cs 00000000 ss 00000000 rs 80000000 tfd 50 serr 
00000000 cmd 00047f17
(aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 
00 00 00
(aprobe0:ahcich0:0:0:0): CAM status: Command timeout
(aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked
ahcich0: Timeout on slot 31 port 0
ahcich0: is 00000002 cs 00000000 ss 00000000 rs 80000000 tfd 50 serr 
00000000 cmd 00047f17
(aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 
00 00 00
(aprobe0:ahcich0:0:0:0): CAM status: Command timeout
(aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked
ada0 at ahcich0 bus 0 scbus0 target 0 lun 0
ada0: <Samsung SSD 840 EVO 120GB EXT0AB0Q> s/n S1D5NSAD915803Y detached
ahcich0: Timeout on slot 31 port 0
ahcich0: is 00000001 cs 00000000 ss 00000000 rs 80000000 tfd 50 serr 
00000000 cmd 00047f17
(ada0:ahcich0:0:0:0): SETFEATURES ENABLE RCACHE. ACB: ef aa 00 00 00 40 
00 00 00 00 00 00
(ada0:ahcich0:0:0:0): CAM status: Command timeout
(ada0:ahcich0:0:0:0): Error 5, Periph was invalidated
ahcich0: Timeout on slot 31 port 0
ahcich0: is 00000002 cs 00000000 ss 00000000 rs 80000000 tfd 50 serr 
00000000 cmd 00047f17
(aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 
00 00 00
(aprobe0:ahcich0:0:0:0): CAM status: Command timeout
(aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked
ahcich0: Timeout on slot 31 port 0
ahcich0: is 00000002 cs 00000000 ss 00000000 rs 80000000 tfd 50 serr 
00000000 cmd 00047f17
(aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 
00 00 00
(aprobe0:ahcich0:0:0:0): CAM status: Command timeout
(aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked
ahcich0: Timeout on slot 3 port 0
ahcich0: is 00000008 cs 00000000 ss 00000000 rs 8000000f tfd 40 serr 
00000000 cmd 00046317
(ada0:ahcich0:0:0:0): DSM TRIM. ACB: 06 01 00 00 00 40 00 00 00 00 01 00
(ada0:ahcich0:0:0:0): CAM status: Unconditionally Re-queue Request
(ada0:ahcich0:0:0:0): Error 5, Periph was invalidated
(ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 10 20 ff a6 40 01 00 
00 00 00 00
(ada0:ahcich0:0:0:0): CAM status: Command timeout
(ada0:ahcich0:0:0:0): Error 5, Periph was invalidated
(ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 38 ff a6 40 01 00 
00 00 00 00
(ada0:ahcich0:0:0:0): CAM status: Unconditionally Re-queue Request
(ada0:ahcich0:0:0:0): Error 5, Periph was invalidated
(ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 88 ff a6 40 01 00 
00 00 00 00
(ada0:ahcich0:0:0:0): CAM status: Unconditionally Re-queue Request
(ada0:ahcich0:0:0:0): Error 5, Periph was invalidated
(ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 10 ff a6 40 01 00 
00 00 00 00
(ada0:ahcich0:0:0:0): CAM status: Unconditionally Re-queue Request
(ada0:ahcich0:0:0:0): Error 5, Periph was invalidated
(ada0:ahcich0:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 08 30 ff a6 40 01 00 
00 00 00 00
(ada0:ahcich0:0:0:0): CAM status: Unconditionally Re-queue Request
(ada0:ahcich0:0:0:0): Error 5, Periph was invalidated
(ada0:ahcich0:0:0:0): Periph destroyed
ahcich0: Timeout on slot 3 port 0
ahcich0: is 00000002 cs 00000000 ss 00000000 rs 00000008 tfd 50 serr 
00000000 cmd 00046317
(aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 
00 00 00
(aprobe0:ahcich0:0:0:0): CAM status: Command timeout
(aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked
ahcich0: Timeout on slot 3 port 0
ahcich0: is 00000002 cs 00000000 ss 00000000 rs 00000008 tfd 50 serr 
00000000 cmd 00046317
(aprobe0:ahcich0:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00 00 40 00 00 00 
00 00 00
(aprobe0:ahcich0:0:0:0): CAM status: Command timeout
(aprobe0:ahcich0:0:0:0): Error 5, Retry was blocked

After this event, gpart show lists only ada1, ada2, and ada3. The boot 
drive is ada0.

The entire dmesg can be seen at http://pastebin.com/RqR8LiSb.


-- 
Jim Ohlstein


More information about the freebsd-stable mailing list