Troube with SSD

Willem Jan Withagen wjw at digiware.nl
Wed Feb 1 13:40:24 UTC 2012


Hi,

I have this ZFS server up for about 27 days, and about 3 weeks ago (was
not really paying attention) it turns out it lost its SSD that I'm using
for log and cache. There is also a poor and lonely memory stick for log.
So the box did not really suffer file loss.

system is running:
FreeBSD zfs.digiware.nl 8.2-STABLE FreeBSD 8.2-STABLE #58: Thu Nov 17
09:43:46 CET 2011
root at zfs.digiware.nl:/home/obj/usr/src/src8/src/sys/ZFS  amd64

more info like dmesg, pciconf, kernconf, zpool iostat at:
	http://www.tegenbosch28.nl/FreeBSD/systems/ZFS/

But it is weird to just lose a SSD from the bus. And it has happened
before. And you can see that AHCI really banged on the frontdoor...

The device is a Corsair 60Gb Force GT. And thusfar I have not found any
suggestions that that serie of devices is prone to doing this.

It was a real dead device, the only way to get it back:
	powercycle the device by pulling it, and stick it back
	then camcontrol rescan

I've now upgrade it to a 120Gb Corsair, to see if that has the same problem.

Other FreeBSD-ers have like problems?

Regards,
--WjW


Jan  7 10:04:24 zfs kernel: ahcich3: Timeout on slot 27 port 0
Jan  7 10:04:24 zfs kernel: ahcich3: is 00000000 cs 20000000 ss 38000000
rs 38000000 tfd c0 serr 00000000 cmd 0004dd17
Jan  7 10:04:56 zfs kernel: ahcich3: AHCI reset: device not ready after
31000ms (tfd = 00000080)
Jan  7 10:05:26 zfs kernel: ahcich3: Timeout on slot 29 port 0
Jan  7 10:05:26 zfs kernel: ahcich3: is 00000000 cs 20000000 ss 00000000
rs 20000000 tfd 80 serr 00000000 cmd 0004dd17
Jan  7 10:05:57 zfs kernel: ahcich3: AHCI reset: device not ready after
31000ms (tfd = 00000080)
Jan  7 10:06:27 zfs kernel: ahcich3: Timeout on slot 29 port 0
Jan  7 10:06:27 zfs kernel: ahcich3: is 00000000 cs 20000000 ss 00000000
rs 20000000 tfd 80 serr 00000000 cmd 0004dd17
Jan  7 10:06:27 zfs kernel: (ada2:ahcich3:0:0:0): lost device
Jan  7 10:06:58 zfs kernel: ahcich3: AHCI reset: device not ready after
31000ms (tfd = 00000080)
Jan  7 10:07:28 zfs kernel: ahcich3: Timeout on slot 29 port 0
Jan  7 10:07:28 zfs kernel: ahcich3: is 00000000 cs e0000000 ss e0000000
rs e0000000 tfd 80 serr 00000000 cmd 0004dd17
Jan  7 10:08:16 zfs kernel: ahcich3: AHCI reset: device not ready after
31000ms (tfd = 00000080)
Jan  7 10:08:16 zfs kernel: ahcich3: Poll timeout on slot 31 port 0
Jan  7 10:08:16 zfs kernel: ahcich3: is 00000000 cs 80000000 ss 00000000
rs 80000000 tfd 80 serr 00000000 cmd 0004df17
Jan  7 10:08:46 zfs kernel: ahcich3: Timeout on slot 31 port 0
Jan  7 10:08:46 zfs kernel: ahcich3: is 00000000 cs 80000000 ss 00000000
rs 80000000 tfd 80 serr 00000000 cmd 0004df17
Jan  7 10:08:48 zfs kernel: (ada2:ahcich3:0:0:0): removing device entry
Jan  7 10:09:33 zfs kernel: ahcich3: AHCI reset: device not ready after
31000ms (tfd = 00000080)
Jan  7 10:09:33 zfs kernel: ahcich3: Poll timeout on slot 31 port 0
Jan  7 10:09:33 zfs kernel: ahcich3: is 00000000 cs 80000000 ss 00000000
rs 80000000 tfd 80 serr 00000000 cmd 0004df17


More information about the freebsd-stable mailing list