kern/108924: [ar] Panics when Intel MatrixRAID RAID1 is degraded
Jeremy Chadwick
koitsu at FreeBSD.org
Mon Feb 4 14:00:05 PST 2008
The following reply was made to PR kern/108924; it has been noted by GNATS.
From: Jeremy Chadwick <koitsu at freebsd.org>
To: bug-followup at FreeBSD.org, taras at elantech.ru
Cc: sos at freebsd.org, delphij at FreeBSD.org
Subject: Re: kern/108924: [ar] Panics when Intel MatrixRAID RAID1 is
degraded
Date: Mon, 4 Feb 2008 13:55:01 -0800
Wow, this is a fairly old problem with no solution in over a year?
Here's some additional details from my testing. This is easily
reproducable. I'll work on getting a kernel with DDB/KDB so one can do
backtraces via serial console; I can provide access to this if need be.
Details:
* FreeBSD 7.0-RC1 (and previous 7.0 releases)
* Supermicro SuperServer 5015M-T (Supermicro PDSMI+ motherboard)
* Built-in Intel ICH7 controller
* Hot-swap backplane (which works when disks are JBOD and not using
MatrixRAID)
Installed i386 FreeBSD on ar0 without a problem:
ad4: 190782MB <WDC WD2000JD-00HBB0 08.02D08> at ata2-master SATA150
ad6: 190782MB <WDC WD2000JD-00HBB0 08.02D08> at ata3-master SATA150
ar0: 190779MB <Intel MatrixRAID RAID1> status: READY
ar0: disk0 READY (master) using ad4 at ata2-master
ar0: disk1 READY (mirror) using ad6 at ata3-master
But I attempted a hard failure of a disk, and reattachment of that disk,
FreeBSD eventually made the entire mirror unusable.
Here's the steps I took:
1) Removed ad4 disk
- Kernel said:
ar0: WARNING - mirror protection lost. RAID1 array in DEGRADED mode
subdisk4: detached
ad4: detached
2) atacontrol list
- no sign of ad4 on ATA channel 2
3) atacontrol status ar0
ar0: ATA RAID1 status: DEGRADED
subdisks:
0 ---- MISSING
1 ad6 ONLINE
4) I then decided to copy some data to the array while degraded, just to
make sure data got re-mirrored after bringing ad4 back online.
5) cp /boot/kernel/kernel /usr/test
6) Plugged ad4 disk back in
- Disk LED came on for a second, then went off
- No messages from kernel
7) atacontrol list
- no sign of ad4 on ATA channel 2
8) atacontrol attach ata2
atacontrol: ioctl(IOCATAATTACH): File exists
- LED on ad4 disk suddenly turns on and is lit constantly
- gstat showed no activity on ad4
9) atacontrol status ar0
- same as previous run
10) atacontrol reinit ata2
no device present
- LED on ad4 disk shut off
11) atacontrol status ar0
- same as previous run
12) atacontrol reinit ata2
- same as previous run
13) atacontrol detach ata2
14) atacontrol attach ata2
no device present
- Kernel said:
ata2: [ITHREAD]
15) atacontrol detach ata2
16) atacontrol attach ata2
no device present
- Kernel said:
ata2: [ITHREAD]
17) atacontrol reinit ata2
no device present
18) atacontrol list
- no sign of ad4 on ATA channel 2
19) atacontrol detach ata2
20) atacontrol reinit ata2
- Kernel immediately paniced, and machine rebooted.
- Intel RAID BIOS showed disk 0 (ad4) as "Offline Member", but
disk statistics (size) were available, meaning the disk was
visible and accessible
- Array labelled as "Degraded" in BIOS
21) Booted into FreeBSD
- Kernel started, and said:
ad4: 190782MB <WDC WD2000JD-00HBB0 08.02D08> at ata2-master SATA150
ad6: 190782MB <WDC WD2000JD-00HBB0 08.02D08> at ata3-master SATA150
- Kernel immediately paniced; ar0 is never shown.
- Process which paniced is 0 (swapper)
- Single-user mode crashes at same point
- Power-cycling doesn't help
This thread also complains about similar issues:
http://lists.freebsd.org/pipermail/freebsd-questions/2006-February/114274.html
This really needs some focus. I'd be more than happy to purchase and
donate new hardware for testing if required.
--
| Jeremy Chadwick jdc at parodius.com |
| Parodius Networking http://www.parodius.com/ |
| UNIX Systems Administrator Mountain View, CA, USA |
| Making life hard for others since 1977. PGP: 4BD6C0CB |
More information about the freebsd-bugs
mailing list