drive failure during rebuild causes page fault

Doug White dwhite at gumbysoft.com
Mon Dec 13 18:35:00 PST 2004


On Mon, 13 Dec 2004, Joe Rhett wrote:

> > This is why I don't trust ATA RAID for fault tolerance -- it'll save your
> > data, but the system will tank.  Since the disk state is maintained by
> > the OS and not abstracted by a separate processor, if a disk dies in a
> > particularly bad way the system may not be able to cope.
>
> Yes, but SATA isn't limited by this problem.  It does have a processor per
> disk. (this is all SATA, if I didn't make that clear)

Actually, on SATA it's worse -- the disk just stops responding to everything
and hangs.  If you don't detect this condition, you go into an
infinite wait.
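
To illustrate the point about detecting the hang, here is a minimal sketch
of a bounded wait, written in Python for brevity.  It is not taken from the
FreeBSD ATA driver: wait_ready, poll_status and DeviceTimeout are
hypothetical names, and the timeout values are arbitrary.  The idea is only
that every wait on the device carries a deadline, so a disk that stops
answering produces an error the caller can handle instead of a hang.

```python
import time


class DeviceTimeout(Exception):
    """Raised when the (hypothetical) device does not answer in time."""


def wait_ready(poll_status, timeout_s=5.0, interval_s=0.01,
               clock=time.monotonic, sleep=time.sleep):
    """Poll until poll_status() returns True or the deadline passes.

    poll_status is an assumed callable that returns True once the device
    reports ready.  A device that never answers simply keeps returning
    False, which is what turns an unbounded loop into an infinite wait.
    """
    deadline = clock() + timeout_s
    while True:
        if poll_status():
            return
        if clock() >= deadline:
            # Give up and report the failure so the caller can mark the
            # disk as dead rather than blocking forever.
            raise DeviceTimeout(f"no response within {timeout_s} s")
        sleep(interval_s)
```

A caller would wrap the wait in try/except DeviceTimeout and treat the
exception as "this disk is gone", for example by dropping it from the array.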

In any case, yes, the ATA RAID code could use a massive robustness pass.  So
could the core ATA code.  Patches accepted :)

-- 
Doug White                    |  FreeBSD: The Power to Serve
dwhite at gumbysoft.com          |  www.FreeBSD.org


More information about the freebsd-stable mailing list