RAID and NFS exports (Possible Data Corruption)

David Malone dwmalone at maths.tcd.ie
Tue Jul 15 09:20:10 PDT 2003


On Tue, Jul 15, 2003 at 06:26:24AM -0700, Sumit Shah wrote:
> Here is a message I sent to freebsd-questions and I was hoping I could 
> get some help debugging this.

It seems very unlikely that restarting mountd cound cause an error
like:

>ad4: hard error reading fsbn  242727552

The error means that that the disk said that there was an error
trying to read this block. You say that when you rebooted that the
controler said a disk had gone bad, so this would sort of confirm
this. (I could believe that restarting mountd might upset raid stuff
if there were a kernel bug, but it seems very unlikely it could
cause a disk to go bad.)

My best guess would be that you have a bad batch of disks that
happen to have failed in similar ways. It is possible that restarting
mountd uncovered the errors, 'cos I think mountd internally does
a remount of the filesystem in question and that might cause a chunk
of stuff to be flushed out on to the disk, highlighting an error.

(I had a bunch of the IBM "deathstar" disks fail on me within the
space of a week or so, after they'd been in use for about six
months.)

	David.


More information about the freebsd-hackers mailing list