High traffic NFS performance and availability problems

David Gilbert dgilbert at dclg.ca
Fri Feb 18 08:22:31 PST 2005

>>>>> "David" == David Rice <drice at globat.com> writes:

David> The problems we are having are as follows.

David> 1. Slow performance during peak traffic periods

This is largely due to the cache on your hardware RAID card.
Unfortunately, this is a failure mode of hardware RAID cards that you
can't avoid; you can only delay it by buying more disk.

David> 2. Client boxes
David> have high load averages and sometimes crash due to slow NFS

Clients waiting on NFS requests are still considered "running", so
they count toward the load average even though they use no CPU.
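
To illustrate why blocked NFS clients push the load average up, here
is a minimal Python sketch. It is not the kernel's code. It assumes
the load average is an exponentially damped average of the number of
"active" processes, sampled every 5 seconds, and that processes stuck
waiting on NFS are counted as active. The function names and the
process counts are made up for the example.

```python
import math


def update_load(load, active, interval=5.0, period=60.0):
    """Apply one sampling step of an exponentially damped average."""
    decay = math.exp(-interval / period)
    return load * decay + active * (1.0 - decay)


def simulate(cpu_bound, nfs_blocked, steps=120):
    """Return the 1-minute style load after `steps` samples."""
    load = 0.0
    for _ in range(steps):
        # Assumption: processes blocked on NFS are counted together
        # with the processes that are actually runnable.
        load = update_load(load, cpu_bound + nfs_blocked)
    return load


if __name__ == "__main__":
    print("healthy NFS:", round(simulate(cpu_bound=1, nfs_blocked=0), 2))
    print("slow NFS:   ", round(simulate(cpu_bound=1, nfs_blocked=20), 2))
```

With one busy process and twenty stuck on a slow server, the simulated
load settles near 21 although only one process is doing any work.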

David> performance.  3. File servers that randomly crash with "Fatal
David> trap 12: page fault while in kernel mode" 4. With soft updates
David> enabled during FSCK the fileserver will freeze with all NFS
David> processes in the "snaplck" state. We disabled soft updates
David> because of this.

The remainder of this sounds like memory corruption.


