Stale NFS file handles on 8.x amd64

Jeremy Chadwick freebsd at jdc.parodius.com
Tue Nov 30 01:46:41 UTC 2010


On Mon, Nov 29, 2010 at 08:06:54PM -0500, Adam McDougall wrote:
> I've been running dovecot 1.1 on FreeBSD 7.x for a while with a bare
> minimum of NFS problems, but it got worse with 8.x.  I have 2-4
> servers (usually just 2) accessing mail on a Netapp over NFSv3 via
> imapd. delivery is via procmail which doesn't touch the dovecot
> metadata and webmail uses imapd.  Client connections to imapd go to
> random servers and I don't yet have solid means to keep certain
> users on certain servers.  I upgraded some of the servers to 8.x and
> dovecot 1.2 and ran into Stale NFS file handles causing
> index/uidlist corruption causing inboxes to appear as empty when
> they were not.  In some situations their corrupt index had to be
> deleted manually.  I first suspected dovecot 1.2 since it was
> upgraded at the same time but I downgraded to 1.1 and its doing the
> same thing.  I don't really have a wealth of details to go on yet
> and I usually stay quiet until I do, and half the time it is
> difficult to reproduce myself so I've had to put it in production to
> get a feel for progress.  This only happens a dozen or so times per
> weekday but I feel the need to start taking bigger steps.  I'll
> probably do what I can to get IMAP back on a stable base (7.x?) and
> also try to debug 8.x on the remaining servers.  A binary search is
> within possibility if I can reproduce the symptoms often enough even
> if I have to put a test server in production for a few hours.
> 
> Any tips on where we could start looking, or alterations I could try
> making such as sysctls to return to older behavior?

http://wiki1.dovecot.org/NFS is a good start, especially if this problem
is only seen with Dovecot.  I would start there, specially adjusting
your dovecot.conf to include the necessary directives.

> It might be
> worth noting that I've seen a considerable increase in traffic from
> my mail servers since the 8.x upgrade timeframe, on the order of
> 5-10x as much traffic to the NFS server.  dovecot tries its hardest
> to flush out the access cache when needed and it was working well
> enough since about 1.0.16 (years ago).  It seems like FreeBSD is
> what regressed in this scenario.  dovecot 2.x is going in a
> different direction from my situation and I'm not ready to start
> testing that immediately if I can avoid it as it will involve some
> restructuring.
> 
> Thanks for any input.  For now the following errors are about all I
> have to go on:
> 
> Nov 29 11:07:54 server1 dovecot: IMAP(user1): o_stream_send(/home/user1/Maildir/dovecot/private/control/.INBOX/dovecot-uidlist)
> failed: Stale NFS file handle
> Nov 29 13:19:51 server1 dovecot: IMAP(user1): o_stream_send(/home/user1/Maildir/dovecot/private/control/.INBOX/dovecot-uidlist)
> failed: Stale NFS file handle
> Nov 29 14:35:41 server1 dovecot: IMAP(user2): o_stream_send(/home/user2/Maildir/dovecot/private/control/.INBOX/dovecot-uidlist)
> failed: Stale NFS file handle
> Nov 29 15:07:05 server1 dovecot: IMAP(user3): read(mail, uid=128990)
> failed: Stale NFS file handle
> 
> Nov 29 11:57:22 server2 dovecot: IMAP(user4):
> open(/egr/mail/shared/vprgs/dovecot-acl-list) failed: Stale NFS file
> handle
> Nov 29 14:04:22 server2 dovecot: IMAP(user5): o_stream_send(/home/user5/Maildir/dovecot/private/control/.INBOX/dovecot-uidlist)
> failed: Stale NFS file handle
> Nov 29 14:27:21 server2 dovecot: IMAP(user6): o_stream_send(/home/user6/Maildir/dovecot/private/control/.INBOX/dovecot-uidlist)
> failed: Stale NFS file handle
> Nov 29 15:44:38 server2 dovecot: IMAP(user7):
> open(/egr/mail/shared/decs/dovecot-acl-list) failed: Stale NFS file
> handle
> Nov 29 19:04:54 server2 dovecot: IMAP(user8): o_stream_send(/home/user8/Maildir/dovecot/private/control/.INBOX/dovecot-uidlist)
> failed: Stale NFS file handle
> 
> Nov 29 06:32:11 server3 dovecot: IMAP(user9):
> open(/egr/mail/shared/cmsc/dovecot-acl-list) failed: Stale NFS file
> handle
> Nov 29 10:03:58 server3 dovecot: IMAP(user10): o_stream_send(/home/user10/Maildir/dovecot/private/control/.INBOX/dovecot-uidlist)
> failed: Stale NFS file handle

-- 
| Jeremy Chadwick                                   jdc at parodius.com |
| Parodius Networking                       http://www.parodius.com/ |
| UNIX Systems Administrator                  Mountain View, CA, USA |
| Making life hard for others since 1977.              PGP: 4BD6C0CB |



More information about the freebsd-stable mailing list