zpool failmode=continue

Jeremy Chadwick freebsd at jdc.parodius.com
Tue Dec 27 21:15:01 UTC 2011


On Tue, Dec 27, 2011 at 04:37:32PM +0000, Johannes Totz wrote:
> On 13/12/2011 14:53, Johannes Totz wrote:
> >On 13/12/2011 14:44, Peter Maloney wrote:
> >>Are you using NFS or ZVOLs?
> >
> >Neither, see below.
> >
> >>My zfs hangs (all IO) if I go into the .zfs/snapshots directory over
> >>NFS. (planning to file a PR after I find a way to reproduce it reliably,
> >>but it depends on specific snapshots). My workaround is to mount
> >>/var/empty on top of the .zfs directory on the nfs client, and give
> >>nobody else access. Another workaround I thought of is to have another
> >>parent directory in the dataset, and share the 2nd level down which
> >>doesn't contain the .zfs directory.
> >
> >My pool is not exported to any clients.  My situation is actually
> >the other way around; I should have been clearer: the block device
> >on which I created the pool is on the network.
> >It's kind of a crazy setup:
> >- sshfs to another (Linux) machine
> >- create a big image file
> >- create a pool from a file vdev mounted via sshfs
> >Eventually the network drops out and zpool shows read and write
> >errors, fine so far.  But all new I/O just hangs instead of failing
> >with an error.
> 
> After some observation, it turns out that
> periodic/security/100.chksetuid makes all I/O die on the test pool.
> Is find doing something funny?  It does not even search the test
> pool (it's imported but not mounted) or the sshfs (only UFS and ZFS
> are searched), so I don't have any clue as to what might go wrong...
> zpool status simply mentions read/write errors.
> 
> I noticed this because, when logging iostat to a file, I/O always
> stopped at 3 AM.  But I can also trigger it by simply running
> 100.chksetuid.  All the other stuff in daily and security is fine.
> 
> Does anybody have any idea what might cause it?
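
For reference, the setup you described boils down to roughly this (the
host name and paths below are made up):

        # Mount a directory from the remote Linux box via sshfs
        # (sysutils/fusefs-sshfs).
        sshfs user@linuxbox:/export /mnt/remote

        # Create a large image file there and use it as a file-backed
        # vdev for a test pool.
        truncate -s 100G /mnt/remote/zfs.img
        zpool create testpool /mnt/remote/zfs.img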

This "problem" (note the quotes) has been brought up before.  There
isn't anything wrong with the periodic script; if you look at it,
you'll see that it's "heavy" on I/O due to all of the operations
being done:

        find -sx $MP /dev/null -type f \
            \( -perm -u+x -or -perm -g+x -or -perm -o+x \) \
            \( -perm -u+s -or -perm -g+s \) -exec ls -liTd \{\} \+ |
        check_diff setuid - "${host} setuid diffs:"

This is going to traverse the filesystem and make a couple of stat(2)
calls per file (I assume find(1) is smart enough to consolidate them
into 1 or 2 at most), plus there's the -exec on the results.  Since
the script uses the "-exec ... +" form, the matches at least get
batched into as few ls(1) invocations as possible, much as xargs
would (piping to xargs directly would be risky here because of
filenames containing whitespace).
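
For what it's worth, here's a rough illustration of the difference
between the two -exec forms (the paths are made up; the script above
uses the second form):

        # One ls(1) invocation per matching file (more fork/exec
        # overhead):
        find -sx / /dev/null -type f -perm -u+s -exec ls -liTd {} \;

        # Matches batched into as few ls(1) invocations as possible,
        # much like piping through xargs would:
        find -sx / /dev/null -type f -perm -u+s -exec ls -liTd {} +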

I can try to dig up those threads for you, but I'm sure that if you
search the mailing lists for "100.chksetuid zfs" you'll find them.

ZFS tends to "bring to light" underlying issues with hardware, as it
stresses the system a lot more than UFS would.  For example, folks
using mps(4) (I think; I'm trying to remember which LSI driver) were
having problems for a while, and "fixes to make ZFS happy" were
committed to the driver.  You get the idea, I hope.

Your statement here:

> zpool status simply mentions read/write errors.

pretty much confirms this.

You're going to need to provide a verbose description of your setup:
all storage controllers used with or associated with ZFS, every disk
involved (model strings would be helpful), SMART statistics for each
disk (smartctl -A) if possible, whether you have a heterogeneous setup
(ZFS on some disks, UFS on others), make.conf, loader.conf,
sysctl.conf, full "zpool status" output, "uname -a", full "dmesg"
output, etc.  I make no promises that there's a solution to this,
either.
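
If it helps, something along these lines will collect most of that in
one place (the device name is just an example; adjust for your disks,
skip any config files that don't exist, and smartctl comes from
sysutils/smartmontools):

        # Basic system and pool information.
        uname -a        > /tmp/diag-uname.txt
        zpool status -v > /tmp/diag-zpool.txt
        dmesg           > /tmp/diag-dmesg.txt
        cp /etc/make.conf /etc/sysctl.conf /boot/loader.conf /tmp/

        # SMART attributes, one run per disk; ada0 is an example.
        smartctl -A /dev/ada0 > /tmp/diag-smart-ada0.txt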

-- 
| Jeremy Chadwick                                jdc at parodius.com |
| Parodius Networking                       http://www.parodius.com/ |
| UNIX Systems Administrator                   Mountain View, CA, US |
| Making life hard for others since 1977.               PGP 4BD6C0CB |


