Zfs locking up process

Rajil Saraswat rajil.s at gmail.com
Wed Oct 7 15:11:59 UTC 2015


Hello

I have a server running FreeNAS 9.3 with a few jails. The machine has two new
disks set up as a mirror. I have a dataset (/mnt/tank/media) which is shared
between two jails.

Unfortunately, sometimes when I run ls in the shared directory inside a jail,
the process just hangs.

Today I ran 'su' in the jail and the process just hung. On the host, if I run
ls /mnt/tank/media it also hangs.

The su process (pid 77477) is using 100% CPU in the jail, and other processes
are blocked on the zfs wait channel. It seems that ZFS is holding up the
processes. Any idea what could be wrong?

Thanks
Rajil

On the host
[root@freenas] ~# ps O nwchan
  PID NWCHAN           TT  STAT     TIME COMMAND
14248 fffffe000962c8a8 v0  Is+   0:00.73 python /etc/netcli (python2.7)
14249 fffffe000962c4a8 v1  Is+   0:00.00 /usr/libexec/getty Pc ttyv1
14250 fffffe000962c0a8 v2  Is+   0:00.00 /usr/libexec/getty Pc ttyv2
14251 fffffe000962cca8 v3  Is+   0:00.00 /usr/libexec/getty Pc ttyv3
14252 fffffe00096284a8 v4  Is+   0:00.00 /usr/libexec/getty Pc ttyv4
14253 fffffe000962eca8 v5  Is+   0:00.00 /usr/libexec/getty Pc ttyv5
14254 fffffe000962e8a8 v6  Is+   0:00.00 /usr/libexec/getty Pc ttyv6
14255 fffffe000962e4a8 v7  Is+   0:00.00 /usr/libexec/getty Pc ttyv7
77477 -                 2- R+J  36:49.59 _su (csh)
77541 fffffe0043d31488  3- D+J   0:00.01 _su (csh)
45586 fffffe00a44050a0  0  Is    0:00.02 -csh (csh)
74258 fffffe00076824a8  0  I+J   0:00.01 csh
77586 fffffe0043d31488  4  Ds+   0:00.01 -csh (csh)
77621 fffffe0043d31488  5- D+    0:00.00 ls -GFa
77626 fffffe01e7a214a8  6  Is+   0:00.02 -csh (csh)
79795 fffffe0055e540a0  8  Ss    0:00.01 -csh (csh)
79798 -                 8  R+    0:00.00 ps O nwchan

[root@freenas] ~# ps O wchan
  PID WCHAN TT  STAT     TIME COMMAND
14248 ttyin v0  Is+   0:00.73 python /etc/netcli (python2.7)
14249 ttyin v1  Is+   0:00.00 /usr/libexec/getty Pc ttyv1
14250 ttyin v2  Is+   0:00.00 /usr/libexec/getty Pc ttyv2
14251 ttyin v3  Is+   0:00.00 /usr/libexec/getty Pc ttyv3
14252 ttyin v4  Is+   0:00.00 /usr/libexec/getty Pc ttyv4
14253 ttyin v5  Is+   0:00.00 /usr/libexec/getty Pc ttyv5
14254 ttyin v6  Is+   0:00.00 /usr/libexec/getty Pc ttyv6
14255 ttyin v7  Is+   0:00.00 /usr/libexec/getty Pc ttyv7
77477 -      2- R+J  36:49.59 _su (csh)
77541 zfs    3- D+J   0:00.01 _su (csh)
45586 pause  0  Is    0:00.02 -csh (csh)
74258 ttyin  0  I+J   0:00.01 csh
77586 zfs    4  Ds+   0:00.01 -csh (csh)
77621 zfs    5- D+    0:00.00 ls -GFa
77626 ttyin  6  Is+   0:00.02 -csh (csh)
79795 pause  8  Ss    0:00.01 -csh (csh)
79800 -      8  R+    0:00.00 ps O wchan
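
The processes blocked on ZFS can be picked out of a listing like the one above by filtering on the WCHAN column. This is only a minimal sketch, assuming the `ps O wchan` column layout shown here (PID first, WCHAN second, STAT fourth); the function name `zfs_waiters` is made up for illustration.

```shell
# Print the PID and STAT of every process whose wait channel is "zfs".
# Reads `ps O wchan`-style output on stdin; assumes WCHAN is column 2
# and STAT is column 4, as in the listing above (hypothetical helper).
zfs_waiters() {
    awk '$2 == "zfs" { print $1, $4 }'
}

# Possible use on the host: ps O wchan | zfs_waiters
```

Against the listing above this would report 77541, 77586 and 77621, all in state D (uninterruptible wait). The spinning process 77477 has no wait channel and so is not matched.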

Inside the jail

[r@sshserver ~]$ ps -axl
UID   PID  PPID CPU PRI NI   VSZ  RSS MWCHAN STAT TT      TIME COMMAND
   0  9438     1   0  31  0 12084 1916 select IsJ  ??   0:00.00 dhclient: epair3b [priv] (dhclient)
  65  9490     1   0  20  0 12084 1976 select IsJ  ??   0:00.00 dhclient: epair3b (dhclient)
   0  9878     1   0  20  0 12080 1868 select SsJ  ??   0:00.03 /usr/sbin/syslogd -s
   0  9948     1   0  20  0 14184 1860 nanslp IsJ  ??   0:00.03 /usr/sbin/cron -s
   0 45452     1   0  22  0 52036 4916 select IsJ  ??   0:00.01 sshd: r [priv] (sshd)
1001 45454 45452   0  20  0 52036 5000 select IJ   ??   0:00.01 sshd: r@pts/1 (sshd)
   0 71094     1   0  20  0 26728 4456 select SsJ  ??   0:00.00 /usr/local/sbin/sshd
   0 79100 71094   0  22  0 52036 4912 select IsJ  ??   0:00.01 sshd: r [priv] (sshd)
1001 79102 79100   0  20  0 52036 4948 select IJ   ??   0:00.00 sshd: r@pts/7 (sshd)
   0 80400 71094   0  23  0 52036 4912 select IsJ  ??   0:00.01 sshd: r [priv] (sshd)
1001 80402 80400   0  20  0 52036 4920 select IJ   ??   0:00.00 sshd: r@pts/8 (sshd)
   0 80634 71094   0  22  0 52036 4912 select SsJ  ??   0:00.01 sshd: r [priv] (sshd)
1001 80636 80634   0  20  0 52036 4920 select SJ   ??   0:00.00 sshd: r@pts/9 (sshd)
   0 77477     1   0  20  0 17568 2556 -      R+J   2- 76:29.08 _su (csh)
   0 77541     1   0  20  0 17568 3156 zfs    D+J   3-  0:00.01 _su (csh)
1001 45455 45454   0  20  0 17512 3252 ttyin  Is+J  1   0:00.00 -bash (bash)
1001 79103 79102   0  20  0 17512 3332 ttyin  Is+J  7   0:00.00 -bash (bash)
1001 80403 80402   0  20  0 17512 3332 ttyin  Is+J  8   0:00.00 -bash (bash)
1001 80637 80636   0  20  0 17512 3332 wait   SsJ   9   0:00.00 -bash (bash)
1001 80641 80637   0  20  0 16296 1852 -      R+J   9   0:00.00 ps -axl

