twa + dump = sbwait

Kostik Belousov kostikbel at gmail.com
Mon Sep 17 01:58:11 PDT 2007


On Mon, Sep 17, 2007 at 12:45:57PM +0400, Boris Samorodov wrote:
> On Mon, 17 Sep 2007 11:32:13 +0300 Kostik Belousov wrote:
> > On Mon, Sep 17, 2007 at 12:09:29PM +0400, Boris Samorodov wrote:
> > > On Mon, 17 Sep 2007 11:04:15 +0300 Kostik Belousov wrote:
> > > > On Mon, Sep 17, 2007 at 11:28:58AM +0400, Boris Samorodov wrote:
> > > > > Hi!
> > > > > 
> > > > > 
> > > > > I can't use dump at -CURRENT with twa. The process goes to sbwait
> > > > > state forever. Here are the details.
> > > > > 
> > > > > $ uname -a
> > > > > FreeBSD test.ipt.ru 7.0-CURRENT FreeBSD 7.0-CURRENT #0: Sat Sep 15 19:37:17 MSD 2007     bsam at test.ipt.ru:/usr/obj/usr/src/sys/TEST  amd64
> > > > > 
> > > > > The controller:
> > > > > twa0: <3ware 9000 series Storage Controller> port 0x3000-0x30ff mem 0xd8000000-0xd9ffffff,0xda300000-0xda300fff irq 16 at device 0.0 on pci8
> > > > > twa0: [GIANT-LOCKED]
> > > > > twa0: [ITHREAD]
> > > > > twa0: INFO: (0x04: 0x0053): Battery capacity test is overdue: 
> > > > > twa0: INFO: (0x15: 0x1300): Controller details:: Model 9650SE-8LPML, 8 ports, Firmware FE9X 3.06.00.005, BIOS BE9X 3.06.00.002
> > > > > 
> > > > > Two disks at stripe:
> > > > > da1 at twa0 bus 0 target 1 lun 0
> > > > > da1: <AMCC 9650SE-8LP DISK 3.06> Fixed Direct Access SCSI-5 device 
> > > > > da1: 100.000MB/s transfers
> > > > > da1: 476816MB (976519168 512 byte sectors: 255H 63S/T 60785C)
> > > > > 
> > > > > Mounted as:
> > > > > /dev/da1    /s  ufs rw 2 2
> > > > > 
> > > > > The command:
> > > > > $ dump -0Luan -f s.dump /s
> > > > >   DUMP: Date of this level 0 dump: Mon Sep 17 10:26:21 2007
> > > > >   DUMP: Date of last level 0 dump: the epoch
> > > > >   DUMP: Dumping snapshot of /dev/da1 (/s) to s.dump
> > > > >   DUMP: mapping (Pass I) [regular files]
> > > > >   DUMP: mapping (Pass II) [directories]
> > > > >   DUMP: estimated 432054 tape blocks.
> > > > >   DUMP: dumping (Pass III) [directories]
> > > > >   DUMP: dumping (Pass IV) [regular files]
> > > > > [wait here forever]
> > > > > 
> > > > > The relevant part of iostat -w 10 and top:
> > > > > -----
> > > > >       tty             da0              da1             cpu
> > > > >  tin tout  KB/t tps  MB/s   KB/t tps  MB/s  us ni sy in id
> > > > >    0  231 16.00   0  0.01  16.00 152  2.37   0  0  1  0 99
> > > > >    0  257 16.00   1  0.02  16.00 147  2.29   0  0  1  0 99
> > > > >    0  310 16.00   1  0.01  16.00 146  2.28   0  0  1  0 99
> > > > >    0  265 16.00   0  0.01  16.00 152  2.38   0  0  1  0 99
> > > > >    0  442 16.00   0  0.00  15.98 299  4.66   0  0  1  0 99
> > > > >    0  620 15.54   3  0.04   3.46 1999  6.76   0  0  2  1 96
> > > > >    0  363 16.00   0  0.01   2.77 5141 13.90   0  0  5  2 93
> > > > >    0  309  0.00   0  0.00   3.22 114  0.36   0  0  0  0 100
> > > > >    0  260 16.00   1  0.01   0.00   0  0.00   0  0  0  0 100
> > > > >    0  263  0.00   0  0.00   0.00   0  0.00   0  0  0  0 100
> > > > > 
> > > > > last pid: 74610;  load averages:  0.01,  0.06,  0.01    up 1+09:53:34  10:32:34
> > > > > 98 processes:  98 sleeping
> > > > > CPU states:  0.0% user,  0.0% nice,  0.3% system,  0.0% interrupt, 99.6% idle
> > > > > Mem: 113M Active, 2374M Inact, 337M Wired, 480K Cache, 214M Buf, 4935M Free
> > > > > Swap: 4096M Total, 4096M Free
> > > > > 
> > > > >   PID USERNAME  THR PRI NICE   SIZE    RES STATE  C   TIME   WCPU COMMAND
> > > > > 74595 duser       1  20    0 28248K 26000K pause  5   0:01  0.00% dump
> > > > > 74596 duser       1  20    0 28248K 26000K pause  2   0:01  0.00% dump
> > > > > 74594 duser       1  20    0 28248K 26000K pause  5   0:01  0.00% dump
> > > > > 74593 duser       1   4    0 28248K 26020K sbwait 7   0:01  0.00% dump
> > > > > 74530 duser       1   8    0 28248K 26000K wait   6   0:01  0.00% dump
> > > > > 74443 duser       1   8    0  6216K  1904K wait   5   0:00  0.00% sh
> > > > > -----
> > > > > 
> > > > > Gdb shows nothing interesting:
> > > > > # gdb dump 74593
> > > > > GNU gdb 6.1.1 [FreeBSD]
> > > > > Copyright 2004 Free Software Foundation, Inc.
> > > > > GDB is free software, covered by the GNU General Public License, and you are
> > > > > welcome to change it and/or distribute copies of it under certain conditions.
> > > > > Type "show copying" to see the conditions.
> > > > > There is absolutely no warranty for GDB.  Type "show warranty" for details.
> > > > > This GDB was configured as "amd64-marcel-freebsd"...(no debugging symbols found)...
> > > > > Attaching to program: /sbin/dump, process 74593
> > > > > Reading symbols from /lib/libc.so.7...(no debugging symbols found)...done.
> > > > > Loaded symbols for /lib/libc.so.7
> > > > > Reading symbols from /libexec/ld-elf.so.1...(no debugging symbols found)...done.
> > > > > Loaded symbols for /libexec/ld-elf.so.1
> > > > > 0x000000080071b34a in read () from /lib/libc.so.7
> > > > > 
> > > > > Shall I do some more debugging? Thanks!
> > > 
> > > > Please, verify that you have rev. 1.39 of sys/kern/subr_sleepqueue.c.
> > > 
> > > % grep FBSDID /sys/kern/subr_sleepqueue.c
> > > __FBSDID("$FreeBSD: src/sys/kern/subr_sleepqueue.c,v 1.39 2007/09/13 09:12:36 attilio Exp $");
> 
> > Most likely, this is a different bug then. Just to make sure, please, show
> > the output of ps axl | grep dump. The dump processes shall sleep, and be
> > killable (check that, please).
> 
> % ps axwwl | grep dump | grep -v grep
>  9900 74530 74443   0   8  0 28248 26000 wait   I+    p1    0:00.88 dump -0Luan -f s.dump /s (dump)
>  9900 74593 74530   0   4  0 28248 26020 sbwait I+    p1    0:01.05 dump: /dev/da1: pass 4: 44.73% done, finished in 0:00 at Mon Sep 17 10:27:03 2007 (dump)
>  9900 74594 74593   1  20  0 28248 26000 pause  I+    p1    0:01.06 dump -0Luan -f s.dump /s (dump)
>  9900 74595 74593   0  20  0 28248 26000 pause  I+    p1    0:01.06 dump -0Luan -f s.dump /s (dump)
>  9900 74596 74593   0  20  0 28248 26000 pause  I+    p1    0:01.06 dump -0Luan -f s.dump /s (dump)
> 
> 
> The process is killable:
> # killall dump
> [...]
>   DUMP: Is the new volume mounted and ready to go?: ("yes" or "no") 
>   DUMP: "Yes" or "No"?
>   DUMP: Is the new volume mounted and ready to go?: ("yes" or "no") no
>   DUMP: Do you want to abort?: ("yes" or "no") yes
>   DUMP: The ENTIRE dump is aborted.
> #
And, are you sure that no dump processes left in the system ? If yes, this
is indeed some other bug.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 187 bytes
Desc: not available
Url : http://lists.freebsd.org/pipermail/freebsd-current/attachments/20070917/e0e50798/attachment.pgp


More information about the freebsd-current mailing list