From KIRK at STRAUSER.COM Sat Nov 1 19:20:04 2008 From: KIRK at STRAUSER.COM (Kirk Strauser) Date: Sat Nov 1 19:20:11 2008 Subject: kern/128452: [sa] [panic] Accessing SCSI tape drive randomly crashes my amd64 system Message-ID: <200811020220.mA22K4dY063564@freefall.freebsd.org> The following reply was made to PR kern/128452; it has been noted by GNATS. From: Kirk Strauser To: bug-followup@FreeBSD.org, kirk@strauser.com Cc: Subject: Re: kern/128452: [sa] [panic] Accessing SCSI tape drive randomly crashes my amd64 system Date: Sat, 1 Nov 2008 21:16:51 -0500 And other kernel panic dump. Daily crashes while making backups are pretty standard now. This problem started when the machine was an Athlon 1.4GHz. Since then, I have replaced literally every component but the tape drive itself, including an identical model Tekram DC-390F SCSI card (I bought two a while back and swapped cards between servers for testing). Is there any additional information I could provide to help out? (kgdb) list *0xffffffff80739535 0xffffffff80739535 is in pmap_clear_modify (atomic.h:143). 138 atomic.h: No such file or directory. in atomic.h (kgdb) bt #0 doadump () at pcpu.h:195 #1 0x0000000000000004 in ?? () #2 0xffffffff80488641 in boot (howto=260) at /usr/src/sys/kern/ kern_shutdown.c:418 #3 0xffffffff80488a7c in panic (fmt=0x104
) at /usr/src/sys/kern/kern_shutdown.c:574 #4 0xffffffff8074081a in trap_fatal (frame=0xffffff0001101a50, eva=Variable "eva" is not available. ) at /usr/src/sys/amd64/amd64/trap.c:764 #5 0xffffffff807412d8 in trap (frame=0xffffffffac26ca40) at /usr/src/ sys/amd64/amd64/trap.c:565 #6 0xffffffff80727ece in calltrap () at /usr/src/sys/amd64/amd64/ exception.S:209 #7 0xffffffff80739535 in pmap_clear_modify (m=0xffffff00c4f74e20) at atomic.h:143 #8 0xffffffff8069ca79 in vm_page_set_validclean (m=0xffffff00c4f74e20, base=0, size=4096) at /usr/src/sys/vm/vm_page.c:1813 #9 0xffffffff804eda0f in bufdone_finish (bp=0xffffffff9a26e6a0) at /usr/src/sys/kern/vfs_bio.c:3272 #10 0xffffffff804edcec in bufdone (bp=0xffffffff9a26e6a0) at /usr/src/ sys/kern/vfs_bio.c:3173 #11 0xffffffff804f0375 in cluster_callback (bp=0xffffffff99fbe720) at /usr/src/sys/kern/vfs_cluster.c:542 #12 0xffffffff804edcc5 in bufdone (bp=0xffffffff99fbe720) at /usr/src/ sys/kern/vfs_bio.c:3167 #13 0xffffffff8043c2c2 in g_io_schedule_up (tp=Variable "tp" is not available. ) at /usr/src/sys/geom/geom_io.c:587 #14 0xffffffff8043c566 in g_up_procbody () at /usr/src/sys/geom/ geom_kern.c:95 #15 0xffffffff80468d5d in fork_exit (callout=0xffffffff8043c514 , arg=0x0, frame=0xffffffffac26cc80) at /usr/src/sys/kern/kern_fork.c:804 #16 0xffffffff8072829e in fork_trampoline () at /usr/src/sys/amd64/ amd64/exception.S:455 #17 0x0000000000000000 in ?? () #18 0x0000000000000000 in ?? () #19 0x0000000000000001 in ?? () #20 0x0000000000000000 in ?? () #21 0x0000000000000000 in ?? () #22 0x0000000000000000 in ?? () #23 0x0000000000000000 in ?? () #24 0x0000000000000000 in ?? () #25 0x0000000000000000 in ?? () #26 0x0000000000000000 in ?? () #27 0x0000000000000000 in ?? () #28 0x0000000000000000 in ?? () #29 0x0000000000000000 in ?? () #30 0x0000000000000000 in ?? () #31 0x0000000000000000 in ?? () #32 0x0000000000000000 in ?? () ---Type to continue, or q to quit--- #33 0x0000000000000000 in ?? () #34 0x0000000000000000 in ?? () #35 0x0000000000000000 in ?? () #36 0x0000000000000000 in ?? () #37 0x0000000000000000 in ?? () #38 0x0000000000000000 in ?? () #39 0x0000000000000000 in ?? () #40 0x0000000000000000 in ?? () #41 0x0000000000d0e000 in ?? () #42 0xffffffff80a69180 in tdg_maxid () #43 0xffffffff80a75980 in tdq_cpu () #44 0xffffffff80a75980 in tdq_cpu () #45 0xffffff0001101a50 in ?? () #46 0xffffff0001101d80 in ?? () #47 0xffffffffac26c0c8 in ?? () #48 0x0000000000000000 in ?? () #49 0xffffffff804a78ee in sched_switch (td=0xffffffff8043c514, newtd=0x800530450, flags=Variable "flags" is not available. ) at /usr/src/sys/kern/sched_ule.c:1938 #50 0x0000000000000000 in ?? () #51 0x0000000000000000 in ?? () #52 0x0000000000000000 in ?? () #53 0x0000000000000000 in ?? () #54 0x0000000000000000 in ?? () #55 0x0000000000000000 in ?? () #56 0x0000000000000000 in ?? () #57 0x0000000000000000 in ?? () #58 0x0000000000000000 in ?? () #59 0x0000000000000000 in ?? () #60 0x0000000000000000 in ?? () #61 0x0000000000000000 in ?? () #62 0x0000000000000000 in ?? () #63 0x0000000000000000 in ?? () #64 0x0000000000000000 in ?? () #65 0x0000000000000000 in ?? () #66 0x0000000000000000 in ?? () #67 0x0000000000000000 in ?? () #68 0x0000000000000000 in ?? () #69 0x0000000000000000 in ?? () #70 0x0000000000000000 in ?? () ---Type to continue, or q to quit--- #71 0x0000000000000000 in ?? () #72 0x0000000000000000 in ?? () #73 0x0000000000000000 in ?? () #74 0x0000000000000000 in ?? () #75 0x0000000000000000 in ?? () #76 0x0000000000000000 in ?? () #77 0x0000000000000000 in ?? () #78 0x0000000000000000 in ?? () #79 0x0000000000000000 in ?? () #80 0x0000000000000000 in ?? () #81 0x0000000000000000 in ?? () #82 0x0000000000000000 in ?? () #83 0x0000000000000000 in ?? () #84 0x0000000000000000 in ?? () #85 0x0000000000000000 in ?? () #86 0x0000000000000000 in ?? () #87 0x0000000000000000 in ?? () #88 0x0000000000000000 in ?? () #89 0x0000000000000000 in ?? () #90 0x0000000000000000 in ?? () #91 0x0000000000000000 in ?? () #92 0x0000000000000000 in ?? () #93 0x0000000000000000 in ?? () #94 0x0000000000000000 in ?? () #95 0x0000000000000000 in ?? () #96 0x0000000000000000 in ?? () #97 0x0000000000000000 in ?? () #98 0x0000000000000000 in ?? () #99 0x0000000000000000 in ?? () #100 0x0000000000000000 in ?? () #101 0x0000000000000000 in ?? () #102 0x0000000000000000 in ?? () #103 0x0000000000000000 in ?? () #104 0x0000000000000000 in ?? () #105 0x0000000000000000 in ?? () #106 0x0000000000000000 in ?? () #107 0x0000000000000000 in ?? () #108 0x0000000000000000 in ?? () #109 0x0000000000000000 in ?? () ---Type to continue, or q to quit--- #110 0x0000000000000000 in ?? () #111 0x0000000000000000 in ?? () #112 0x0000000000000000 in ?? () #113 0x0000000000000000 in ?? () #114 0x0000000000000000 in ?? () #115 0x0000000000000000 in ?? () #116 0x0000000000000000 in ?? () #117 0x0000000000000000 in ?? () Cannot access memory at address 0xffffffffac26d000 From bugmaster at FreeBSD.org Mon Nov 3 03:06:59 2008 From: bugmaster at FreeBSD.org (FreeBSD bugmaster) Date: Mon Nov 3 03:08:53 2008 Subject: Current problem reports assigned to freebsd-scsi@FreeBSD.org Message-ID: <200811031106.mA3B6wGX011033@freefall.freebsd.org> Note: to view an individual PR, use: http://www.freebsd.org/cgi/query-pr.cgi?pr=(number). The following is a listing of current problems submitted by FreeBSD users. These represent problem reports covering all versions including experimental development code and obsolete releases. S Tracker Resp. Description -------------------------------------------------------------------------------- o kern/128452 scsi [sa] [panic] Accessing SCSI tape drive randomly crashe o kern/128245 scsi [scsi] "inquiry data fails comparison at DV1 step" [re o i386/127927 scsi isp(4) target driver crashes kernel when set up dma fo o kern/127901 scsi [scsi] "inquiry data fails comparison at DV1 step" [re o kern/126866 scsi [isp] [panic] kernel panic on card initialization o kern/124667 scsi [amd] [panic] FreeBSD-7 kernel page faults at amd-scsi o kern/123674 scsi [ahc] ahc driver dumping o kern/123666 scsi [aac] attach fails with Adaptec SAS RAID 3805 controll o sparc/121676 scsi [iscsi] iscontrol do not connect iscsi-target on sparc o kern/120487 scsi [sg] scsi_sg incompatible with scanners o kern/120247 scsi [mpt] FreeBSD 6.3 and LSI Logic 1030 = only 3.300MB/s o kern/119668 scsi [cam] [patch] certain errors are too verbose comparing o kern/114597 scsi [sym] System hangs at SCSI bus reset with dual HBAs o kern/110847 scsi [ahd] Tyan U320 onboard problem with more than 3 disks o kern/99954 scsi [ahc] reading from DVD failes on 6.x [regression] o kern/94838 scsi Kernel panic while mounting SD card with lock switch o o kern/92798 scsi [ahc] SCSI problem with timeouts o kern/90282 scsi [sym] SCSI bus resets cause loss of ch device o kern/76178 scsi [ahd] Problem with ahd and large SCSI Raid system o kern/74627 scsi [ahc] [hang] Adaptec 2940U2W Can't boot 5.3 s kern/61165 scsi [panic] kernel page fault after calling cam_send_ccb o kern/60641 scsi [sym] Sporadic SCSI bus resets with 53C810 under load o kern/60598 scsi wire down of scsi devices conflicts with config s kern/57398 scsi [mly] Current fails to install on mly(4) based RAID di o kern/52638 scsi [panic] SCSI U320 on SMP server won't run faster than o kern/44587 scsi dev/dpt/dpt.h is missing defines required for DPT_HAND o kern/40895 scsi wierd kernel / device driver bug o kern/39388 scsi ncr/sym drivers fail with 53c810 and more than 256MB m o kern/38828 scsi [dpt] [request] DPT PM2012B/90 doesn't work o kern/35234 scsi World access to /dev/pass? (for scanner) requires acce 30 problems total. From kirk at strauser.com Sun Nov 9 09:20:09 2008 From: kirk at strauser.com (Kirk Strauser) Date: Sun Nov 9 09:20:20 2008 Subject: kern/128452: [sa] [panic] Accessing SCSI tape drive randomly crashes my amd64 system Message-ID: <200811091720.mA9HK8LH008361@freefall.freebsd.org> The following reply was made to PR kern/128452; it has been noted by GNATS. From: Kirk Strauser To: bug-followup@FreeBSD.org, kirk@strauser.com Cc: Subject: Re: kern/128452: [sa] [panic] Accessing SCSI tape drive randomly crashes my amd64 system Date: Sun, 9 Nov 2008 11:16:30 -0600 I got another panic this morning when starting an Amanda "flush" from disk to tape. I had recompiled the kernel with SCHED_4BSD instead of SCHED_ULE for testing. Also, I've run memtest on this system for 8+ hours straight with no RAM errors. # kgdb /boot/kernel/kernel /var/crash/vmcore.10 GNU gdb 6.1.1 [FreeBSD] Copyright 2004 Free Software Foundation, Inc. GDB is free software, covered by the GNU General Public License, and you are welcome to change it and/or distribute copies of it under certain conditions. Type "show copying" to see the conditions. There is absolutely no warranty for GDB. Type "show warranty" for details. This GDB was configured as "amd64-marcel-freebsd"... Unread portion of the kernel message buffer: Fatal trap 12: page fault while in kernel mode cpuid = 0; apic id = 00 fault virtual address = 0x258 fault code = supervisor read data, page not present instruction pointer = 0x8:0xffffffff8047d41a stack pointer = 0x10:0xffffffffaef6cac0 frame pointer = 0x10:0xffffff000443aa50 code segment = base 0x0, limit 0xfffff, type 0x1b = DPL 0, pres 1, long 1, def32 0, gran 1 processor eflags = interrupt enabled, resume, IOPL = 0 current process = 50 (syncer) trap number = 12 panic: page fault cpuid = 0 Uptime: 2d16h27m41s Physical memory: 6130 MB Dumping 675 MB: 660 644 628 612 596 580 564 548 532 516 500 484 468 452 436 420 404 388 372 356 340 324 308 292 276 260 244 228 212 196 180 164 148 132 116 100 84 68 52 36 20 4 Reading symbols from /boot/kernel/if_re.ko...Reading symbols from / boot/kernel/if_re.ko.symbols...done. done. Loaded symbols for /boot/kernel/if_re.ko Reading symbols from /boot/kernel/coretemp.ko...Reading symbols from / boot/kernel/coretemp.ko.symbols...done. done. Loaded symbols for /boot/kernel/coretemp.ko Reading symbols from /boot/kernel/cpufreq.ko...Reading symbols from / boot/kernel/cpufreq.ko.symbols...done. done. Loaded symbols for /boot/kernel/cpufreq.ko Reading symbols from /boot/kernel/pflog.ko...Reading symbols from / boot/kernel/pflog.ko.symbols...done. done. Loaded symbols for /boot/kernel/pflog.ko Reading symbols from /boot/kernel/pf.ko...Reading symbols from /boot/ kernel/pf.ko.symbols...done. done. Loaded symbols for /boot/kernel/pf.ko Reading symbols from /boot/kernel/linux.ko...Reading symbols from / boot/kernel/linux.ko.symbols...done. done. Loaded symbols for /boot/kernel/linux.ko Reading symbols from /boot/kernel/nullfs.ko...Reading symbols from / boot/kernel/nullfs.ko.symbols...done. done. Loaded symbols for /boot/kernel/nullfs.ko Reading symbols from /boot/kernel/fdescfs.ko...Reading symbols from / boot/kernel/fdescfs.ko.symbols...done. done. Loaded symbols for /boot/kernel/fdescfs.ko Reading symbols from /boot/kernel/accf_http.ko...Reading symbols from / boot/kernel/accf_http.ko.symbols...done. done. Loaded symbols for /boot/kernel/accf_http.ko Reading symbols from /boot/kernel/green_saver.ko...Reading symbols from /boot/kernel/green_saver.ko.symbols...done. done. Loaded symbols for /boot/kernel/green_saver.ko #0 doadump () at pcpu.h:195 195 pcpu.h: No such file or directory. in pcpu.h (kgdb) list *0xffffffff8047d41a 0xffffffff8047d41a is in _mtx_lock_sleep (/usr/src/sys/kern/ kern_mutex.c:341). 336 */ 337 v = m->mtx_lock; 338 if (v != MTX_UNOWNED) { 339 owner = (struct thread *)(v & ~MTX_FLAGMASK); 340 #ifdef ADAPTIVE_GIANT 341 if (TD_IS_RUNNING(owner)) { 342 #else 343 if (m != &Giant && TD_IS_RUNNING(owner)) { 344 #endif 345 if (LOCK_LOG_TEST(&m->lock_object, 0)) (kgdb) backtrace #0 doadump () at pcpu.h:195 #1 0x0000000000000004 in ?? () #2 0xffffffff80488821 in boot (howto=260) at /usr/src/sys/kern/ kern_shutdown.c:418 #3 0xffffffff80488c5c in panic (fmt=0x104
) at /usr/src/sys/kern/kern_shutdown.c:574 #4 0xffffffff8073f1aa in trap_fatal (frame=0xffffff000443aa50, eva=Variable "eva" is not available. ) at /usr/src/sys/amd64/amd64/trap.c:764 #5 0xffffffff8073f551 in trap_pfault (frame=0xffffffffaef6ca10, usermode=0) at /usr/src/sys/amd64/amd64/trap.c:680 #6 0xffffffff8073fe0f in trap (frame=0xffffffffaef6ca10) at /usr/src/ sys/amd64/amd64/trap.c:449 #7 0xffffffff8072685e in calltrap () at /usr/src/sys/amd64/amd64/ exception.S:209 #8 0xffffffff8047d41a in _mtx_lock_sleep (m=0xffffff003c1b74d8, tid=18446742974269467216, opts=Variable "opts" is not available. ) at /usr/src/sys/kern/kern_mutex.c:339 #9 0xffffffff804ff4e2 in vfs_msync (mp=0xffffff000445aa68, flags=2) at /usr/src/sys/kern/vfs_subr.c:2976 #10 0xffffffff804ff73b in sync_fsync (ap=Variable "ap" is not available. ) at /usr/src/sys/kern/vfs_subr.c:3225 #11 0xffffffff804ffebc in sched_sync () at vnode_if.h:538 #12 0xffffffff80468efd in fork_exit (callout=0xffffffff804ff8a7 , arg=0x0, frame=0xffffffffaef6cc80) at /usr/src/sys/kern/kern_fork.c:804 #13 0xffffffff80726c2e in fork_trampoline () at /usr/src/sys/amd64/ amd64/exception.S:455 #14 0x0000000000000000 in ?? () #15 0x0000000000000000 in ?? () #16 0x0000000000000001 in ?? () #17 0x0000000000000000 in ?? () #18 0x0000000000000000 in ?? () #19 0x0000000000000000 in ?? () #20 0x0000000000000000 in ?? () #21 0x0000000000000000 in ?? () #22 0x0000000000000000 in ?? () #23 0x0000000000000000 in ?? () #24 0x0000000000000000 in ?? () #25 0x0000000000000000 in ?? () #26 0x0000000000000000 in ?? () #27 0x0000000000000000 in ?? () #28 0x0000000000000000 in ?? () #29 0x0000000000000000 in ?? () #30 0x0000000000000000 in ?? () #31 0x0000000000000000 in ?? () #32 0x0000000000000000 in ?? () #33 0x0000000000000000 in ?? () #34 0x0000000000000000 in ?? () #35 0x0000000000000000 in ?? () #36 0x0000000000000000 in ?? () #37 0x0000000000000000 in ?? () #38 0x0000000000d04000 in ?? () #39 0x0000000000000002 in ?? () #40 0x0000000000000000 in ?? () #41 0xffffff00044428f0 in ?? () #42 0xffffff00044afa50 in ?? () #43 0xffffff000443aa50 in ?? () #44 0xffffffffaef6ca28 in ?? () #45 0xffffff000443aa50 in ?? () #46 0xffffffff804a7246 in sched_switch (td=0x0, newtd=0xffffffff804ff8a7, flags=1) at /usr/src/sys/kern/sched_4bsd.c:910 #47 0x0000000000000000 in ?? () #48 0x0000000000000000 in ?? () #49 0x0000000000000000 in ?? () #50 0x0000000000000000 in ?? () #51 0x0000000000000000 in ?? () #52 0x0000000000000000 in ?? () #53 0x0000000000000000 in ?? () #54 0x0000000000000000 in ?? () #55 0x0000000000000000 in ?? () #56 0x0000000000000000 in ?? () #57 0x0000000000000000 in ?? () #58 0x0000000000000000 in ?? () #59 0x0000000000000000 in ?? () #60 0x0000000000000000 in ?? () #61 0x0000000000000000 in ?? () #62 0x0000000000000000 in ?? () #63 0x0000000000000000 in ?? () #64 0x0000000000000000 in ?? () #65 0x0000000000000000 in ?? () #66 0x0000000000000000 in ?? () #67 0x0000000000000000 in ?? () #68 0x0000000000000000 in ?? () #69 0x0000000000000000 in ?? () #70 0x0000000000000000 in ?? () #71 0x0000000000000000 in ?? () #72 0x0000000000000000 in ?? () #73 0x0000000000000000 in ?? () #74 0x0000000000000000 in ?? () #75 0x0000000000000000 in ?? () #76 0x0000000000000000 in ?? () #77 0x0000000000000000 in ?? () #78 0x0000000000000000 in ?? () #79 0x0000000000000000 in ?? () #80 0x0000000000000000 in ?? () #81 0x0000000000000000 in ?? () #82 0x0000000000000000 in ?? () #83 0x0000000000000000 in ?? () #84 0x0000000000000000 in ?? () #85 0x0000000000000000 in ?? () #86 0x0000000000000000 in ?? () #87 0x0000000000000000 in ?? () #88 0x0000000000000000 in ?? () #89 0x0000000000000000 in ?? () #90 0x0000000000000000 in ?? () #91 0x0000000000000000 in ?? () #92 0x0000000000000000 in ?? () #93 0x0000000000000000 in ?? () #94 0x0000000000000000 in ?? () #95 0x0000000000000000 in ?? () #96 0x0000000000000000 in ?? () #97 0x0000000000000000 in ?? () #98 0x0000000000000000 in ?? () #99 0x0000000000000000 in ?? () #100 0x0000000000000000 in ?? () #101 0x0000000000000000 in ?? () #102 0x0000000000000000 in ?? () #103 0x0000000000000000 in ?? () #104 0x0000000000000000 in ?? () #105 0x0000000000000000 in ?? () #106 0x0000000000000000 in ?? () #107 0x0000000000000000 in ?? () #108 0x0000000000000000 in ?? () #109 0x0000000000000000 in ?? () #110 0x0000000000000000 in ?? () #111 0x0000000000000000 in ?? () #112 0x0000000000000000 in ?? () #113 0x0000000000000000 in ?? () #114 0x0000000000000000 in ?? () #115 0x0000000000000000 in ?? () #116 0x0000000000000000 in ?? () #117 0x0000000000000000 in ?? () #118 0x0000000000000000 in ?? () Cannot access memory at address 0xffffffffaef6d000 (kgdb) quit From bugmaster at FreeBSD.org Mon Nov 10 03:06:57 2008 From: bugmaster at FreeBSD.org (FreeBSD bugmaster) Date: Mon Nov 10 03:09:03 2008 Subject: Current problem reports assigned to freebsd-scsi@FreeBSD.org Message-ID: <200811101106.mAAB6v4e049846@freefall.freebsd.org> Note: to view an individual PR, use: http://www.freebsd.org/cgi/query-pr.cgi?pr=(number). The following is a listing of current problems submitted by FreeBSD users. These represent problem reports covering all versions including experimental development code and obsolete releases. S Tracker Resp. Description -------------------------------------------------------------------------------- o kern/128452 scsi [sa] [panic] Accessing SCSI tape drive randomly crashe o kern/128245 scsi [scsi] "inquiry data fails comparison at DV1 step" [re o kern/127927 scsi [isp] isp(4) target driver crashes kernel when set up o kern/127901 scsi [scsi] "inquiry data fails comparison at DV1 step" [re o kern/126866 scsi [isp] [panic] kernel panic on card initialization o kern/124667 scsi [amd] [panic] FreeBSD-7 kernel page faults at amd-scsi o kern/123674 scsi [ahc] ahc driver dumping o kern/123666 scsi [aac] attach fails with Adaptec SAS RAID 3805 controll o sparc/121676 scsi [iscsi] iscontrol do not connect iscsi-target on sparc o kern/120487 scsi [sg] scsi_sg incompatible with scanners o kern/120247 scsi [mpt] FreeBSD 6.3 and LSI Logic 1030 = only 3.300MB/s o kern/119668 scsi [cam] [patch] certain errors are too verbose comparing o kern/114597 scsi [sym] System hangs at SCSI bus reset with dual HBAs o kern/110847 scsi [ahd] Tyan U320 onboard problem with more than 3 disks o kern/99954 scsi [ahc] reading from DVD failes on 6.x [regression] o kern/94838 scsi Kernel panic while mounting SD card with lock switch o o kern/92798 scsi [ahc] SCSI problem with timeouts o kern/90282 scsi [sym] SCSI bus resets cause loss of ch device o kern/76178 scsi [ahd] Problem with ahd and large SCSI Raid system o kern/74627 scsi [ahc] [hang] Adaptec 2940U2W Can't boot 5.3 s kern/61165 scsi [panic] kernel page fault after calling cam_send_ccb o kern/60641 scsi [sym] Sporadic SCSI bus resets with 53C810 under load o kern/60598 scsi wire down of scsi devices conflicts with config s kern/57398 scsi [mly] Current fails to install on mly(4) based RAID di o kern/52638 scsi [panic] SCSI U320 on SMP server won't run faster than o kern/44587 scsi dev/dpt/dpt.h is missing defines required for DPT_HAND o kern/40895 scsi wierd kernel / device driver bug o kern/39388 scsi ncr/sym drivers fail with 53c810 and more than 256MB m o kern/38828 scsi [dpt] [request] DPT PM2012B/90 doesn't work o kern/35234 scsi World access to /dev/pass? (for scanner) requires acce 30 problems total. From Carole.Macheret at ch.meggitt.com Wed Nov 12 14:04:17 2008 From: Carole.Macheret at ch.meggitt.com (Carole Macheret) Date: Wed Nov 12 14:04:24 2008 Subject: g_vfs_done In-Reply-To: <48A4666C.6080008@samsco.org> References: <4874F53A0200001300130DE3@gw.vibro-meter.com> <48A465B10200001300132295@gw.vibro-meter.com> <48A46586.1F16.0013.0@ch.meggitt.com><48A46586.1F16.0013.0@ch.meggitt.com> <48A4666C.6080008@samsco.org> Message-ID: <491B5C2A.1F16.0013.0@ch.meggitt.com> Hi Scott, Thanks a lot for your advice, we have finally run some tests with the following setting changed: kern.cam.da.retry_count=100 (in /etc/sysctl.conf) Now the FreeBSD virtual machines doesn't freeze anymore after loosing the disks during the IPstor failover. Best regards Carole Macheret >>> Scott Long 14.08.2008 19:07 >>> Carole Macheret wrote: > Hello, > > We are using FreeBSD 7.0-RELEASE #1 running Squid and Zabbix on vmware ESX 3.0.2 and our vmware ESX servers access our SAN through IpStor cluster (Storage virtualization and mirroring). > > We have 2 storages (EVA 6100) and the IpStor solution allows us to mirror disks on both EVAs. > > We have a problem with both the Zabbix and Squid FreeBSD virtual machines, when the virtual machine is loosing its disks (EVA controller reboot or ipstor cluster failover), we have several "g_vfs_done() : da1s1d[WRITE(offset=2312431234, length=12453)] error= 5" errors then the host is definitively frozen. The disk loss lasts 1-5 seconds. Windows virtual machines do freeze during the loss then continue working. On Windows we had to specify a longer timeout for local disk in registry. > > Does anybody has an idea what could be tuned to avoid this problem ? > > Attached you can find the dmesg and a screenshot of the g_vfs_done error... > > Thanks in advance for your help > So the virtual disks that the FreeBSD images are using in VMWare are on an IpStor, and those periodically go away, yes? What's probably happening is that the VMWare host is triggering an event in the FreeBSD client VM that essentially is making the virtual disks go away. Inside the FreeBSD VM, the SCSI layer tries to talk to the disk and gets a selection timeout since the disk is no longer there. It doesn't know that this is a temporary state, and it declares the I/O as failed. At that point, the BSD VM gets upset and everything gets bad. There is a property called kern.cam.da.default_timeout. It's set to 60 seconds, but I don't think that it will help you in this case, since it's likely that the i/o is failing because of a selection timeout, not because the virtual disk is slow in completing the i/o. The kern.cam.da.retry_count property is set to 5, and changing it might help since it might be able to force enough retries to give time for the virtual disk to come back. Try the following command on a running system: sysctl kern.cam.da.retry_count=100 This will allow for about 25 seconds worth of retries (a selection attempt takes 250ms, so you'll get about 4 retries per second). If this doesn't work, try configuring VMWare to give you a serial console that you can capture on the host, then set bootverbose during boot and send me the log once the problem happens. Scott From p.christias at noc.ntua.gr Sun Nov 16 17:13:23 2008 From: p.christias at noc.ntua.gr (Panagiotis Christias) Date: Sun Nov 16 17:13:30 2008 Subject: FreeBSD 7-STABLE, isp(4), QLE2462: panic & deadlocks In-Reply-To: <20081015175453.GA3260@noc.ntua.gr> References: <20081014222343.GA8706@noc.ntua.gr> <1224049455.1277.44.camel@brain.cc.rsu.ru> <20081015175453.GA3260@noc.ntua.gr> Message-ID: <20081117011317.GB52109@noc.ntua.gr> On Wed, Oct 15, 2008 at 08:54:53PM +0300, Panagiotis Christias wrote: > On Wed, Oct 15, 2008 at 09:44:15AM +0400, Oleg Sharoiko wrote: > > Hi! > > > > On Wed, 2008-10-15 at 01:23 +0300, Panagiotis Christias wrote: > > > > > However, when we connect them to the CX3-40, create and mount a new > > > partition and then do something as simple as "tar -C /san -xf ports.tgz" > > > the system panics and deadlocks. We have tried several FreeBSD versions > > > (6.3 i386/adm64, 7.0 i386/adm64, 7.1 i386/adm64 and lastly 7-STABLE i386 > > > - we also tried the latest 8-CURRENT snapshot but it panicked too soon). > > > The result is always the same; panic and deadlock. > > > > Try reducing the number of "tagged openings" with 'camcontrol tags' down > > to 46. If it doesn't work try reducing it further to 2. Also be advised > > that I've seen panics with geom_multipath in FreeBSD-7, unfortunately I > > had no time to test it in -current. > > > Hm.. that would probably explain the fact that I was unable to panic the > system when I had set the hint.isp.0.debug="0x1F" in /boot/device.hints. > > Currently I am stress testing the server with the tagged openings set to > 44 (first value tested). Until now there is no panic or deadlock. I am > trying concurrent tar extractions and rsync copies. The filesystem looks > ok till now according to fsck. I will let it write/copy/delete overnight > and tomorrow I will try different tagged opening values. > > Thank you for the hint! I am wondering what is the performance penalty > with decreased tagged openings. Also, is there anything else I could try > in order to get more useful debug output? I have at least three servers > that I could use for any kind of tests and I am willing to spend as much > time I can get to help solving the problem. > > Finally, the only output in the logs is: > > Expensive timeout(9) function: 0xc06f4210(0xc67e1200) 0.059422635 s > Expensive timeout(9) function: 0xc08d4fd0(0) 0.060676147 s > > I suppose that is related to the CAMDEBUG kernel config options. For the record, I have done many tests using several stressing tools in parallel, different FreeBSD versions (up to 7.1beta2), various filesystem configurations (plain ufs2 with softupdates, ufs2 and gjournal, zfs) and various tag openings values (down to 2). Regardless of the configuration, the system deadlocks, panics or the filesystem gets awfully corrupted within seconds, minutes or a few hours. The only configuration that seems to work without problems(?) but with a unacceptable *severe* performance penalty is when tag openings are set to minimum value of 2 (that is more or less same as disabling tagged command queueing at all). All tests ran using a 500 GB RAID5 LUN on an EMC Clariion CX340: da0 at isp0 bus 0 target 0 lun 0 da0: Fixed Direct Access SCSI-4 device da0: Serial Number CK200083100148 da0: 400.000MB/s transfers da0: Command Queueing Enabled da0: 512000MB (1048576000 512 byte sectors: 255H 63S/T 65270C) Previously, a Sun StorEdge T3 was tested which worked flawlessly but it had a 1 Gbps fibre channel interface, instead of a 4 Gbps that Clariion has, was recognized as a SCSI-3 device and had 2 tags openings (no surprise) by default: da1 at isp1 bus 0 target 0 lun 0 da1: Fixed Direct Access SCSI-3 device da1: 100.000MB/s transfers da1: 241724MB (495050752 512 byte sectors: 255H 63S/T 30815C) As I mentioned before, I am willing to spend time or/and provide access to the system for testing and debugging. Regards, Panagiotis -- Panagiotis J. Christias Network Management Center P.Christias@noc.ntua.gr National Technical Univ. of Athens, GREECE From jcigar at ulb.ac.be Mon Nov 17 01:06:30 2008 From: jcigar at ulb.ac.be (Julien Cigar) Date: Mon Nov 17 01:06:38 2008 Subject: ahc Message-ID: <1226911028.2746.14.camel@frodon.be-bif.ulb.ac.be> Dear FreeBSD users, I'm running FreeBSD 7.0 with Bacula as a backup box. This box has an Adaptec 2940 SCSI card and a Sony SDX700-C tape drive : ahc0: port 0xec00-0xecff mem 0xdffff000-0xdfffffff irq 18 at device 7.0 on pci0 aic7880: Ultra Wide Channel A, SCSI Id=7, 16/253 SCBs sa0 at ahc0 bus 0 target 5 lun 0 sa0: Removable Sequential Access SCSI-2 device sa0: 20.000MB/s transfers (10.000MHz, offset 8, 16bit) It works more or less in the sense that sometimes Bacula fails to write final EOF to tape. I'm sure that the tapes are OK because I reached the tape rotation cycle and sometimes it fails with a tape that didn't fail in the previous rotation cycle. When I look at the kernel outputs, I have the following : ahc0: Recovery Initiated >>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<< ahc0: Dumping Card State while idle, at SEQADDR 0x7 Card was paused ACCUM = 0x83, SINDEX = 0x57, DINDEX = 0x26, ARG_2 = 0x3e HCNT = 0xe8 SCBPTR = 0x0 SCSISIGI[0x0] ERROR[0x0] SCSIBUSL[0x0] LASTPHASE[0x1]:(P_BUSFREE) SCSISEQ[0x12]:(ENAUTOATNP|ENRSELI) SBLKCTL[0x2]:(SELWIDE) SCSIRATE[0x0] SEQCTL[0x10]:(FASTMODE) SEQ_FLAGS[0xc0]:(NO_CDB_SENT| NOT_IDENTIFIED) SSTAT0[0x0] SSTAT1[0xa]:(PHASECHG|BUSFREE) SSTAT2[0x0] SSTAT3[0x0] SIMODE0[0x0] SIMODE1[0xa4]:(ENSCSIPERR|ENSCSIRST|ENSELTIMO) SXFRCTL0[0x80]:(DFON) DFCNTRL[0x0] DFSTATUS[0x2]:(FIFOFULL) STACK: 0x0 0x16a 0x19a 0x3 SCB count = 254 Kernel NEXTQSCB = 238 Card NEXTQSCB = 238 QINFIFO entries: Waiting Queue entries: Disconnected Queue entries: 0:248 QOUTFIFO entries: Sequencer Free SCB List: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Sequencer SCB Info: 0 SCB_CONTROL[0x44]:(DISCONNECTED|DISCENB) SCB_SCSIID[0x57] SCB_LUN[0x0] SCB_TAG[0xf8] 1 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 2 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 3 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 4 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 5 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 6 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 7 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 8 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 9 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 10 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 11 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 12 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 13 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 14 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] 15 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] Pending list: 248 SCB_CONTROL[0x40]:(DISCENB) SCB_SCSIID[0x57] SCB_LUN[0x0] Kernel Free SCB list: 239 240 241 242 243 244 245 246 247 249 250 251 252 253 237 236 235 234 233 232 231 230 229 228 227 226 225 224 223 222 221 220 219 218 217 216 215 214 213 212 211 210 209 208 207 206 205 204 203 202 201 200 199 198 197 196 195 194 193 192 191 190 189 188 187 186 185 184 183 182 181 180 179 178 177 176 175 174 173 172 171 170 169 168 167 166 165 164 163 162 161 160 159 158 157 156 155 154 153 152 151 150 149 148 147 146 145 144 143 142 141 140 139 138 137 136 135 134 133 132 131 130 129 128 127 126 125 124 123 122 121 120 119 118 117 116 115 114 113 112 111 110 109 108 107 106 105 104 103 102 101 100 99 98 97 96 95 94 93 92 91 90 89 88 87 86 85 84 83 82 81 80 79 78 77 76 75 74 73 72 71 70 69 68 67 66 65 64 63 62 61 60 59 58 57 56 55 54 53 52 51 50 49 48 47 46 45 44 43 42 41 40 39 38 37 36 35 34 33 32 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 Untagged Q(5): 248 <<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>> (sa0:ahc0:0:5:0): SCB 0xf8 - timed out sg[0] - Addr 0x375e028 : Length 8152 sg[1] - Addr 0x3700000 : Length 4096 sg[2] - Addr 0x2cb0000 : Length 20480 sg[3] - Addr 0x3701000 : Length 20480 sg[4] - Addr 0x375c000 : Length 4096 sg[5] - Addr 0x4026000 : Length 4096 sg[6] - Addr 0x4700000 : Length 3112 (sa0:ahc0:0:5:0): Queuing a BDR SCB ahc0: Timedout SCBs already complete. Interrupts may not be functioning. (sa0:ahc0:0:5:0): Bus Device Reset Message Sent (sa0:ahc0:0:5:0): no longer in timeout, status = 24b ahc0: Bus Device Reset on A:5. 1 SCBs aborted (sa0:ahc0:0:5:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0 (sa0:ahc0:0:5:0): NO SENSE ILI (length mismatch): -56320 asc:0,0 (sa0:ahc0:0:5:0): No additional sense information (sa0:ahc0:0:5:0): MODE SENSE(06). CDB: 1a 0 f 0 1c 0 (sa0:ahc0:0:5:0): NO SENSE ILI (length mismatch): -56320 asc:0,0 (sa0:ahc0:0:5:0): No additional sense information % Do you have an idea what could be wrong ? Should I fill a bug report ? Thanks, (and sorry for my english) Julien ps: as I'm not subscribed on this list, could you include my email address if you reply ? -- Julien Cigar Belgian Biodiversity Platform http://www.biodiversity.be Universit? Libre de Bruxelles (ULB) Campus de la Plaine CP 257 B?timent NO, Bureau 4 N4 115C (Niveau 4) Boulevard du Triomphe, entr?e ULB 2 B-1050 Bruxelles Mail: jcigar@ulb.ac.be @biobel: http://biobel.biodiversity.be/person/show/471 Tel : 02 650 57 52 From bugmaster at FreeBSD.org Mon Nov 17 03:06:57 2008 From: bugmaster at FreeBSD.org (FreeBSD bugmaster) Date: Mon Nov 17 03:09:05 2008 Subject: Current problem reports assigned to freebsd-scsi@FreeBSD.org Message-ID: <200811171106.mAHB6uWT082652@freefall.freebsd.org> Note: to view an individual PR, use: http://www.freebsd.org/cgi/query-pr.cgi?pr=(number). The following is a listing of current problems submitted by FreeBSD users. These represent problem reports covering all versions including experimental development code and obsolete releases. S Tracker Resp. Description -------------------------------------------------------------------------------- o kern/128452 scsi [sa] [panic] Accessing SCSI tape drive randomly crashe o kern/128245 scsi [scsi] "inquiry data fails comparison at DV1 step" [re o kern/127927 scsi [isp] isp(4) target driver crashes kernel when set up o kern/127901 scsi [scsi] "inquiry data fails comparison at DV1 step" [re o kern/126866 scsi [isp] [panic] kernel panic on card initialization o kern/124667 scsi [amd] [panic] FreeBSD-7 kernel page faults at amd-scsi o kern/123674 scsi [ahc] ahc driver dumping o kern/123666 scsi [aac] attach fails with Adaptec SAS RAID 3805 controll o sparc/121676 scsi [iscsi] iscontrol do not connect iscsi-target on sparc o kern/120487 scsi [sg] scsi_sg incompatible with scanners o kern/120247 scsi [mpt] FreeBSD 6.3 and LSI Logic 1030 = only 3.300MB/s o kern/119668 scsi [cam] [patch] certain errors are too verbose comparing o kern/114597 scsi [sym] System hangs at SCSI bus reset with dual HBAs o kern/110847 scsi [ahd] Tyan U320 onboard problem with more than 3 disks o kern/99954 scsi [ahc] reading from DVD failes on 6.x [regression] o kern/94838 scsi Kernel panic while mounting SD card with lock switch o o kern/92798 scsi [ahc] SCSI problem with timeouts o kern/90282 scsi [sym] SCSI bus resets cause loss of ch device o kern/76178 scsi [ahd] Problem with ahd and large SCSI Raid system o kern/74627 scsi [ahc] [hang] Adaptec 2940U2W Can't boot 5.3 s kern/61165 scsi [panic] kernel page fault after calling cam_send_ccb o kern/60641 scsi [sym] Sporadic SCSI bus resets with 53C810 under load o kern/60598 scsi wire down of scsi devices conflicts with config s kern/57398 scsi [mly] Current fails to install on mly(4) based RAID di o kern/52638 scsi [panic] SCSI U320 on SMP server won't run faster than o kern/44587 scsi dev/dpt/dpt.h is missing defines required for DPT_HAND o kern/40895 scsi wierd kernel / device driver bug o kern/39388 scsi ncr/sym drivers fail with 53c810 and more than 256MB m o kern/38828 scsi [dpt] [request] DPT PM2012B/90 doesn't work o kern/35234 scsi World access to /dev/pass? (for scanner) requires acce 30 problems total. From kirk at daycos.com Mon Nov 17 07:40:05 2008 From: kirk at daycos.com (Kirk Strauser) Date: Mon Nov 17 07:40:12 2008 Subject: kern/128452: [sa] [panic] Accessing SCSI tape drive randomly crashes my amd64 system Message-ID: <200811171540.mAHFe5wq088058@freefall.freebsd.org> The following reply was made to PR kern/128452; it has been noted by GNATS. From: Kirk Strauser To: bug-followup@freebsd.org, kirk@strauser.com Cc: Subject: Re: kern/128452: [sa] [panic] Accessing SCSI tape drive randomly crashes my amd64 system Date: Mon, 17 Nov 2008 09:32:58 -0600 I don't wish to pester, but is anyone actually looking at these? If so, should I continue submitting dumps, or do you already have what you need? Is there anything else I can provide? From Andre.Albsmeier at siemens.com Tue Nov 18 03:37:14 2008 From: Andre.Albsmeier at siemens.com (Andre Albsmeier) Date: Tue Nov 18 03:37:21 2008 Subject: Quantum SLDT600 write problems Message-ID: <20081118112402.GA78188@curry.mchp.siemens.de> Hello, for months, I am experiencing occasionally appearing problems using a Quantum SDLT600. What we do is simple: - open() /dev/sa0 - set the blocksize to 64k using ioctl() - write() data in 64k chunks - close() Sometimes, the write() comes back with EIO. This can happen whenever it likes to -- after a few GB, hundreds of GB or never. If it happens, the kernel spits out errors (see below). Otherwise, the machine runs rock solid as a server running quotas, samba, nfsd, dhcpd, NIS, ntpd, ... The complete hardware, apart from the SDLT drive itself, has been replaced a while ago. Earlier it was a 1,4GHz Tualatin on an Asus CUBX-L board using an Adaptec 29160 controller, now it is an 3GHz E8400 on an Asus P5W board using an Adaptec 39320LPE controller. Even the cable from the controller to the drive was changed. OS has always been a recent version of FreeBSD 6.x-STABLE (now 6.4). Any ideas what is happening here? Here are the kernel errors: Nov 18 12:04:13 server kernel: ahd3: Recovery Initiated - Card was not paused Nov 18 12:04:13 server kernel: >>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<< Nov 18 12:04:13 server kernel: ahd3: Dumping Card State at program address 0x32 Mode 0x0 Nov 18 12:04:13 server kernel: INTSTAT[0x0] SELOID[0x0] SELID[0x0] HS_MAILBOX[0x0] Nov 18 12:04:13 server kernel: INTCTL[0x80] SEQINTSTAT[0x0] SAVED_MODE[0x11] DFFSTAT[0x33] Nov 18 12:04:13 server kernel: SCSISIGI[0x0] SCSIPHASE[0x0] SCSIBUS[0x0] LASTPHASE[0x1] Nov 18 12:04:13 server kernel: SCSISEQ0[0x0] SCSISEQ1[0x12] SEQCTL0[0x0] SEQINTCTL[0x0] Nov 18 12:04:13 server kernel: SEQ_FLAGS[0xc0] SEQ_FLAGS2[0x0] QFREEZE_COUNT[0xe1c] Nov 18 12:04:13 server kernel: KERNEL_QFREEZE_COUNT[0xe1c] MK_MESSAGE_SCB[0xff00] Nov 18 12:04:13 server kernel: MK_MESSAGE_SCSIID[0xff] SSTAT0[0x0] SSTAT1[0x8] SSTAT2[0x0] Nov 18 12:04:13 server kernel: SSTAT3[0x0] PERRDIAG[0x0] SIMODE1[0xa4] LQISTAT0[0x0] Nov 18 12:04:13 server kernel: LQISTAT1[0x0] LQISTAT2[0x0] LQOSTAT0[0x0] LQOSTAT1[0x0] Nov 18 12:04:13 server kernel: LQOSTAT2[0x0] Nov 18 12:04:13 server kernel: Nov 18 12:04:13 server kernel: SCB Count = 16 CMDS_PENDING = 1 LASTSCB 0xffff CURRSCB 0x1 NEXTSCB 0x0 Nov 18 12:04:13 server kernel: qinstart = 9444 qinfifonext = 9444 Nov 18 12:04:13 server kernel: QINFIFO: Nov 18 12:04:13 server kernel: WAITING_TID_QUEUES: Nov 18 12:04:13 server kernel: Pending list: Nov 18 12:04:13 server kernel: 1 FIFO_USE[0x0] SCB_CONTROL[0x44] SCB_SCSIID[0x7] Nov 18 12:04:13 server kernel: Total 1 Nov 18 12:04:13 server kernel: Kernel Free SCB list: 2 15 13 12 11 10 9 8 7 6 5 4 3 14 0 Nov 18 12:04:13 server kernel: Sequencer Complete DMA-inprog list: Nov 18 12:04:13 server kernel: Sequencer Complete list: Nov 18 12:04:13 server kernel: Sequencer DMA-Up and Complete list: Nov 18 12:04:13 server kernel: Sequencer On QFreeze and Complete list: Nov 18 12:04:13 server kernel: Nov 18 12:04:13 server kernel: Nov 18 12:04:13 server kernel: ahd3: FIFO0 Free, LONGJMP == 0x80ff, SCB 0x0 Nov 18 12:04:13 server kernel: SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89] Nov 18 12:04:13 server kernel: SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0] Nov 18 12:04:13 server kernel: SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0 Nov 18 12:04:13 server kernel: HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10] Nov 18 12:04:13 server kernel: Nov 18 12:04:13 server kernel: ahd3: FIFO1 Free, LONGJMP == 0x81fc, SCB 0x1 Nov 18 12:04:13 server kernel: SEQIMODE[0x3f] SEQINTSRC[0x0] DFCNTRL[0x4] DFSTATUS[0x89] Nov 18 12:04:13 server kernel: SG_CACHE_SHADOW[0x2] SG_STATE[0x0] DFFSXFRCTL[0x0] Nov 18 12:04:13 server kernel: SOFFCNT[0x0] MDFFSTAT[0x5] SHADDR = 0x00, SHCNT = 0x0 Nov 18 12:04:13 server kernel: HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10] Nov 18 12:04:13 server kernel: LQIN: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 Nov 18 12:04:13 server kernel: ahd3: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE = 0x42 Nov 18 12:04:13 server kernel: ahd3: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x0 Nov 18 12:04:13 server kernel: ahd3: SAVED_SCSIID = 0x0 SAVED_LUN = 0x0 Nov 18 12:04:13 server kernel: SIMODE0[0xc] Nov 18 12:04:13 server kernel: CCSCBCTL[0x4] Nov 18 12:04:13 server kernel: ahd3: REG0 == 0x6f74, SINDEX = 0x1b8, DINDEX = 0x1ba Nov 18 12:04:13 server kernel: ahd3: SCBPTR == 0x0, SCB_NEXT == 0xff00, SCB_NEXT2 == 0x0 Nov 18 12:04:13 server kernel: CDB 0 0 0 0 0 0 Nov 18 12:04:13 server kernel: STACK: 0x23 0x0 0x0 0x0 0x0 0x0 0x0 0x0 Nov 18 12:04:13 server kernel: <<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>> Nov 18 12:04:13 server kernel: (sa0:ahd3:0:0:0): SCB 1 - timed out Nov 18 12:04:13 server kernel: (sa0:ahd3:0:0:0): Queuing a BDR SCB Nov 18 12:04:13 server kernel: (sa0:ahd3:0:0:0): Bus Device Reset Message Sent Nov 18 12:04:13 server kernel: (sa0:ahd3:0:0:0): no longer in timeout, status = 24b Nov 18 12:04:13 server kernel: ahd3: Bus Device Reset on A:0. 1 SCBs aborted Nov 18 12:04:13 server kernel: (sa0:ahd3:0:0:0): WRITE FILEMARKS(6). CDB: 10 0 0 0 2 0 Nov 18 12:04:13 server kernel: (sa0:ahd3:0:0:0): CAM Status: SCSI Status Error Nov 18 12:04:13 server kernel: (sa0:ahd3:0:0:0): SCSI Status: Check Condition Nov 18 12:04:13 server kernel: (sa0:ahd3:0:0:0): UNIT ATTENTION csi:0,49,cc,1e asc:29,3 Nov 18 12:04:13 server kernel: (sa0:ahd3:0:0:0): Bus device reset function occurred Nov 18 12:04:13 server kernel: (sa0:ahd3:0:0:0): Retries Exhausted Nov 18 12:04:13 server kernel: (sa0:ahd3:0:0:0): failed to write terminating filemark(s) Thanks, -Andre From kirk at daycos.com Wed Nov 19 07:40:04 2008 From: kirk at daycos.com (Kirk Strauser) Date: Wed Nov 19 07:40:13 2008 Subject: kern/128452: [sa] [panic] Accessing SCSI tape drive randomly crashes my amd64 system Message-ID: <200811191540.mAJFe4e5022903@freefall.freebsd.org> The following reply was made to PR kern/128452; it has been noted by GNATS. From: Kirk Strauser To: bug-followup@freebsd.org, kirk@strauser.com Cc: Subject: Re: kern/128452: [sa] [panic] Accessing SCSI tape drive randomly crashes my amd64 system Date: Wed, 19 Nov 2008 09:33:33 -0600 You can close this bug. I replaced the card with an Adaptec 29160 and it's been working perfectly ever since. In all fairness, though, the old cards (Tekram DC390F) should be removed from the list of supported hardware since they are no longer functional under FreeBSD 7. From peter at simons-rock.edu Fri Nov 21 09:51:45 2008 From: peter at simons-rock.edu (Peter C. Lai) Date: Fri Nov 21 09:51:58 2008 Subject: ahc(4) on aic7899 wedges with a Tandberg LTO-2 sa drive Message-ID: <20081121172028.GQ92091@cesium.hyperfine.info> I've got a Tandberg TS 400 LTO-2 drive in a Dell PE1800 which is wedging ahc(4) after writing about 5 GB of data. I've attached dmesg.boot. uname -a: FreeBSD phoenix.simons-rock.edu 7.1-PRERELEASE FreeBSD 7.1-PRERELEASE #0: Fri Oct 17 23:17:38 EDT 2008 root@phoenix.simons-rock.edu:/usr/obj/usr/src/sys/PHOENIXPCL i386 I put in a tape, and mt -f /dev/sa0 status gives the following kernel message: Nov 21 12:06:28 phoenix kernel: (ahc0:A:6:0): Sending PPR bus_width 1, period 9, offset 7e, ppr_options 2 Nov 21 12:06:28 phoenix kernel: (ahc0:A:6:0): Received PPR width 1, period 9, offset 7e,options 2 Nov 21 12:06:28 phoenix kernel: Filtered to width 1, period 9, offset 7e, options 2 Nov 21 12:06:28 phoenix kernel: (sa0:ahc0:0:6:0): error 6 Nov 21 12:06:28 phoenix kernel: (sa0:ahc0:0:6:0): Unretryable Error Nov 21 12:06:28 phoenix kernel: (ahc0:A:6:0): Sending PPR bus_width 1, period 9, offset 7e, ppr_options 2 Nov 21 12:06:28 phoenix kernel: (ahc0:A:6:0): Received PPR width 1, period 9, offset 7e,options 2 Nov 21 12:06:28 phoenix kernel: Filtered to width 1, period 9, offset 7e, options 2 Nov 21 12:06:28 phoenix kernel: (sa0:ahc0:0:6:0): error 6 Nov 21 12:06:28 phoenix kernel: (sa0:ahc0:0:6:0): Unretryable Error Nov 21 12:06:28 phoenix kernel: (ahc0:A:6:0): Sending PPR bus_width 1, period 9, offset 7e, ppr_options 2 Nov 21 12:06:28 phoenix kernel: (ahc0:A:6:0): Received PPR width 1, period 9, offset 7e,options 2 Nov 21 12:06:28 phoenix kernel: Nov 21 12:06:28 phoenix kernel: Filtered to width 1, period 9, offset 7e, options 2 Nov 21 12:06:28 phoenix kernel: (sa0:ahc0:0:6:0): error 6 Nov 21 12:06:28 phoenix kernel: (sa0:ahc0:0:6:0): Unretryable Error Nov 21 12:07:30 phoenix kernel: (ahc0:A:6:0): Sending PPR bus_width 1, period 9, offset 7e, ppr_options 2 Nov 21 12:07:30 phoenix kernel: (ahc0:A:6:0): Received PPR width 1, period 9, offset 7e,options 2 Nov 21 12:07:30 phoenix kernel: Filtered to width 1, period 9, offset 7e, options 2 Nov 21 12:07:30 phoenix kernel: (sa0:ahc0:0:6:0): Retrying Command but it returns ok at the end: Mode Density Blocksize bpi Compression Current: 0x42 variable 0 0x1 ---------available modes--------- 0: 0x42 variable 0 0x1 1: 0x42 variable 0 0x1 2: 0x42 variable 0 0x1 3: 0x42 variable 0 0x1 --------------------------------- Current Driver State: at rest. --------------------------------- File Number: 0 Record Number: 0 Residual Count 0 I am using star bs=32k -no-fifo to write to the tape. ahc(4) crashes regardless of setting the blocksize on the hardware (to 32k) or not: Nov 21 08:58:15 phoenix kernel: ahc0: Recovery Initiated Nov 21 08:58:15 phoenix kernel: >>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<< Nov 21 08:58:15 phoenix kernel: ahc0: Dumping Card State while idle, at SEQADDR 0x8 Nov 21 08:58:15 phoenix kernel: Card was paused Nov 21 08:58:15 phoenix kernel: ACCUM = 0x4, SINDEX = 0x67, DINDEX = 0x27, ARG_2 = 0x3b Nov 21 08:58:15 phoenix kernel: HCNT = 0x0 SCBPTR = 0x0 Nov 21 08:58:15 phoenix kernel: SCSIPHASE[0x0] SCSISIGI[0x0] ERROR[0x0] SCSIBUSL[0x0] Nov 21 08:58:15 phoenix kernel: LASTPHASE[0x1]:(P_BUSFREE) SCSISEQ[0x12]:(ENAUTOATNP|ENRSELI) Nov 21 08:58:15 phoenix kernel: SBLKCTL[0xa]:(SELWIDE|SELBUSB) SCSIRATE[0x0] SEQCTL[0x10]:(FASTMODE) Nov 21 08:58:15 phoenix kernel: SEQ_FLAGS[0xc0]:(NO_CDB_SENT|NOT_IDENTIFIED) SSTAT0[0x0] Nov 21 08:58:15 phoenix kernel: SSTAT1[0x0] SSTAT2[0x0] SSTAT3[0x0] SIMODE0[0x8]:(ENSWRAP) Nov 21 08:58:15 phoenix kernel: SIMODE1[0xa4]:(ENSCSIPERR|ENSCSIRST|ENSELTIMO) SXFRCTL0[0x80]:(DFON) Nov 21 08:58:15 phoenix kernel: DFCNTRL[0x0] DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL) Nov 21 08:58:15 phoenix kernel: STACK: 0x0 0x164 0x179 0x3 Nov 21 08:58:15 phoenix kernel: SCB count = 254 Nov 21 08:58:15 phoenix kernel: Kernel NEXTQSCB = 247 Nov 21 08:58:15 phoenix kernel: Card NEXTQSCB = 247 Nov 21 08:58:15 phoenix kernel: QINFIFO entries: Nov 21 08:58:15 phoenix kernel: Waiting Queue entries: Nov 21 08:58:15 phoenix kernel: Disconnected Queue entries: 0:238 Nov 21 08:58:15 phoenix kernel: QOUTFIFO entries: Nov 21 08:58:15 phoenix kernel: Sequencer Free SCB List: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Nov 21 08:58:15 phoenix kernel: Sequencer SCB Info: Nov 21 08:58:15 phoenix kernel: 0 SCB_CONTROL[0x44]:(DISCONNECTED|DISCENB) SCB_SCSIID[0x67] Nov 21 08:58:15 phoenix kernel: SCB_LUN[0x0] SCB_TAG[0xee] Nov 21 08:58:15 phoenix kernel: 1 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) Nov 21 08:58:15 phoenix kernel: SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] Nov 21 08:58:15 phoenix kernel: 2 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) Nov 21 08:58:15 phoenix kernel: SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] Nov 21 08:58:15 phoenix kernel: 3 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) Nov 21 08:58:15 phoenix kernel: SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] Nov 21 08:58:15 phoenix kernel: 4 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) ... Nov 21 08:58:15 phoenix kernel: 31 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID) Nov 21 08:58:15 phoenix kernel: SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff] Nov 21 08:58:15 phoenix kernel: Pending list: Nov 21 08:58:15 phoenix kernel: 238 SCB_CONTROL[0x40]:(DISCENB) SCB_SCSIID[0x67] SCB_LUN[0x0] Nov 21 08:58:15 phoenix kernel: Kernel Free SCB list: 239 240 241 242 243 244 245 246 248 249 250 251 252 253 237 236 235 234 233 232 231 230 229 228 227 226 225 224 223 222 221 220 219 218 217 216 215 214 213 212 211 210 209 208 207 206 205 204 203 202 201 200 199 198 197 196 195 194 193 192 191 190 189 188 187 186 185 184 183 182 181 180 179 178 177 176 175 174 173 172 171 170 169 168 167 166 165 164 163 162 161 160 159 158 157 156 155 154 153 152 151 150 149 148 147 146 145 144 143 142 141 140 139 138 137 136 135 134 133 132 131 130 129 128 127 126 125 124 123 122 121 120 119 118 117 116 115 114 113 112 111 110 109 108 107 106 105 104 103 102 101 100 99 98 97 96 95 94 93 92 91 90 89 88 87 86 85 84 83 82 81 80 79 78 77 76 75 74 73 72 71 70 69 68 67 66 65 64 63 62 61 60 59 58 57 56 55 54 53 52 51 50 49 48 47 46 45 44 43 42 41 40 39 38 37 36 35 34 33 32 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 Nov 21 08:58:15 phoenix kernel: Untagged Q(6): 238 Nov 21 08:58:15 phoenix kernel: Nov 21 08:58:15 phoenix kernel: <<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>> Nov 21 08:58:15 phoenix kernel: (sa0:ahc0:0:6:0): SCB 0xee - timed out Nov 21 08:58:15 phoenix kernel: sg[0] - Addr 0x21706000 : Length 4096 Nov 21 08:58:15 phoenix kernel: sg[1] - Addr 0x217aa000 : Length 4096 Nov 21 08:58:15 phoenix kernel: sg[2] - Addr 0x2183b000 : Length 4096 ... Nov 21 08:58:15 phoenix kernel: sg[7] - Addr 0x221f6000 : Length 4096 Nov 21 08:58:15 phoenix kernel: (sa0:ahc0:0:6:0): Queuing a BDR SCB Nov 21 08:58:15 phoenix kernel: Infinite interrupt loop, INTSTAT = 0ahc0: Timedout SCBs already complete. Interrupts may not be functioning. Nov 21 08:58:17 phoenix kernel: Infinite interrupt loop, INTSTAT = 0ahc0: Recovery Initiated Nov 21 08:58:17 phoenix kernel: >>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<< Nov 21 08:58:17 phoenix kernel: ahc0: Dumping Card State while idle, at SEQADDR 0x18 Nov 21 08:58:17 phoenix kernel: Card was paused ... Nov 21 08:58:17 phoenix kernel: (sa0:ahc0:0:6:0): no longer in timeout, status = 24b Nov 21 08:58:17 phoenix kernel: ahc0: target 6 using 8bit transfers Nov 21 08:58:17 phoenix kernel: ahc0: target 6 using asynchronous transfers Nov 21 08:58:17 phoenix kernel: ahc0: Issued Channel A Bus Reset. 1 SCBs aborted Nov 21 08:58:17 phoenix kernel: (sa0:ahc0:0:6:0): Command timed out Nov 21 08:58:17 phoenix kernel: (sa0:ahc0:0:6:0): error 5 Nov 21 08:58:17 phoenix kernel: (sa0:ahc0:0:6:0): Retries Exausted Nov 21 08:58:17 phoenix kernel: ahc0: Timedout SCBs already complete. Interrupts may not be functioning. Nov 21 08:58:17 phoenix kernel: (ahc0:A:6:0): Sending PPR bus_width 1, period 9, offset 7e, ppr_options 2 Nov 21 08:58:17 phoenix kernel: (ahc0:A:6:0): Received PPR width 1, period 9, offset 7e,options 2 Nov 21 08:58:17 phoenix kernel: Filtered to width 1, period 9, offset 7e, options 2 Nov 21 08:58:17 phoenix kernel: ahc0: target 6 using 16bit transfers Nov 21 08:58:17 phoenix kernel: ahc0: target 6 synchronous at 80.0MHz DT, offset = 0x7e Nov 21 09:02:17 phoenix kernel: ahc0: Recovery Initiated ... until shutdown. A shutdown -r will still leave the controller wedged until a fullblow powercycle is done without the tape in the drive. Reading from tapes work just fine though. Also this identical hardware work just fine under Linux. Please help. Thanks! -- =========================================================== Peter C. Lai | Bard College at Simon's Rock Systems Administrator | 84 Alford Rd. Information Technology Svcs. | Gt. Barrington, MA 01230 USA peter AT simons-rock.edu | (413) 528-7428 =========================================================== From toasty at dragondata.com Sat Nov 22 10:34:02 2008 From: toasty at dragondata.com (Kevin Day) Date: Sat Nov 22 10:34:09 2008 Subject: hpacucli on 7.0/amd64 not working Message-ID: Has anyone managed to get hpacucli working on amd64 in 7.0? I've got an HP DL185 G5 with HP's E200 RAID card in it. I had hpacucli working okay in 6.3/i386, and it works fine in a 64 bit Linux boot, but not in 7.0/amd64: # /usr/local/sbin/hpacucli .P Array Configuration Utility CLI .2. Detecting Controllers...In AddChild:0x838c180 In AddChild child doesnot exists:0x838c180 Iam inside findDevce The device enumerated now is 0x838c180 In Reenumerate Childern Eoption is 1 This device can discover children Thu Nov 13 15:29:40 2008 Operation Call: OperationSetAllowedControllerDiscovery Thu Nov 13 15:29:40 2008 Operation Call: OperationReadSystemInfo Thu Nov 13 15:29:40 2008 Operation Call: OperationCaptureConfigurationMutex Thu Nov 13 15:29:40 2008 Operation Call: OperationReleaseConfigurationMutex Thu Nov 13 15:29:40 2008 Operation Call: OperationDiscoverHostBusAdapters Thu Nov 13 15:29:40 2008 Operation Call: OperationDiscoverNonFibreHBA __TRACE_CODE*1310* pBmicRequest->wCommandStatus=0 /usr/home/user/ im453_new/im453/.s_/LINUX/src/lxioctlciss.cpp zzz freebsd Bus = 5, devfn = 64, dev = 8, fn = 0 freebsd, Looking for bus 5, device 8, function 0 ioctl: Inappropriate ioctl for device Thu Nov 13 15:29:40 2008 Operation Call: OperationDiscoverInternalArrayControllers zzz freebsd Bus = 5, devfn = 64, dev = 8, fn = 0 freebsd, Looking for bus 5, device 8, function 0 ioctl: Inappropriate ioctl for device Thu Nov 13 15:29:40 2008 Operation Call: OperationDiscoverChildren The device enumerated now is 0x838c180 In Reenumerate Childern Eoption is 2 In Enumerate Child nodes Thu Nov 13 15:29:40 2008 Operation Call: OperationSystemPostProcess Pchild in GenerateXML:0x8386d80 Pchild in GenerateXML:0x0 Thu Nov 13 15:29:40 2008 Operation Call: OperationCaptureConfigurationMutex Thu Nov 13 15:29:40 2008 Thu Nov 13 15:29:40 2008 OperationCaptureConfigurationMutex Thu Nov 13 15:29:40 2008 ModRoot137888128-System137937280 Thu Nov 13 15:29:40 2008 Done. Type "help" for a list of supported commands. Type "exit" to close the console. => controller all show The device enumerated now is 0x838c180 In Reenumerate Childern Eoption is 1 This device can discover children Thu Nov 13 15:30:06 2008 Operation Call: OperationSetAllowedControllerDiscovery Thu Nov 13 15:30:06 2008 Operation Call: OperationReadSystemInfo Thu Nov 13 15:30:06 2008 Operation Call: OperationCaptureConfigurationMutex Thu Nov 13 15:30:06 2008 Operation Call: OperationReleaseConfigurationMutex Thu Nov 13 15:30:06 2008 Operation Call: OperationDiscoverHostBusAdapters Thu Nov 13 15:30:06 2008 Operation Call: OperationDiscoverNonFibreHBA zzz freebsd Bus = 5, devfn = 64, dev = 8, fn = 0 freebsd, Looking for bus 5, device 8, function 0 ioctl: Inappropriate ioctl for device Thu Nov 13 15:30:06 2008 Operation Call: OperationDiscoverInternalArrayControllers zzz freebsd Bus = 5, devfn = 64, dev = 8, fn = 0 freebsd, Looking for bus 5, device 8, function 0 ioctl: Inappropriate ioctl for device Thu Nov 13 15:30:06 2008 Operation Call: OperationDiscoverChildren The device enumerated now is 0x838c180 In Reenumerate Childern Eoption is 2 In Enumerate Child nodes Thu Nov 13 15:30:06 2008 Operation Call: OperationSystemPostProcess Pchild in GenerateXML:0x8386d80 Pchild in GenerateXML:0x0 Error: No controllers detected. Anyone managed to get this to work? If not, any ideas as to what's going on? I'm guessing the "inappropriate ioctl for device" is significant here. I tried emailing the listed HP contact for hpacucli (v.sri.sai.ganesh at hp.com) but didn't get any reply. -- Kevin ciss0: port 0xe800-0xe8ff mem 0xdef80000-0xdeffffff,0xdef78000-0xdef7ffff irq 35 at device 8.0 on pci5 ciss0@pci0:5:8:0: class=0x010400 card=0x3212103c chip=0x3238103c rev=0x00 hdr=0x00 vendor = 'Hewlett-Packard Company' device = 'Smart Array E200/E200i Controller' class = mass storage subclass = RAID cap 01[c0] = powerspec 2 supports D0 D1 D3 current D0 cap 05[cc] = MSI supports 2 messages, 64 bit cap 07[dc] = PCI-X 64-bit supports 133MHz, 4096 burst read, 1 split transaction From bugmaster at FreeBSD.org Mon Nov 24 03:07:22 2008 From: bugmaster at FreeBSD.org (FreeBSD bugmaster) Date: Mon Nov 24 03:09:05 2008 Subject: Current problem reports assigned to freebsd-scsi@FreeBSD.org Message-ID: <200811241107.mAOB7LGd020031@freefall.freebsd.org> Note: to view an individual PR, use: http://www.freebsd.org/cgi/query-pr.cgi?pr=(number). The following is a listing of current problems submitted by FreeBSD users. These represent problem reports covering all versions including experimental development code and obsolete releases. S Tracker Resp. Description -------------------------------------------------------------------------------- o kern/128452 scsi [sa] [panic] Accessing SCSI tape drive randomly crashe o kern/128245 scsi [scsi] "inquiry data fails comparison at DV1 step" [re o kern/127927 scsi [isp] isp(4) target driver crashes kernel when set up o kern/127901 scsi [scsi] "inquiry data fails comparison at DV1 step" [re o kern/126866 scsi [isp] [panic] kernel panic on card initialization o kern/124667 scsi [amd] [panic] FreeBSD-7 kernel page faults at amd-scsi o kern/123674 scsi [ahc] ahc driver dumping o kern/123666 scsi [aac] attach fails with Adaptec SAS RAID 3805 controll o sparc/121676 scsi [iscsi] iscontrol do not connect iscsi-target on sparc o kern/120487 scsi [sg] scsi_sg incompatible with scanners o kern/120247 scsi [mpt] FreeBSD 6.3 and LSI Logic 1030 = only 3.300MB/s o kern/119668 scsi [cam] [patch] certain errors are too verbose comparing o kern/114597 scsi [sym] System hangs at SCSI bus reset with dual HBAs o kern/110847 scsi [ahd] Tyan U320 onboard problem with more than 3 disks o kern/99954 scsi [ahc] reading from DVD failes on 6.x [regression] o kern/94838 scsi Kernel panic while mounting SD card with lock switch o o kern/92798 scsi [ahc] SCSI problem with timeouts o kern/90282 scsi [sym] SCSI bus resets cause loss of ch device o kern/76178 scsi [ahd] Problem with ahd and large SCSI Raid system o kern/74627 scsi [ahc] [hang] Adaptec 2940U2W Can't boot 5.3 s kern/61165 scsi [panic] kernel page fault after calling cam_send_ccb o kern/60641 scsi [sym] Sporadic SCSI bus resets with 53C810 under load o kern/60598 scsi wire down of scsi devices conflicts with config s kern/57398 scsi [mly] Current fails to install on mly(4) based RAID di o kern/52638 scsi [panic] SCSI U320 on SMP server won't run faster than o kern/44587 scsi dev/dpt/dpt.h is missing defines required for DPT_HAND o kern/40895 scsi wierd kernel / device driver bug o kern/39388 scsi ncr/sym drivers fail with 53c810 and more than 256MB m o kern/38828 scsi [dpt] [request] DPT PM2012B/90 doesn't work o kern/35234 scsi World access to /dev/pass? (for scanner) requires acce 30 problems total. From linimon at FreeBSD.org Tue Nov 25 15:33:38 2008 From: linimon at FreeBSD.org (linimon@FreeBSD.org) Date: Tue Nov 25 15:33:44 2008 Subject: kern/127901: [scsi] "inquiry data fails comparison at DV1 step" [regression] Message-ID: <200811252333.mAPNXbRD020276@freefall.freebsd.org> Synopsis: [scsi] "inquiry data fails comparison at DV1 step" [regression] State-Changed-From-To: open->closed State-Changed-By: linimon State-Changed-When: Tue Nov 25 23:33:23 UTC 2008 State-Changed-Why: Superseded by kern/128245. http://www.freebsd.org/cgi/query-pr.cgi?pr=127901 From mailshakeb at gmail.com Thu Nov 27 01:22:44 2008 From: mailshakeb at gmail.com (shakeb ainul) Date: Thu Nov 27 01:22:50 2008 Subject: Problem with RAID1 Disk on Freebsd Message-ID: <124704c40811270052s1d215d24kc7b057da17a1cb83@mail.gmail.com> Hi, I am a developer with one of the top IT companies in Asia. I have the following problem with my RAID 1 server. Last week, the only disk of my server failed due to unknown reason and it required a reboot of the server. Following error messages were logged in: /var/log/messages Nov 21 02:18:17 server1 kernel: ciss0: *** SCSI bus speed downshifted, SCSI port 2 Nov 21 02:20:58 server1 kernel: ciss0: *** SCSI bus speed downshifted, SCSI port 2 Nov 21 02:22:37 server1 kernel: ciss0: *** SCSI bus speed downshifted, SCSI port 2 Nov 21 02:31:01 server1 kernel: ciss0: *** Physical drive failure: SCSI port 2 ID 1 Nov 21 02:31:01 server1 kernel: ciss0: *** State change, logical drive 0 Nov 21 02:31:01 server1 kernel: ciss0: logical drive 0 (da0) changed status OK->interim recovery, spare status 0x0 Attached is the dmesg.boot file of my server. Please advise on what could be the possible causes for this fault and what can we do to ensure it does not happen again in future. Thanks in anticipation. Regards, SHAKEB AINUL -------------- next part -------------- Copyright (c) 1992-2007 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD is a registered trademark of The FreeBSD Foundation. FreeBSD 6.2-RELEASE #0: Mon Nov 5 14:29:36 EST 2007 root@pit-cvs:/usr/obj/usr/src/sys/HP_DL380G4_SMP_PITT Timecounter "i8254" frequency 1193182 Hz quality 0 CPU: Intel(R) Xeon(TM) CPU 3.40GHz (3400.14-MHz K8-class CPU) Origin = "GenuineIntel" Id = 0xf41 Stepping = 1 Features=0xbfebfbff Features2=0x649d> AMD Features=0x20000800 Logical CPUs per core: 2 real memory = 2147430400 (2047 MB) avail memory = 2066169856 (1970 MB) ACPI APIC Table: FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs cpu0 (BSP): APIC ID: 0 cpu1 (AP): APIC ID: 1 cpu2 (AP): APIC ID: 6 cpu3 (AP): APIC ID: 7 ioapic0 irqs 0-23 on motherboard ioapic1 irqs 24-47 on motherboard ioapic2 irqs 48-71 on motherboard ioapic3 irqs 72-95 on motherboard ioapic4 irqs 96-119 on motherboard acpi0: on motherboard acpi0: Power Button (fixed) Timecounter "ACPI-safe" frequency 3579545 Hz quality 1000 acpi_timer0: <24-bit timer at 3.579545MHz> port 0x908-0x90b on acpi0 cpu0: on acpi0 acpi_perf0: on cpu0 cpu1: on acpi0 cpu2: on acpi0 cpu3: on acpi0 pcib0: on acpi0 pci0: on pcib0 pcib1: at device 2.0 on pci0 pci2: on pcib1 pcib2: at device 0.0 on pci2 pci3: on pcib2 bge0: mem 0xfdef0000-0xfdefffff irq 25 at device 1.0 on pci3 miibus0: on bge0 brgphy0: on miibus0 brgphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX, 1000baseTX-FDX, auto bge0: Ethernet address: 00:13:21:cc:d5:7d bge1: mem 0xfdee0000-0xfdeeffff irq 26 at device 1.1 on pci3 miibus1: on bge1 brgphy1: on miibus1 brgphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX, 1000baseTX-FDX, auto bge1: Ethernet address: 00:13:21:cc:d5:7c pcib3: at device 0.2 on pci2 pci4: on pcib3 ciss0: port 0x4000-0x40ff mem 0xfdff0000-0xfdff1fff,0xfdf80000-0xfdfbffff irq 51 at device 3.0 on pci4 ciss0: [GIANT-LOCKED] pcib4: at device 6.0 on pci0 pci5: on pcib4 pcib5: at device 0.0 on pci5 pci6: on pcib5 pcib6: at device 0.2 on pci5 pci10: on pcib6 uhci0: port 0x2000-0x201f irq 16 at device 29.0 on pci0 uhci0: [GIANT-LOCKED] usb0: on uhci0 usb0: USB revision 1.0 uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub0: 2 ports with 2 removable, self powered uhci1: port 0x2020-0x203f irq 19 at device 29.1 on pci0 uhci1: [GIANT-LOCKED] usb1: on uhci1 usb1: USB revision 1.0 uhub1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub1: 2 ports with 2 removable, self powered uhci2: port 0x2040-0x205f irq 18 at device 29.2 on pci0 uhci2: [GIANT-LOCKED] usb2: on uhci2 usb2: USB revision 1.0 uhub2: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub2: 2 ports with 2 removable, self powered uhci3: port 0x2060-0x207f irq 16 at device 29.3 on pci0 uhci3: [GIANT-LOCKED] usb3: on uhci3 usb3: USB revision 1.0 uhub3: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1 uhub3: 2 ports with 2 removable, self powered ehci0: mem 0xfbef0000-0xfbef03ff irq 23 at device 29.7 on pci0 ehci0: [GIANT-LOCKED] usb4: EHCI version 1.0 usb4: companion controllers, 2 ports each: usb0 usb1 usb2 usb3 usb4: on ehci0 usb4: USB revision 2.0 uhub4: Intel EHCI root hub, class 9/0, rev 2.00/1.00, addr 1 uhub4: 8 ports with 8 removable, self powered pcib7: at device 30.0 on pci0 pci1: on pcib7 pci1: at device 3.0 (no driver attached) pci1: at device 4.0 (no driver attached) pci1: at device 4.2 (no driver attached) isab0: at device 31.0 on pci0 isa0: on isab0 atapci0: port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0x500-0x50f at device 31.1 on pci0 ata0: on atapci0 ata1: on atapci0 acpi_tz0: on acpi0 atkbdc0: port 0x60,0x64 irq 1 on acpi0 atkbd0: irq 1 on atkbdc0 kbd0 at atkbd0 atkbd0: [GIANT-LOCKED] psm0: irq 12 on atkbdc0 psm0: [GIANT-LOCKED] psm0: model Generic PS/2 mouse, device ID 0 sio0: configured irq 4 not in bitmap of probed irqs 0 sio0: port may not be enabled sio0: port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0 sio0: type 16550A fdc0: port 0x3f2-0x3f5 irq 6 drq 2 on acpi0 fdc0: [FAST] fd0: <1440-KB 3.5" drive> on fdc0 drive 0 orm0: at iomem 0xc0000-0xc7fff,0xc8000-0xcbfff,0xee000-0xeffff on isa0 sc0: at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=0x300> vga0: at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 sio1: configured irq 3 not in bitmap of probed irqs 0 sio1: port may not be enabled sio1 at port 0x2f8-0x2ff irq 3 on isa0 sio1: type 16550A Timecounters tick every 1.000 msec ipfw2 (+ipv6) initialized, divert loadable, rule-based forwarding enabled, default to accept, logging limited to 100 packets/entry by default acd0: DVDROM at ata0-master UDMA33 SMP: AP CPU #1 Launched! SMP: AP CPU #3 Launched! SMP: AP CPU #2 Launched! da0 at ciss0 bus 0 target 0 lun 0 da0: Fixed Direct Access SCSI-0 device da0: 135.168MB/s transfers da0: 69459MB (142253280 512 byte sectors: 255H 32S/T 17433C) Trying to mount root from ufs:/dev/da0s1a WARNING: / was not properly dismounted fec0: port bge0 in bundle is down fec0: port bge1 in bundle is down bge1: link state changed to UP bge0: link state changed to UP fec0: port bge0 in bundle is up fec0: port bge1 in bundle is up From kama at pvp.se Fri Nov 28 02:06:20 2008 From: kama at pvp.se (kama) Date: Fri Nov 28 02:06:27 2008 Subject: hpacucli on 7.0/amd64 not working In-Reply-To: References: Message-ID: <20081128102808.Y20485@ns1.as.pvp.se> Hi, I can confirm this problem with amd64 in FreeBSD 7.1. I have HP DL380 G5 w P400 and P800 array cards. /Bjorn On Sat, 22 Nov 2008, Kevin Day wrote: > > Has anyone managed to get hpacucli working on amd64 in 7.0? I've got > an HP DL185 G5 with HP's E200 RAID card in it. I had hpacucli working > okay in 6.3/i386, and it works fine in a 64 bit Linux boot, but not in > 7.0/amd64: > > # /usr/local/sbin/hpacucli > .P Array Configuration Utility CLI .2. > Detecting Controllers...In AddChild:0x838c180 > In AddChild child doesnot exists:0x838c180 > Iam inside findDevce > The device enumerated now is 0x838c180 > In Reenumerate Childern Eoption is 1 > This device can discover children > Thu Nov 13 15:29:40 2008 > Operation Call: OperationSetAllowedControllerDiscovery > Thu Nov 13 15:29:40 2008 > Operation Call: OperationReadSystemInfo > Thu Nov 13 15:29:40 2008 > Operation Call: OperationCaptureConfigurationMutex > Thu Nov 13 15:29:40 2008 > Operation Call: OperationReleaseConfigurationMutex > Thu Nov 13 15:29:40 2008 > Operation Call: OperationDiscoverHostBusAdapters > Thu Nov 13 15:29:40 2008 > Operation Call: OperationDiscoverNonFibreHBA > __TRACE_CODE*1310* pBmicRequest->wCommandStatus=0 /usr/home/user/ > im453_new/im453/.s_/LINUX/src/lxioctlciss.cpp > zzz freebsd Bus = 5, devfn = 64, dev = 8, fn = 0 > freebsd, Looking for bus 5, device 8, function 0 > ioctl: Inappropriate ioctl for device > Thu Nov 13 15:29:40 2008 > Operation Call: OperationDiscoverInternalArrayControllers > zzz freebsd Bus = 5, devfn = 64, dev = 8, fn = 0 > freebsd, Looking for bus 5, device 8, function 0 > ioctl: Inappropriate ioctl for device > Thu Nov 13 15:29:40 2008 > Operation Call: OperationDiscoverChildren > The device enumerated now is 0x838c180 > In Reenumerate Childern Eoption is 2 > In Enumerate Child nodes > Thu Nov 13 15:29:40 2008 > Operation Call: OperationSystemPostProcess > Pchild in GenerateXML:0x8386d80 > Pchild in GenerateXML:0x0 > Thu Nov 13 15:29:40 2008 > Operation Call: OperationCaptureConfigurationMutex > Thu Nov 13 15:29:40 2008 > > Thu Nov 13 15:29:40 2008 > OperationCaptureConfigurationMutex > Thu Nov 13 15:29:40 2008 > ModRoot137888128-System137937280 > Thu Nov 13 15:29:40 2008 > > Done. > Type "help" for a list of supported commands. > Type "exit" to close the console. > > => controller all show > The device enumerated now is 0x838c180 > In Reenumerate Childern Eoption is 1 > This device can discover children > Thu Nov 13 15:30:06 2008 > Operation Call: OperationSetAllowedControllerDiscovery > Thu Nov 13 15:30:06 2008 > Operation Call: OperationReadSystemInfo > Thu Nov 13 15:30:06 2008 > Operation Call: OperationCaptureConfigurationMutex > Thu Nov 13 15:30:06 2008 > Operation Call: OperationReleaseConfigurationMutex > Thu Nov 13 15:30:06 2008 > Operation Call: OperationDiscoverHostBusAdapters > Thu Nov 13 15:30:06 2008 > Operation Call: OperationDiscoverNonFibreHBA > zzz freebsd Bus = 5, devfn = 64, dev = 8, fn = 0 > freebsd, Looking for bus 5, device 8, function 0 > ioctl: Inappropriate ioctl for device > Thu Nov 13 15:30:06 2008 > Operation Call: OperationDiscoverInternalArrayControllers > zzz freebsd Bus = 5, devfn = 64, dev = 8, fn = 0 > freebsd, Looking for bus 5, device 8, function 0 > ioctl: Inappropriate ioctl for device > Thu Nov 13 15:30:06 2008 > Operation Call: OperationDiscoverChildren > The device enumerated now is 0x838c180 > In Reenumerate Childern Eoption is 2 > In Enumerate Child nodes > Thu Nov 13 15:30:06 2008 > Operation Call: OperationSystemPostProcess > Pchild in GenerateXML:0x8386d80 > Pchild in GenerateXML:0x0 > > Error: No controllers detected. > > Anyone managed to get this to work? If not, any ideas as to what's > going on? I'm guessing the "inappropriate ioctl for device" is > significant here. I tried emailing the listed HP contact for hpacucli > (v.sri.sai.ganesh at hp.com) but didn't get any reply. > > -- Kevin > > > ciss0: port 0xe800-0xe8ff mem > 0xdef80000-0xdeffffff,0xdef78000-0xdef7ffff irq 35 at device 8.0 on pci5 > > ciss0@pci0:5:8:0: class=0x010400 card=0x3212103c chip=0x3238103c > rev=0x00 hdr=0x00 > vendor = 'Hewlett-Packard Company' > device = 'Smart Array E200/E200i Controller' > class = mass storage > subclass = RAID > cap 01[c0] = powerspec 2 supports D0 D1 D3 current D0 > cap 05[cc] = MSI supports 2 messages, 64 bit > cap 07[dc] = PCI-X 64-bit supports 133MHz, 4096 burst read, 1 > split transaction > > > > _______________________________________________ > freebsd-scsi@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-scsi > To unsubscribe, send any mail to "freebsd-scsi-unsubscribe@freebsd.org" > From toasty at dragondata.com Fri Nov 28 10:58:36 2008 From: toasty at dragondata.com (Kevin Day) Date: Fri Nov 28 10:58:43 2008 Subject: hpacucli on 7.0/amd64 not working In-Reply-To: <20081128102808.Y20485@ns1.as.pvp.se> References: <20081128102808.Y20485@ns1.as.pvp.se> Message-ID: <31429EA0-46F4-4D18-9F62-8AE3CFDF3ED8@dragondata.com> Doing some digging with ktrace showed that it was /dev/pci it was having problems with. I've tried nudging a few of my HP contacts into seeing if they can do anything. No luck so far. On Nov 28, 2008, at 3:33 AM, kama wrote: > > Hi, I can confirm this problem with amd64 in FreeBSD 7.1. > > I have HP DL380 G5 w P400 and P800 array cards. > > /Bjorn > > On Sat, 22 Nov 2008, Kevin Day wrote: > >> >> Has anyone managed to get hpacucli working on amd64 in 7.0? I've got >> an HP DL185 G5 with HP's E200 RAID card in it. I had hpacucli working >> okay in 6.3/i386, and it works fine in a 64 bit Linux boot, but not >> in >> 7.0/amd64: >>