gmirror/disk problems!

Johan Ström johan at stromnet.org
Sun Feb 12 05:22:17 PST 2006


On 10 feb 2006, at 07.43, Johan Ström wrote:

>
> On 10 feb 2006, at 07.15, Johan Ström wrote:
>
>> Hi list!
>>
>> I've had problems with gmirror before (thread "Page
>> fault, GEOM problem??"). My gmirror crashed, and the box
>> completely froze.
>> Now I have a new mobo, and it has been working great since (no
>> crashes, and I get a decent 40-50 MB/s read/write instead of ~10-20).
>> This morning I woke up to this:
>>
>>
>> subdisk4: detached
>> ad4: detached
>> unknown: TIMEOUT - READ_DMA retrying (1 retry left) LBA=187595536
>> unknown: timeout waiting to issue command
>> unknown: error issueing READ_DMA command
>> GEOM_MIRROR: Device gm0s1: provider ad4s1 disconnected.
>> GEOM_MIRROR: Request failed (error=6). ad4s1[WRITE 
>> (offset=134373376, length=16384)]
>> GEOM_MIRROR: Request failed (error=6). ad4s1[WRITE 
>> (offset=134438912, length=16384)]
>> GEOM_MIRROR: Request failed (error=6). ad4s1[WRITE 
>> (offset=268591104, length=16384)]
>> GEOM_MIRROR: Request failed (error=6). ad4s1[WRITE 
>> (offset=268607488, length=16384)]
>> GEOM_MIRROR: Request failed (error=6). ad4s1[WRITE 
>> (offset=268656640, length=16384)]
>> GEOM_MIRROR: Request failed (error=6). ad4s1[WRITE 
>> (offset=5966399488, length=2048)]
>> GEOM_MIRROR: Request failed (error=5). ad4s1[READ 
>> (offset=96048882176, length=32768)]
>>
>> Just like "old times"... However, no page faults! Yay. But what
>> is going on here? Why does the ATA controller (or whatever) think it
>> needs to detach my disk? And how do I reattach it? I have tried
>> some stuff with atacontrol:
>>
>> $ atacontrol list
>> ATA channel 0:
>>     Master: acd0 <CD-ROM CDU701-F/1.0q> ATA/ATAPI revision 0
>>     Slave:       no device present
>> ATA channel 1:
>>     Master:      no device present
>>     Slave:       no device present
>> ATA channel 2:
>>     Master:      no device present
>>     Slave:       no device present
>> ATA channel 3:
>>     Master:  ad6 <Maxtor 7L300S0/BANC1G10> Serial ATA v1.0
>>     Slave:       no device present
>> $ atacontrol attach ata2
>> atacontrol: ioctl(IOCATAATTACH): File exists
>> $ atacontrol reinit ata2
>> < here I get a long system-wide block >
>> Master:      no device present
>> Slave:       no device present
>> $
>>
>> Okay, so no luck reinitializing it. I don't really want to reboot the box
>> each time this happens. But I'm happy that it doesn't crash
>> totally anymore, heh...
>>
>> dmesg of current system:
>
> Feb  2 19:39:09 elfi syslogd: kernel boot file is /boot/kernel/kernel
> Feb  2 19:39:09 elfi kernel: Copyright (c) 1992-2005 The FreeBSD  
> Project.
> Feb  2 19:39:09 elfi kernel: Copyright (c) 1979, 1980, 1983, 1986,  
> 1988, 1989, 1991, 1992, 1993, 1994
> Feb  2 19:39:09 elfi kernel: The Regents of the University of  
> California. All rights reserved.
> Feb  2 19:39:09 elfi kernel: FreeBSD 6.0-RELEASE #2: Thu Dec  1  
> 20:18:30 CET 2005
> Feb  2 19:39:09 elfi kernel: johan at elfi.stromnet.org:/usr/obj/usr/ 
> src/sys/GENERIC
> Feb  2 19:39:09 elfi kernel: ACPI APIC Table: <A M I  OEMAPIC >
> Feb  2 19:39:09 elfi kernel: Timecounter "i8254" frequency 1193182  
> Hz quality 0
> Feb  2 19:39:09 elfi kernel: CPU: AMD Athlon(tm) XP  (1200.01-MHz  
> 686-class CPU)
> Feb  2 19:39:09 elfi kernel: Origin = "AuthenticAMD"  Id = 0x662   
> Stepping = 2
> Feb  2 19:39:09 elfi kernel:  
> Features=0x383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PG 
> E,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE>
> Feb  2 19:39:09 elfi kernel: AMD Features=0xc0480800<SYSCALL,MP,MMX 
> +,3DNow+,3DNow>
> Feb  2 19:39:09 elfi kernel: real memory  = 536674304 (511 MB)
> Feb  2 19:39:09 elfi kernel: avail memory = 515833856 (491 MB)
> Feb  2 19:39:09 elfi kernel: ioapic0 <Version 1.1> irqs 0-23 on  
> motherboard
> Feb  2 19:39:09 elfi kernel: npx0: [FAST]
> Feb  2 19:39:09 elfi kernel: npx0: <math processor> on motherboard
> Feb  2 19:39:09 elfi kernel: npx0: INT 16 interface
> Feb  2 19:39:09 elfi kernel: acpi0: <A M I OEMRSDT> on motherboard
> Feb  2 19:39:09 elfi kernel: acpi0: Power Button (fixed)
> Feb  2 19:39:09 elfi kernel: pci_link0: <ACPI PCI Link LNKA> irq 0  
> on acpi0
> Feb  2 19:39:09 elfi kernel: pci_link1: <ACPI PCI Link LNKB> irq 5  
> on acpi0
> Feb  2 19:39:09 elfi kernel: pci_link2: <ACPI PCI Link LNKC> irq 0  
> on acpi0
> Feb  2 19:39:09 elfi kernel: pci_link3: <ACPI PCI Link LNKD> irq 0  
> on acpi0
> Feb  2 19:39:09 elfi kernel: pci_link4: <ACPI PCI Link LNKE> irq 11  
> on acpi0
> Feb  2 19:39:09 elfi kernel: pci_link5: <ACPI PCI Link LUS0> irq 5  
> on acpi0
> Feb  2 19:39:09 elfi kernel: pci_link6: <ACPI PCI Link LUS1> irq 5  
> on acpi0
> Feb  2 19:39:09 elfi kernel: pci_link7: <ACPI PCI Link LUS2> irq 3  
> on acpi0
> Feb  2 19:39:09 elfi kernel: pci_link8: <ACPI PCI Link LKLN> irq 5  
> on acpi0
> Feb  2 19:39:09 elfi kernel: pci_link9: <ACPI PCI Link LAPU> irq 0  
> on acpi0
> Feb  2 19:39:09 elfi kernel: pci_link10: <ACPI PCI Link LAUI> irq  
> 11 on acpi0
> Feb  2 19:39:09 elfi kernel: pci_link11: <ACPI PCI Link LKMO> irq 0  
> on acpi0
> Feb  2 19:39:09 elfi kernel: pci_link12: <ACPI PCI Link LKSM> irq 5  
> on acpi0
> Feb  2 19:39:09 elfi kernel: pci_link13: <ACPI PCI Link LFWR> irq 0  
> on acpi0
> Feb  2 19:39:09 elfi kernel: pci_link14: <ACPI PCI Link LETH> irq 0  
> on acpi0
> Feb  2 19:39:09 elfi kernel: pci_link15: <ACPI PCI Link LATA> irq  
> 10 on acpi0
> Feb  2 19:39:09 elfi kernel: pci_link16: <ACPI PCI Link LSHD> irq 0  
> on acpi0
> Feb  2 19:39:09 elfi kernel: Timecounter "ACPI-fast" frequency  
> 3579545 Hz quality 1000
> Feb  2 19:39:09 elfi kernel: acpi_timer0: <24-bit timer at  
> 3.579545MHz> port 0x4008-0x400b on acpi0
> Feb  2 19:39:09 elfi kernel: cpu0: <ACPI CPU> on acpi0
> Feb  2 19:39:09 elfi kernel: acpi_throttle0: <ACPI CPU Throttling>  
> on cpu0
> Feb  2 19:39:09 elfi kernel: pcib0: <ACPI Host-PCI bridge> port  
> 0xcf8-0xcff on acpi0
> Feb  2 19:39:09 elfi kernel: pci0: <ACPI PCI bus> on pcib0
> Feb  2 19:39:09 elfi kernel: agp0: <NVIDIA nForce2 AGP Controller>  
> mem 0xf8000000-0xfbffffff at device 0.0 on pci0
> Feb  2 19:39:09 elfi kernel: pci0: <memory, RAM> at device 0.1 (no  
> driver attached)
> Feb  2 19:39:09 elfi kernel: pci0: <memory, RAM> at device 0.2 (no  
> driver attached)
> Feb  2 19:39:09 elfi kernel: pci0: <memory, RAM> at device 0.3 (no  
> driver attached)
> Feb  2 19:39:09 elfi kernel: pci0: <memory, RAM> at device 0.4 (no  
> driver attached)
> Feb  2 19:39:09 elfi kernel: pci0: <memory, RAM> at device 0.5 (no  
> driver attached)
> Feb  2 19:39:09 elfi kernel: isab0: <PCI-ISA bridge> at device 1.0  
> on pci0
> Feb  2 19:39:09 elfi kernel: isa0: <ISA bus> on isab0
> Feb  2 19:39:09 elfi kernel: pci0: <serial bus, SMBus> at device  
> 1.1 (no driver attached)
> Feb  2 19:39:09 elfi kernel: ohci0: <OHCI (generic) USB controller>  
> mem 0xfebfb000-0xfebfbfff irq 20 at device 2.0 on pci0
> Feb  2 19:39:09 elfi kernel: ohci0: [GIANT-LOCKED]
> Feb  2 19:39:09 elfi kernel: usb0: OHCI version 1.0, legacy support
> Feb  2 19:39:09 elfi kernel: usb0: <OHCI (generic) USB controller>  
> on ohci0
> Feb  2 19:39:09 elfi kernel: usb0: USB revision 1.0
> Feb  2 19:39:09 elfi kernel: uhub0: nVidia OHCI root hub, class  
> 9/0, rev 1.00/1.00, addr 1
> Feb  2 19:39:09 elfi kernel: uhub0: 4 ports with 4 removable, self  
> powered
> Feb  2 19:39:09 elfi kernel: ohci1: <OHCI (generic) USB controller>  
> mem 0xfebfc000-0xfebfcfff irq 21 at device 2.1 on pci0
> Feb  2 19:39:09 elfi kernel: ohci1: [GIANT-LOCKED]
> Feb  2 19:39:09 elfi kernel: usb1: OHCI version 1.0, legacy support
> Feb  2 19:39:09 elfi kernel: usb1: <OHCI (generic) USB controller>  
> on ohci1
> Feb  2 19:39:09 elfi kernel: usb1: USB revision 1.0
> Feb  2 19:39:09 elfi kernel: uhub1: nVidia OHCI root hub, class  
> 9/0, rev 1.00/1.00, addr 1
> Feb  2 19:39:09 elfi kernel: uhub1: 4 ports with 4 removable, self  
> powered
> Feb  2 19:39:09 elfi kernel: ehci0: <EHCI (generic) USB 2.0  
> controller> mem 0xfebfdc00-0xfebfdcff irq 22 at device 2.2 on pci0
> Feb  2 19:39:09 elfi kernel: ehci0: [GIANT-LOCKED]
> Feb  2 19:39:09 elfi kernel: usb2: EHCI version 1.0
> Feb  2 19:39:09 elfi kernel: usb2: companion controllers, 4 ports  
> each: usb0 usb1
> Feb  2 19:39:09 elfi kernel: usb2: <EHCI (generic) USB 2.0  
> controller> on ehci0
> Feb  2 19:39:09 elfi kernel: usb2: USB revision 2.0
> Feb  2 19:39:09 elfi kernel: uhub2: nVidia EHCI root hub, class  
> 9/0, rev 2.00/1.00, addr 1
> Feb  2 19:39:09 elfi kernel: uhub2: 8 ports with 8 removable, self  
> powered
> Feb  2 19:39:09 elfi kernel: nve0: <NVIDIA nForce MCP5 Networking  
> Adapter> port 0xdc00-0xdc07 mem 0xfebfe000-0xfebfefff irq 20 at  
> device 4.0 on pci0
> Feb  2 19:39:09 elfi kernel: nve0: Ethernet address 00:13:d4:bf:5b:79
> Feb  2 19:39:09 elfi kernel: miibus0: <MII bus> on nve0
> Feb  2 19:39:09 elfi kernel: rlphy0: <RTL8201L 10/100 media  
> interface> on miibus0
> Feb  2 19:39:09 elfi kernel: rlphy0:  10baseT, 10baseT-FDX,  
> 100baseTX, 100baseTX-FDX, auto
> Feb  2 19:39:09 elfi kernel: nve0: Ethernet address: 00:13:d4:bf:5b:79
> Feb  2 19:39:09 elfi kernel: nve0: [GIANT-LOCKED]
> Feb  2 19:39:09 elfi kernel: pci0: <multimedia, audio> at device  
> 6.0 (no driver attached)
> Feb  2 19:39:09 elfi kernel: pcib1: <ACPI PCI-PCI bridge> at device  
> 8.0 on pci0
> Feb  2 19:39:09 elfi kernel: pci_link0: BIOS IRQ 22 for 0.11.INTA  
> is invalid
> Feb  2 19:39:09 elfi kernel: pci_link2: BIOS IRQ 21 for 0.6.INTA is  
> invalid
> Feb  2 19:39:09 elfi kernel: pci2: <ACPI PCI bus> on pcib1
> Feb  2 19:39:09 elfi kernel: pci2: <display, VGA> at device 6.0 (no  
> driver attached)
> Feb  2 19:39:09 elfi kernel: xl0: <3Com 3c905C-TX Fast Etherlink  
> XL> port 0xcc00-0xcc7f mem 0xfeafec00-0xfeafec7f irq 17 at device  
> 9.0 on pci2
> Feb  2 19:39:09 elfi kernel: miibus1: <MII bus> on xl0
> Feb  2 19:39:09 elfi kernel: xlphy0: <3c905C 10/100 internal PHY>  
> on miibus1
> Feb  2 19:39:09 elfi kernel: xlphy0:  10baseT, 10baseT-FDX,  
> 100baseTX, 100baseTX-FDX, auto
> Feb  2 19:39:09 elfi kernel: xl0: Ethernet address: 00:04:76:ef:c6:36
> Feb  2 19:39:09 elfi kernel: atapci0: <nVidia nForce2 MCP UDMA133  
> controller> port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xffa0-0xffaf  
> at device 9.0 on pci0
> Feb  2 19:39:09 elfi kernel: ata0: <ATA channel 0> on atapci0
> Feb  2 19:39:09 elfi kernel: ata1: <ATA channel 1> on atapci0
> Feb  2 19:39:09 elfi kernel: atapci1: <nVidia nForce2 MCP SATA150  
> controller> port  
> 0xec00-0xec07,0xe880-0xe883,0xe800-0xe807,0xe480-0xe483,0x7f00-0x7f0f, 
> 0x7c00-
> 0x7c7f irq 22 at device 11.0 on pci0
> Feb  2 19:39:09 elfi kernel: ata2: <ATA channel 0> on atapci1
> Feb  2 19:39:09 elfi kernel: ata3: <ATA channel 1> on atapci1
> Feb  2 19:39:09 elfi kernel: pcib2: <ACPI PCI-PCI bridge> at device  
> 30.0 on pci0
> Feb  2 19:39:09 elfi kernel: pci1: <ACPI PCI bus> on pcib2
> Feb  2 19:39:09 elfi kernel: acpi_button0: <Power Button> on acpi0
> Feb  2 19:39:09 elfi kernel: fdc0: <floppy drive controller (FDE)>  
> port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on acpi0
> Feb  2 19:39:09 elfi kernel: fdc0: [FAST]
> Feb  2 19:39:09 elfi kernel: ppc0: <ECP parallel printer port> port  
> 0x378-0x37f,0x778-0x77f irq 7 drq 3 on acpi0
> Feb  2 19:39:09 elfi kernel: ppc0: SMC-like chipset (ECP/EPP/PS2/ 
> NIBBLE) in COMPATIBLE mode
> Feb  2 19:39:09 elfi kernel: ppc0: FIFO with 16/16/9 bytes threshold
> Feb  2 19:39:09 elfi kernel: ppbus0: <Parallel port bus> on ppc0
> Feb  2 19:39:09 elfi kernel: plip0: <PLIP network interface> on ppbus0
> Feb  2 19:39:09 elfi kernel: lpt0: <Printer> on ppbus0
> Feb  2 19:39:09 elfi kernel: lpt0: Interrupt-driven port
> Feb  2 19:39:09 elfi kernel: ppi0: <Parallel I/O> on ppbus0
> Feb  2 19:39:09 elfi kernel: atkbdc0: <Keyboard controller (i8042)>  
> port 0x60,0x64 irq 1 on acpi0
> Feb  2 19:39:09 elfi kernel: atkbd0: <AT Keyboard> irq 1 on atkbdc0
> Feb  2 19:39:09 elfi kernel: kbd0 at atkbd0
> Feb  2 19:39:09 elfi kernel: atkbd0: [GIANT-LOCKED]
> Feb  2 19:39:09 elfi kernel: sio0: <16550A-compatible COM port>  
> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
> Feb  2 19:39:09 elfi kernel: sio0: type 16550A
> Feb  2 19:39:09 elfi kernel: pmtimer0 on isa0
> Feb  2 19:39:09 elfi kernel: orm0: <ISA Option ROMs> at iomem  
> 0xc0000-0xc7fff,0xc8000-0xc87ff on isa0
> Feb  2 19:39:09 elfi kernel: sc0: <System console> at flags 0x100  
> on isa0
> Feb  2 19:39:09 elfi kernel: sc0: VGA <16 virtual consoles,  
> flags=0x300>
> Feb  2 19:39:09 elfi kernel: sio1: configured irq 3 not in bitmap  
> of probed irqs 0
> Feb  2 19:39:09 elfi kernel: sio1: port may not be enabled
> Feb  2 19:39:09 elfi kernel: vga0: <Generic ISA VGA> at port  
> 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
> Feb  2 19:39:09 elfi kernel: Timecounter "TSC" frequency 1200006671  
> Hz quality 800
> Feb  2 19:39:09 elfi kernel: Timecounters tick every 1.000 msec
> Feb  2 19:39:09 elfi kernel: acd0: CDROM <CD-ROM CDU701-F/1.0q> at  
> ata0-master PIO4
> Feb  2 19:39:09 elfi kernel: ad4: 286188MB <Maxtor 7L300S0  
> BANC1G10> at ata2-master SATA150
> Feb  2 19:39:09 elfi kernel: ad6: 286188MB <Maxtor 7L300S0  
> BANC1G10> at ata3-master SATA150
> Feb  2 19:39:09 elfi kernel: GEOM_MIRROR: Device gm0s1 created  
> (id=4118114647).
> Feb  2 19:39:09 elfi kernel: GEOM_MIRROR: Device gm0s1: provider  
> ad4s1 detected.
> Feb  2 19:39:09 elfi kernel: GEOM_MIRROR: Device gm0s1: provider  
> ad6s1 detected.
> Feb  2 19:39:09 elfi kernel: GEOM_MIRROR: Device gm0s1: provider  
> ad6s1 activated.
> Feb  2 19:39:09 elfi kernel: Root mount waiting for: GMIRROR
> Feb  2 19:39:09 elfi kernel: GEOM_MIRROR: Device gm0s1: provider  
> ad4s1 activated.
> Feb  2 19:39:09 elfi kernel: GEOM_MIRROR: Device gm0s1: provider  
> mirror/gm0s1 launched.
>
>
> There we go. :) The last one was from a previous boot, before I pulled
> the Promise card out... It has worked fine since (7 days uptime).
>
>>
>> I could try to move the disks to the Promise SATA2 TX4 card I
>> bought for the old mobo (which didn't have SATA)... But I'd rather
>> find the problem ;)
>>
>> Hope someone can help.
>> Thanks
>> Johan
>>
>

I tried some more to revive the disconnected disk, with no success.
I pulled it out and plugged it back in again, but it was still not
detected. I tried all sorts of combinations of reinit, attach, detach,
etc. with atacontrol... Finally I gave up and rebooted the box, and now
it's rebuilding again...
Does anyone have any clue why this is happening? Okay, it's better than
before, no crashing. But losing one drive in a gmirror and having
to reboot to fix it is not good.
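
In case it helps anyone reading the log quoted above, here is a minimal
sketch (not something I ran on the box) that tallies the GEOM_MIRROR
"Request failed" lines per provider, operation and error. It assumes the
kernel lines are unwrapped, one message per line, as they would be in the
system log rather than in this mail. The function name and the sample
text are made up for illustration. The error numbers map to EIO (5) and
ENXIO (6) in FreeBSD's <sys/errno.h>.

```python
import re
from collections import Counter

# Matches one unwrapped kernel message, for example:
# GEOM_MIRROR: Request failed (error=6). ad4s1[WRITE(offset=134373376, length=16384)]
# Optional whitespace before "(offset=" is tolerated.
FAILED_RE = re.compile(
    r"GEOM_MIRROR: Request failed \(error=(\d+)\)\.\s*"
    r"(\w+)\[(READ|WRITE)\s*\(offset=(\d+), length=(\d+)\)\]"
)

# errno names from FreeBSD's <sys/errno.h>; only the two seen in the log.
ERRNO_NAMES = {5: "EIO", 6: "ENXIO"}


def summarise_failures(log_text):
    """Count failed requests per (provider, operation, errno name)."""
    counts = Counter()
    for match in FAILED_RE.finditer(log_text):
        err = int(match.group(1))
        provider = match.group(2)
        operation = match.group(3)
        # Fall back to the raw number for errors not in the small table.
        counts[(provider, operation, ERRNO_NAMES.get(err, str(err)))] += 1
    return counts


# Hypothetical sample built from three of the messages quoted above.
SAMPLE = """\
GEOM_MIRROR: Request failed (error=6). ad4s1[WRITE(offset=134373376, length=16384)]
GEOM_MIRROR: Request failed (error=6). ad4s1[WRITE(offset=134438912, length=16384)]
GEOM_MIRROR: Request failed (error=5). ad4s1[READ(offset=96048882176, length=32768)]
"""

if __name__ == "__main__":
    for key, count in sorted(summarise_failures(SAMPLE).items()):
        print(key, count)
```

The writes failing with ENXIO and the read failing with EIO all hit
ad4s1, which at least matches the disk having been detached underneath
the mirror.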

Thanks


More information about the freebsd-stable mailing list