ad8: TIMEOUT - WRITE_DMA errors UFS 7.0-RC1

Eilko Bos eilko at bos-zuidema.nl
Sun Mar 9 00:09:41 UTC 2008


>From the keyboard of Michael Haro, written on Sun, Jan 27, 2008 at 12:01:03AM -0800:
> > Can anyone else using 7.0 who hasn't already (especially those using ZFS)
> > check his/her /var/log/messages for disk TIMEOUTs or other disk error
> > messages?  If this is widespread, I think the chances re slim that it is a
> > hardware problem in every case.
> 
> I've had this problem with Hitachi sata drives using a promise sata controller.

I am using 2 160Gb Maxtor disks in geom_mirror. With 6.3 it runs fine. I 
upgraded to 7.0-RELEASE and after install problems started. Disk TIMEOUTs
freezed the box as soon as I initiated a lot of disk activity (e.g. make
buildworld of building a kernel).

I 'downgraded' the box to 6.3 again (had to rebuild the mirror because it was
touched by a newer gmirror) and now the problems have gone again. I have the
strong impression it is not hardware bot rather 7.0-RELEASE related.
Actually I want to get rid of the box at home (want to carry it to a datacenter)
but if it can be helpfull I am willing to have it for another week or two at
home to upgrade/downgrade/etc. with it.

My dmesg (6.3 again):
--------------------------
Copyright (c) 1992-2008 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
        The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 6.3-RELEASE #0: Wed Jan 16 04:45:45 UTC 2008
    root at dessler.cse.buffalo.edu:/usr/obj/usr/src/sys/SMP
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: Intel Pentium III (1000.04-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0x68a  Stepping = 10
  Features=0x383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE>
real memory  = 1610592256 (1535 MB)
avail memory = 1564733440 (1492 MB)
ACPI APIC Table: <ASUS   CUR-DLS >
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
 cpu0 (BSP): APIC ID:  3
 cpu1 (AP): APIC ID:  0
ioapic0 <Version 1.1> irqs 0-15 on motherboard
ioapic1 <Version 1.1> irqs 16-31 on motherboard
kbd1 at kbdmux0
ath_hal: 0.9.20.3 (AR5210, AR5211, AR5212, RF5111, RF5112, RF2413, RF5413)
hptrr: HPT RocketRAID controller driver v1.1 (Jan 16 2008 04:43:12)
acpi0: <ASUS CUR-DLS> on motherboard
acpi0: Power Button (fixed)
Timecounter "ACPI-safe" frequency 3579545 Hz quality 850
acpi_timer0: <32-bit timer at 3.579545MHz> port 0xe408-0xe40b on acpi0
cpu0: <ACPI CPU> on acpi0
cpu1: <ACPI CPU> on acpi0
acpi_button0: <Power Button> on acpi0
pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0
pci0: <ACPI PCI bus> on pcib0
fxp0: <Intel 82559 Pro/100 Ethernet> port 0xd800-0xd83f mem 0xfe000000-0xfe000fff,0xfd800000-0xfd8fffff at device 2.0 on pci0
miibus0: <MII bus> on fxp0
inphy0: <i82555 10/100 media interface> on miibus0
inphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fxp0: Ethernet address: 00:e0:18:47:01:35
em0: <Intel(R) PRO/1000 Network Connection Version - 6.7.2> port 0xd400-0xd43f mem 0xfd000000-0xfd01ffff,0xfc800000-0xfc81ffff irq 17 at device 4.0 on pci0
em0: Ethernet address: 00:07:e9:3e:e7:90
pci0: <display, VGA> at device 7.0 (no driver attached)
isab0: <PCI-ISA bridge> port 0xe800-0xe80f at device 15.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <ServerWorks ROSB4 UDMA33 controller> port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xb800-0xb80f at device 15.1 on pci0
ata0: <ATA channel 0> on atapci0
ata1: <ATA channel 1> on atapci0
ohci0: <OHCI (generic) USB controller> mem 0xfa000000-0xfa000fff irq 9 at device 15.2 on pci0
ohci0: [GIANT-LOCKED]
usb0: OHCI version 1.0, legacy support
usb0: <OHCI (generic) USB controller> on ohci0
usb0: USB revision 1.0
uhub0: (0x1166) OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 4 ports with 4 removable, self powered
pcib1: <ACPI Host-PCI bridge> on acpi0
pci1: <ACPI PCI bus> on pcib1
sym0: <1010-33> port 0xb400-0xb4ff mem 0xf9800000-0xf98003ff,0xf9000000-0xf9001fff irq 24 at device 5.0 on pci1
sym0: Symbios NVRAM, ID 7, Fast-80, LVD, parity checking
sym0: open drain IRQ line driver, using on-chip SRAM
sym0: using LOAD/STORE-based firmware.
sym0: handling phase mismatch from SCRIPTS.
sym0: [GIANT-LOCKED]
sym1: <1010-33> port 0xb000-0xb0ff mem 0xf8800000-0xf88003ff,0xf8000000-0xf8001fff irq 25 at device 5.1 on pci1
sym1: Symbios NVRAM, ID 7, Fast-80, LVD, parity checking
sym1: open drain IRQ line driver, using on-chip SRAM
sym1: using LOAD/STORE-based firmware.
sym1: handling phase mismatch from SCRIPTS.
sym1: [GIANT-LOCKED]
atkbdc0: <Keyboard controller (i8042)> port 0x60,0x64 irq 1 on acpi0
atkbd0: <AT Keyboard> irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
psm0: <PS/2 Mouse> irq 12 on atkbdc0
psm0: [GIANT-LOCKED]
psm0: model IntelliMouse, device ID 3
sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
sio0: type 16550A
sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0
sio1: type 16550A
ppc0: <ECP parallel printer port> port 0x378-0x37f,0x778-0x77a irq 7 drq 3 on acpi0
ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode
ppc0: FIFO with 16/16/8 bytes threshold
ppbus0: <Parallel port bus> on ppc0
plip0: <PLIP network interface> on ppbus0
lpt0: <Printer> on ppbus0
lpt0: Interrupt-driven port
ppi0: <Parallel I/O> on ppbus0
fdc0: <floppy drive controller> port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on acpi0
fdc0: [FAST]
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
pmtimer0 on isa0
orm0: <ISA Option ROMs> at iomem 0xc0000-0xc7fff,0xc8000-0xc97ff,0xcc000-0xcc7ff on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
Timecounters tick every 1.000 msec
hptrr: no controller detected.
Waiting 5 seconds for SCSI devices to settle
(noperiph:sym0:0:-1:-1): SCSI BUS reset delivered.
(noperiph:sym1:0:-1:-1): SCSI BUS reset delivered.
ad0: 152627MB <MAXTOR STM3160215A 3.AAD> at ata0-master UDMA33
ad1: 152627MB <MAXTOR STM3160215A 3.AAD> at ata0-slave UDMA33
acd0: CDROM <COMPAQ CD-ROM SN-124/N104> at ata1-master PIO4

- - - - - - - - - - - - - -
Rebuilding the mirror:
GEOM_MIRROR: Device gm0 created (id=24929696).
GEOM_MIRROR: Device gm0: provider ad0 detected.
GEOM_MIRROR: Device gm0: provider ad0 activated.
GEOM_MIRROR: Device gm0: provider mirror/gm0 launched.
GEOM_MIRROR: Kernel module is too old to handle metadata from ad1.
SMP: AP CPU #1 Launched!
Trying to mount root from ufs:/dev/mirror/gm0s1a
em0: link state changed to UP
GEOM_MIRROR: Device gm0: provider ad1 detected.
GEOM_MIRROR: Device gm0: rebuilding provider ad1.


Grtz,
--
Eilko Bos.


More information about the freebsd-stable mailing list