i386/79686: Spurious notebook disk errors from ATA driver.

Dermot Tynan dtynan at kalopa.com
Fri Apr 8 08:10:06 PDT 2005


>Number:         79686
>Category:       i386
>Synopsis:       Spurious notebook disk errors from ATA driver.
>Confidential:   no
>Severity:       serious
>Priority:       medium
>Responsible:    freebsd-i386
>State:          open
>Quarter:        
>Keywords:       
>Date-Required:
>Class:          sw-bug
>Submitter-Id:   current-users
>Arrival-Date:   Fri Apr 08 15:10:05 GMT 2005
>Closed-Date:
>Last-Modified:
>Originator:     Dermot Tynan
>Release:        FreeBSD 5.3-RELEASE i386
>Organization:
Kalopa Internet Solutions
>Environment:
System: FreeBSD freddie.kalopa.com 5.3-RELEASE FreeBSD 5.3-RELEASE #0: Fri Nov 5 04:19:18 UTC 2004 root at harlow.cse.buffalo.edu:/usr/obj/usr/src/sys/GENERIC i386


>Description:
	I have been getting random disk errors since upgrading from
	5.2.1 to 5.3-RELEASE.  My 30GB Toshiba 2.5" disk was fine
	until I upgraded, and then I started getting "ad0: TIMEOUT -
	READ_DMA retrying" messages.  I assumed it was a bad disk,
	so I replaced it with a new Seagate 40GB.  A week later,
	I'm getting the same errors.  This could be related to bug
	i386/78517.  I'm guessing that either the timeout value is
	too strict or something more serious is wrong.

	The system was re-installed from the official 5.3 release
	CDs.  The earlier disk errors were reported after a source
	upgrade (make buildworld, etc.) from 5.2.1 to 5.3.

	I could be wrong, but it seems strange to be seeing disk
	errors on two separate drives when the machine has been
	running FreeBSD without problems for two years.

	I've set the severity to 'serious' because I suspect that the
	writes reported as failed really were lost rather than
	retried, even though I doubt the disk itself is faulty.

	The following is the dmesg output:

Copyright (c) 1992-2004 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
	The Regents of the University of California. All rights reserved.
FreeBSD 5.3-RELEASE #0: Fri Nov  5 04:19:18 UTC 2004
    root at harlow.cse.buffalo.edu:/usr/obj/usr/src/sys/GENERIC
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: Intel(R) Pentium(R) III Mobile CPU      1200MHz (1196.02-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0x6b1  Stepping = 1
  Features=0x383f9ff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE>
real memory  = 268238848 (255 MB)
avail memory = 252821504 (241 MB)
npx0: [FAST]
npx0: <math processor> on motherboard
npx0: INT 16 interface
acpi0: <COMPAQ CPQ0030> on motherboard
acpi0: Power Button (fixed)
Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
acpi_timer0: <24-bit timer at 3.579545MHz> port 0x1008-0x100b on acpi0
cpu0: <ACPI CPU (3 Cx states)> on acpi0
acpi_tz0: <Thermal Zone> on acpi0
pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0
pci0: <ACPI PCI bus> on pcib0
agp0: <Intel 82830 host to AGP bridge> mem 0x60000000-0x6fffffff at device 0.0 on pci0
pcib1: <ACPI PCI-PCI bridge> at device 1.0 on pci0
pci1: <ACPI PCI bus> on pcib1
drm0: <ATI Radeon LY Mobility M6> port 0x3000-0x30ff mem 0x40200000-0x4020ffff,0x48000000-0x4fffffff irq 11 at device 0.0 on pci1
info: [drm] AGP at 0x60000000 256MB
info: [drm] Initialized radeon 1.11.0 20020828 on minor 0
uhci0: <Intel 82801CA/CAM (ICH3) USB controller USB-A> port 0x4000-0x401f irq 11 at device 29.0 on pci0
uhci0: [GIANT-LOCKED]
usb0: <Intel 82801CA/CAM (ICH3) USB controller USB-A> on uhci0
usb0: USB revision 1.0
uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
uhci1: <Intel 82801CA/CAM (ICH3) USB controller USB-B> port 0x4020-0x403f irq 11 at device 29.1 on pci0
uhci1: [GIANT-LOCKED]
usb1: <Intel 82801CA/CAM (ICH3) USB controller USB-B> on uhci1
usb1: USB revision 1.0
uhub1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
uhci2: <Intel 82801CA/CAM (ICH3) USB controller USB-C> port 0x4040-0x405f irq 11 at device 29.2 on pci0
uhci2: [GIANT-LOCKED]
usb2: <Intel 82801CA/CAM (ICH3) USB controller USB-C> on uhci2
usb2: USB revision 1.0
uhub2: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub2: 2 ports with 2 removable, self powered
pcib2: <ACPI PCI-PCI bridge> at device 30.0 on pci0
pci2: <ACPI PCI bus> on pcib2
cbb0: <TI1420 PCI-CardBus Bridge> mem 0x40000000-0x40000fff irq 11 at device 3.0 on pci2
cardbus0: <CardBus bus> on cbb0
pccard0: <16-bit PCCard bus> on cbb0
cbb1: <TI1420 PCI-CardBus Bridge> mem 0x40080000-0x40080fff irq 11 at device 3.1 on pci2
cardbus1: <CardBus bus> on cbb1
pccard1: <16-bit PCCard bus> on cbb1
pci2: <simple comms> at device 4.0 (no driver attached)
fxp0: <Intel 82801CAM (ICH3) Pro/100 VM Ethernet> port 0x2800-0x283f mem 0x40100000-0x40100fff irq 11 at device 8.0 on pci2
miibus0: <MII bus> on fxp0
inphy0: <i82562EM 10/100 media interface> on miibus0
inphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fxp0: Ethernet address: 00:02:a5:b6:57:65
pcm0: <ESS Technology Allegro-1> port 0x2400-0x24ff irq 11 at device 9.0 on pci2
pcm0: failed to enable memory mapping!
pcm0: [GIANT-LOCKED]
pcm0: <ESS Technology ES1988 AC97 Codec>
isab0: <PCI-ISA bridge> at device 31.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <Intel ICH3 UDMA100 controller> port 0x4060-0x406f,0x376,0x170-0x177,0x3f6,0x1f0-0x1f7 irq 11 at device 31.1 on pci0
ata0: channel #0 on atapci0
ata1: channel #1 on atapci0
acpi_cmbat0: <Control Method Battery> on acpi0
acpi_cmbat1: <Control Method Battery> on acpi0
acpi_acad0: <AC Adapter> on acpi0
acpi_button0: <Sleep Button> on acpi0
acpi_lid0: <Control Method Lid Switch> on acpi0
atkbdc0: <Keyboard controller (i8042)> port 0x64,0x60 irq 1 on acpi0
atkbd0: <AT Keyboard> irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
psm0: <PS/2 Mouse> irq 12 on atkbdc0
psm0: [GIANT-LOCKED]
psm0: model IntelliMouse Explorer, device ID 4
sio0: <Standard PC COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
sio0: type 16550A
fdc0: <floppy drive controller (FDE)> port 0x3f7,0x3f0-0x3f5 irq 6 drq 2 on acpi0
fdc0: [FAST]
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
sio1: <Generic IRDA-compatible device> port 0x100-0x107,0x3e8-0x3ef irq 3 drq 1 on acpi0
sio1: type 16550A
ppc0: <ECP parallel printer port> port 0x778-0x77a,0x378-0x37f irq 7 drq 3 on acpi0
ppc0: Generic chipset (EPP/NIBBLE) in COMPATIBLE mode
ppbus0: <Parallel port bus> on ppc0
plip0: <PLIP network interface> on ppbus0
lpt0: <Printer> on ppbus0
lpt0: Interrupt-driven port
ppi0: <Parallel I/O> on ppbus0
orm0: <ISA Option ROM> at iomem 0xc0000-0xcefff on isa0
pmtimer0 on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
Timecounter "TSC" frequency 1196021820 Hz quality 800
Timecounters tick every 10.000 msec
pcm0: Unknown HWVOL event
ad0: 38154MB <ST94011A/3.05> [77520/16/63] at ata0-master UDMA100
acd0: CDRW <SD-R2102/1A08> at ata1-master PIO4
Mounting root from ufs:/dev/ad0s1a
ad0: TIMEOUT - WRITE_DMA retrying (2 retries left) LBA=21291391
ad0: FAILURE - WRITE_DMA timed out
ad0: TIMEOUT - WRITE_DMA retrying (2 retries left) LBA=262495
ad0: FAILURE - WRITE_DMA timed out
ad0: TIMEOUT - READ_DMA retrying (2 retries left) LBA=33691071
ad0: FAILURE - READ_DMA timed out
ad0: FAILURE - READ_DMA status=51<READY,DSC,ERROR> error=40<UNCORRECTABLE> LBA=33691071
ad0: FAILURE - READ_DMA status=51<READY,DSC,ERROR> error=40<UNCORRECTABLE> LBA=33691071
ad0: TIMEOUT - WRITE_DMA retrying (2 retries left) LBA=3979647
ad0: FAILURE - WRITE_DMA timed out
ad0: TIMEOUT - READ_DMA retrying (2 retries left) LBA=33691071
ad0: FAILURE - READ_DMA timed out
ad0: TIMEOUT - WRITE_DMA retrying (2 retries left) LBA=66445503
ad0: FAILURE - WRITE_DMA timed out
ad0: TIMEOUT - WRITE_DMA retrying (2 retries left) LBA=262495
ad0: FAILURE - WRITE_DMA timed out
ad0: TIMEOUT - WRITE_DMA retrying (2 retries left) LBA=262495
ad0: FAILURE - WRITE_DMA timed out
ad0: TIMEOUT - WRITE_DMA retrying (2 retries left) LBA=262495
ad0: FAILURE - WRITE_DMA timed out
ad0: TIMEOUT - READ_DMA retrying (2 retries left) LBA=33691071
ad0: FAILURE - READ_DMA timed out
ad0: FAILURE - READ_DMA status=51<READY,DSC,ERROR> error=40<UNCORRECTABLE> LBA=33691071
cpu0: Performance states changed
cpu0: Performance states changed
ad0: TIMEOUT - READ_DMA retrying (2 retries left) LBA=33691071
ad0: FAILURE - READ_DMA timed out
cpu0: Performance states changed
cpu0: Performance states changed
ad0: TIMEOUT - READ_DMA retrying (2 retries left) LBA=33691008
ad0: FAILURE - READ_DMA timed out

Note that roughly 24 hours elapsed between boot and the last of the
errors reported above.
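
To make the pattern in the log above easier to see (for example, that
LBA 33691071 and LBA 262495 recur), the ad0 messages can be tallied by
command and LBA.  The following is only a rough sketch, not part of the
ATA driver or any FreeBSD tool: the function name tally() and the two
regular expressions are illustrative, and it assumes the message layout
is exactly as shown in the dmesg output above.

```python
import re
from collections import Counter

# Assumed layout, taken from the log above:
#   ad0: TIMEOUT - WRITE_DMA retrying (2 retries left) LBA=21291391
#   ad0: FAILURE - WRITE_DMA timed out
HEAD_RE = re.compile(r"^(ad\d+): (TIMEOUT|FAILURE) - (\w+)")
LBA_RE = re.compile(r"LBA=(\d+)")


def tally(lines):
    """Count ATA error messages per (device, kind, command, LBA).

    "timed out" FAILURE lines carry no LBA, so their LBA is None.
    Lines that are not ATA disk error messages are ignored.
    """
    counts = Counter()
    for line in lines:
        line = line.strip()
        head = HEAD_RE.match(line)
        if head is None:
            continue
        dev, kind, cmd = head.groups()
        lba = LBA_RE.search(line)
        key = (dev, kind, cmd, int(lba.group(1)) if lba else None)
        counts[key] += 1
    return counts


if __name__ == "__main__":
    import sys

    # Usage (hypothetical file name): dmesg > dmesg.txt; python tally.py < dmesg.txt
    for key, n in tally(sys.stdin).most_common():
        print(n, *key)
```

Sorting by count puts the most frequently failing LBAs first, which may
help to tell a single bad region apart from timeouts spread across the
whole disk.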

>How-To-Repeat:

>Fix:

>Release-Note:
>Audit-Trail:
>Unformatted:

