amd64/73252: ad6: WARNING - READ_DMA interrupt was seen but timeout fired after READ_DMA UDMA ICRC error

Stephan Peijnik speijnik at gmail.com
Thu Oct 28 12:20:20 PDT 2004


>Number:         73252
>Category:       amd64
>Synopsis:       ad6: WARNING - READ_DMA interrupt was seen but timeout fired after READ_DMA UDMA ICRC error
>Confidential:   no
>Severity:       critical
>Priority:       low
>Responsible:    freebsd-amd64
>State:          open
>Quarter:        
>Keywords:       
>Date-Required:
>Class:          sw-bug
>Submitter-Id:   current-users
>Arrival-Date:   Thu Oct 28 19:20:19 GMT 2004
>Closed-Date:
>Last-Modified:
>Originator:     Stephan Peijnik
>Release:        
>Organization:
None
>Environment:
FreeBSD lucifer.home.lan 5.3-BETA7 FreeBSD 5.3-BETA7 #0: Fri Oct 15 21:52:21 CEST 2004 speijnik at lucifer.home.lan:/usr/obj/usr/src/sys/LUCIFER amd64
>Description:
I have been experiencing this problem for quite some time now, with several different hard disk drives. After these messages appear, the system usually hangs (if the system HDD is also involved).

The log below shows the error messages, this time on my second HDD. I get the same errors on the first HDD as well, but there they hang the system before anything can be written to the log file, which is why I have no logs of those hangs.

When the system HDD is affected, the system freezes after a while (1-2 minutes) and the only remaining option is a hard reset.

This has happened with both PATA hard disks (Seagate Barracuda, 120G and 60G) and SATA hard disks (Samsung Spinpoint P80, 80G, no RAID).

/var/log/messages:

Oct 26 19:53:12 lucifer syslogd: kernel boot file is /boot/kernel/kernel
Oct 26 19:53:12 lucifer kernel: Copyright (c) 1992-2004 The FreeBSD Project.
Oct 26 19:53:12 lucifer kernel: Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
Oct 26 19:53:12 lucifer kernel: The Regents of the University of California. All rights reserved.
Oct 26 19:53:12 lucifer kernel: FreeBSD 5.3-BETA7 #0: Fri Oct 15 21:52:21 CEST 2004
Oct 26 19:53:12 lucifer kernel: speijnik at lucifer.home.lan:/usr/obj/usr/src/sys/LUCIFER
Oct 26 19:53:12 lucifer kernel: Timecounter "i8254" frequency 1193182 Hz quality 0
Oct 26 19:53:12 lucifer kernel: CPU: AMD Opteron(tm) Processor 240 (1404.56-MHz K8-class CPU)
Oct 26 19:53:12 lucifer kernel: Origin = "AuthenticAMD"  Id = 0xf51  Stepping = 1
Oct 26 19:53:12 lucifer kernel: Features=0x78bfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,MMX,FXSR,SSE,SSE2>
Oct 26 19:53:12 lucifer kernel: AMD Features=0xe0500800<SYSCALL,NX,MMX+,LM,3DNow+,3DNow>
Oct 26 19:53:12 lucifer kernel: real memory  = 1072627712 (1022 MB)
Oct 26 19:53:12 lucifer kernel: avail memory = 1026146304 (978 MB)
Oct 26 19:53:12 lucifer kernel: ACPI APIC Table: <VIAK8  AWRDACPI>
Oct 26 19:53:12 lucifer kernel: FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
Oct 26 19:53:12 lucifer kernel: cpu0 (BSP): APIC ID:  0
Oct 26 19:53:12 lucifer kernel: cpu1 (AP): APIC ID:  1
Oct 26 19:53:12 lucifer kernel: ioapic0: Changing APIC ID to 2
Oct 26 19:53:12 lucifer kernel: ioapic0 <Version 0.3> irqs 0-23 on motherboard
Oct 26 19:53:12 lucifer kernel: acpi0: <VIAK8 AWRDACPI> on motherboard
Oct 26 19:53:12 lucifer kernel: acpi0: Power Button (fixed)
Oct 26 19:53:12 lucifer kernel: Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
Oct 26 19:53:12 lucifer kernel: acpi_timer0: <24-bit timer at 3.579545MHz> port 0x4008-0x400b on acpi0
Oct 26 19:53:12 lucifer kernel: cpu0: <ACPI CPU> on acpi0
Oct 26 19:53:12 lucifer kernel: cpu1: <ACPI CPU> on acpi0
Oct 26 19:53:12 lucifer kernel: acpi_tz0: <Thermal Zone> on acpi0
Oct 26 19:53:12 lucifer kernel: acpi_button0: <Power Button> on acpi0
Oct 26 19:53:12 lucifer kernel: pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0
Oct 26 19:53:12 lucifer kernel: pci0: <ACPI PCI bus> on pcib0
Oct 26 19:53:12 lucifer kernel: pcib1: <PCI-PCI bridge> at device 1.0 on pci0
Oct 26 19:53:12 lucifer kernel: pci1: <PCI bus> on pcib1
Oct 26 19:53:12 lucifer kernel: pci1: <display, VGA> at device 0.0 (no driver attached)
Oct 26 19:53:12 lucifer kernel: bge0: <Broadcom BCM5705 Gigabit Ethernet, ASIC rev. 0x3003> mem 0xfa000000-0xfa00ffff irq 16 at device 11.0 on pci0
Oct 26 19:53:12 lucifer kernel: miibus0: <MII bus> on bge0
Oct 26 19:53:12 lucifer kernel: brgphy0: <BCM5705 10/100/1000baseTX PHY> on miibus0
Oct 26 19:53:12 lucifer kernel: brgphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX, 1000baseTX-FDX, auto
Oct 26 19:53:12 lucifer kernel: bge0: Ethernet address: 00:0c:76:7f:f1:be
Oct 26 19:53:12 lucifer kernel: atapci0: <VIA 6420 SATA150 controller> port 0xd400-0xd4ff,0xd000-0xd00f,0xcc00-0xcc03,0xc800-0xc807,0xc400-0xc403,0xc000-0xc007 irq 20 at device 15.0 on pci0
Oct 26 19:53:12 lucifer kernel: ata2: channel #0 on atapci0
Oct 26 19:53:12 lucifer kernel: ata3: channel #1 on atapci0
Oct 26 19:53:12 lucifer kernel: atapci1: <VIA 8237 UDMA133 controller> port 0xd800-0xd80f,0x376,0x170-0x177,0x3f6,0x1f0-0x1f7 at device 15.1 on pci0
Oct 26 19:53:12 lucifer kernel: ata0: channel #0 on atapci1
Oct 26 19:53:12 lucifer kernel: ata1: channel #1 on atapci1
Oct 26 19:53:12 lucifer kernel: uhci0: <VIA 83C572 USB controller> port 0xdc00-0xdc1f irq 21 at device 16.0 on pci0
Oct 26 19:53:12 lucifer kernel: uhci0: [GIANT-LOCKED]
Oct 26 19:53:12 lucifer kernel: usb0: <VIA 83C572 USB controller> on uhci0
Oct 26 19:53:12 lucifer kernel: usb0: USB revision 1.0
Oct 26 19:53:12 lucifer kernel: uhub0: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
Oct 26 19:53:12 lucifer kernel: uhub0: 2 ports with 2 removable, self powered
Oct 26 19:53:12 lucifer kernel: uhci1: <VIA 83C572 USB controller> port 0xe000-0xe01f irq 21 at device 16.1 on pci0
Oct 26 19:53:12 lucifer kernel: uhci1: [GIANT-LOCKED]
Oct 26 19:53:12 lucifer kernel: usb1: <VIA 83C572 USB controller> on uhci1
Oct 26 19:53:12 lucifer kernel: usb1: USB revision 1.0
Oct 26 19:53:12 lucifer kernel: uhub1: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
Oct 26 19:53:12 lucifer kernel: uhub1: 2 ports with 2 removable, self powered
Oct 26 19:53:12 lucifer kernel: uhci2: <VIA 83C572 USB controller> port 0xe400-0xe41f irq 21 at device 16.2 on pci0
Oct 26 19:53:12 lucifer kernel: uhci2: [GIANT-LOCKED]
Oct 26 19:53:12 lucifer kernel: usb2: <VIA 83C572 USB controller> on uhci2
Oct 26 19:53:12 lucifer kernel: usb2: USB revision 1.0
Oct 26 19:53:12 lucifer kernel: uhub2: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
Oct 26 19:53:12 lucifer kernel: uhub2: 2 ports with 2 removable, self powered
Oct 26 19:53:12 lucifer kernel: pci0: <serial bus, USB> at device 16.4 (no driver attached)
Oct 26 19:53:12 lucifer kernel: isab0: <PCI-ISA bridge> at device 17.0 on pci0
Oct 26 19:53:12 lucifer kernel: isa0: <ISA bus> on isab0
Oct 26 19:53:12 lucifer kernel: pcm0: <VIA VT8237> port 0xec00-0xecff irq 22 at device 17.5 on pci0
Oct 26 19:53:12 lucifer kernel: pcm0: [GIANT-LOCKED]
Oct 26 19:53:12 lucifer kernel: pcm0: <Avance Logic ALC200 AC97 Codec>
Oct 26 19:53:12 lucifer kernel: fdc0: <floppy drive controller> port 0x3f7,0x3f0-0x3f5 irq 6 drq 2 on acpi0
Oct 26 19:53:12 lucifer kernel: fdc0: [FAST]
Oct 26 19:53:12 lucifer kernel: fd0: <1440-KB 3.5" drive> on fdc0 drive 0
Oct 26 19:53:12 lucifer kernel: sio0: configured irq 4 not in bitmap of probed irqs 0
Oct 26 19:53:12 lucifer kernel: sio0: port may not be enabled
Oct 26 19:53:12 lucifer kernel: sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
Oct 26 19:53:12 lucifer kernel: sio0: type 16550A
Oct 26 19:53:12 lucifer kernel: sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0
Oct 26 19:53:12 lucifer kernel: sio1: type 16550A
Oct 26 19:53:12 lucifer kernel: atkbdc0: <Keyboard controller (i8042)> port 0x64,0x60 irq 1 on acpi0
Oct 26 19:53:12 lucifer kernel: atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
Oct 26 19:53:12 lucifer kernel: kbd0 at atkbd0
Oct 26 19:53:12 lucifer kernel: atkbd0: [GIANT-LOCKED]
Oct 26 19:53:12 lucifer kernel: psm0: <PS/2 Mouse> irq 12 on atkbdc0
Oct 26 19:53:12 lucifer kernel: psm0: [GIANT-LOCKED]
Oct 26 19:53:12 lucifer kernel: psm0: model IntelliMouse Explorer, device ID 4
Oct 26 19:53:12 lucifer kernel: orm0: <ISA Option ROMs> at iomem 0xd0000-0xd3fff,0xc0000-0xcefff on isa0
Oct 26 19:53:12 lucifer kernel: ppc0: cannot reserve I/O port range
Oct 26 19:53:12 lucifer kernel: sc0: <System console> at flags 0x100 on isa0
Oct 26 19:53:12 lucifer kernel: sc0: VGA <16 virtual consoles, flags=0x300>
Oct 26 19:53:12 lucifer kernel: vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
Oct 26 19:53:12 lucifer kernel: Timecounters tick every 0.976 msec
Oct 26 19:53:12 lucifer kernel: acd0: DVDROM <NEC DV-5500A/1.05> at ata1-master UDMA33
Oct 26 19:53:12 lucifer kernel: ad4: 76351MB <SAMSUNG SP0812C/SU100-27> [155127/16/63] at ata2-master SATA150
Oct 26 19:53:12 lucifer kernel: ad6: 76351MB <SAMSUNG SP0812C/SU100-27> [155127/16/63] at ata3-master SATA150
Oct 26 19:53:12 lucifer kernel: SMP: AP CPU #1 Launched!
Oct 26 19:53:12 lucifer kernel: Mounting root from ufs:/dev/ad4s1a
Oct 26 19:55:31 lucifer su: speijnik to root on /dev/ttyp0
Oct 26 22:01:28 lucifer kernel: ad6: WARNING - READ_DMA UDMA ICRC error (retrying request) LBA=147152363
Oct 26 22:01:28 lucifer kernel: ntfs_procfixups: magic doesn't match: 00000000 != 454c4946
Oct 26 22:01:28 lucifer kernel: ntfs_loadntnode: BAD MFT RECORD 2404
Oct 26 22:01:28 lucifer kernel: ntfs_findvattr: FAILED TO LOAD INO: 2404
Oct 26 22:01:28 lucifer kernel: ntfs_strategy: ntfs_readattr failed
Oct 26 22:01:28 lucifer kernel: vnode_pager_getpages: I/O read error
Oct 26 22:01:28 lucifer kernel: vm_fault: pager read error, pid 1312 (cp)
Oct 26 22:01:28 lucifer kernel: ntfs_procfixups: magic doesn't match: 00000000 != 454c4946
Oct 26 22:01:28 lucifer kernel: ntfs_loadntnode: BAD MFT RECORD 2405
Oct 26 22:01:28 lucifer kernel: ntfs_findvattr: FAILED TO LOAD INO: 2405
Oct 26 22:01:28 lucifer kernel: ntfs_strategy: ntfs_readattr failed
Oct 26 22:01:28 lucifer kernel: vnode_pager_getpages: I/O read error
Oct 26 22:01:28 lucifer kernel: vm_fault: pager read error, pid 1312 (cp)
Oct 26 22:01:28 lucifer kernel: ntfs_procfixups: magic doesn't match: 00000000 != 454c4946
Oct 26 22:01:28 lucifer kernel: ntfs_loadntnode: BAD MFT RECORD 2406
Oct 26 22:01:28 lucifer kernel: ntfs_findvattr: FAILED TO LOAD INO: 2406
Oct 26 22:01:28 lucifer kernel: ntfs_strategy: ntfs_readattr failed
Oct 26 22:01:28 lucifer kernel: vnode_pager_getpages: I/O read error
Oct 26 22:01:28 lucifer kernel: vm_fault: pager read error, pid 1312 (cp)
Oct 26 22:01:29 lucifer kernel: ad6: WARNING - READ_DMA UDMA ICRC error (retrying request) LBA=147166427
Oct 26 22:01:34 lucifer kernel: ad6: WARNING - READ_DMA interrupt was seen but timeout fired LBA=147166427
Oct 26 22:02:09 lucifer last message repeated 7 times
Oct 26 22:03:59 lucifer last message repeated 22 times
Oct 26 22:04:04 lucifer su: speijnik to root on /dev/ttyp4
Oct 26 22:04:04 lucifer kernel: ad6: WARNING - READ_DMA interrupt was seen but timeout fired LBA=147166427
Oct 26 22:04:39 lucifer last message repeated 7 times
Oct 26 22:04:54 lucifer last message repeated 3 times
Oct 26 22:04:56 lucifer su: speijnik to root on /dev/ttyp0
Oct 26 22:04:59 lucifer kernel: ad6: WARNING - READ_DMA interrupt was seen but timeout fired LBA=147166427
Oct 26 22:05:34 lucifer last message repeated 7 times
Oct 26 22:07:39 lucifer last message repeated 25 times
Oct 26 22:07:43 lucifer su: speijnik to root on /dev/ttyp2
Oct 26 22:07:44 lucifer kernel: ad6: WARNING - READ_DMA interrupt was seen but timeout fired LBA=147166427
Oct 26 22:08:14 lucifer last message repeated 6 times
Oct 26 22:10:14 lucifer last message repeated 24 times
Oct 26 22:16:39 lucifer last message repeated 77 times
Oct 26 22:16:39 lucifer su: speijnik to root on /dev/ttyp3
Oct 26 22:16:44 lucifer kernel: ad6: WARNING - READ_DMA interrupt was seen but timeout fired LBA=147166427
Oct 26 22:17:14 lucifer last message repeated 6 times
Oct 26 22:17:14 lucifer su: speijnik to root on /dev/ttyp0
Oct 26 22:17:19 lucifer kernel: ad6: WARNING - READ_DMA interrupt was seen but timeout fired LBA=147166427
Oct 26 22:17:54 lucifer last message repeated 7 times
Oct 26 22:19:59 lucifer last message repeated 25 times
Oct 26 22:30:03 lucifer last message repeated 121 times
Oct 26 22:40:08 lucifer last message repeated 121 times
Oct 26 22:50:13 lucifer last message repeated 121 times
Oct 26 23:00:13 lucifer last message repeated 120 times
Oct 26 23:10:13 lucifer last message repeated 120 times
Oct 26 23:20:13 lucifer last message repeated 120 times
Oct 26 23:30:13 lucifer last message repeated 120 times
Oct 26 23:40:18 lucifer last message repeated 121 times
Oct 26 23:50:18 lucifer last message repeated 120 times
Oct 26 23:59:58 lucifer last message repeated 116 times
Oct 27 00:00:03 lucifer kernel: ad6: WARNING - READ_DMA interrupt was seen but timeout fired LBA=147166427
Oct 27 00:00:37 lucifer last message repeated 7 times
Oct 27 00:02:38 lucifer last message repeated 24 times
Oct 27 00:12:42 lucifer last message repeated 121 times
Oct 27 00:22:47 lucifer last message repeated 121 times
Oct 27 00:32:47 lucifer last message repeated 120 times
Oct 27 00:42:47 lucifer last message repeated 120 times
Oct 27 00:52:47 lucifer last message repeated 120 times
Oct 27 01:02:47 lucifer last message repeated 120 times
Oct 27 01:12:47 lucifer last message repeated 120 times
Oct 27 01:22:47 lucifer last message repeated 120 times
Oct 27 01:32:47 lucifer last message repeated 120 times
Oct 27 01:42:46 lucifer last message repeated 120 times
Oct 27 01:52:51 lucifer last message repeated 121 times
Oct 27 02:02:51 lucifer last message repeated 120 times
Oct 27 02:12:51 lucifer last message repeated 120 times
Oct 27 02:22:51 lucifer last message repeated 120 times
Oct 27 02:32:51 lucifer last message repeated 120 times
Oct 27 02:42:51 lucifer last message repeated 120 times
Oct 27 02:52:51 lucifer last message repeated 120 times
Oct 27 03:02:50 lucifer last message repeated 120 times
Oct 27 03:12:50 lucifer last message repeated 120 times
Oct 27 03:22:50 lucifer last message repeated 120 times
Oct 27 03:32:50 lucifer last message repeated 120 times
Oct 27 03:42:50 lucifer last message repeated 120 times
Oct 27 03:52:50 lucifer last message repeated 120 times
Oct 27 04:02:50 lucifer last message repeated 120 times
Oct 27 04:12:55 lucifer last message repeated 121 times
Oct 27 04:22:55 lucifer last message repeated 120 times
Oct 27 04:32:55 lucifer last message repeated 120 times
Oct 27 04:42:54 lucifer last message repeated 120 times
Oct 27 04:52:54 lucifer last message repeated 120 times
Oct 27 05:02:54 lucifer last message repeated 120 times
Oct 27 05:12:54 lucifer last message repeated 120 times
Oct 27 05:22:54 lucifer last message repeated 120 times
Oct 27 05:32:54 lucifer last message repeated 120 times
Oct 27 05:42:54 lucifer last message repeated 120 times
Oct 27 05:52:54 lucifer last message repeated 120 times
Oct 27 06:02:53 lucifer last message repeated 120 times
Oct 27 06:12:53 lucifer last message repeated 120 times
Oct 27 06:22:58 lucifer last message repeated 121 times
Oct 27 06:32:58 lucifer last message repeated 120 times
Oct 27 06:42:58 lucifer last message repeated 120 times
Oct 27 06:52:58 lucifer last message repeated 120 times
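
To put a number on how often the timeout fired, the "last message repeated N times" lines have to be folded back into the message they refer to. The following is only a minimal sketch of such a tally; the function name `tally` and the regular expressions are illustrative, and it assumes each repeat line refers to the most recent non-repeat line in the same file.

```python
import re
from collections import Counter

# Matches "Mon DD HH:MM:SS host rest-of-line" as seen in /var/log/messages.
LINE = re.compile(r"^(\w{3}\s+\d+ \d\d:\d\d:\d\d) (\S+) (.*)$")
# Matches the repeat-compression line written by syslogd.
REPEAT = re.compile(r"^last message repeated (\d+) times?$")


def tally(lines):
    """Count each message, adding syslogd repeat counts to the previous one.

    Assumption: a repeat line always refers to the most recent
    non-repeat line in the same log file.
    """
    counts = Counter()
    previous = None  # text of the most recent non-repeat message
    for line in lines:
        match = LINE.match(line.rstrip("\n"))
        if not match:
            continue  # skip lines that are not in syslog format
        body = match.group(3)
        repeat = REPEAT.match(body)
        if repeat:
            if previous is not None:
                counts[previous] += int(repeat.group(1))
        else:
            counts[body] += 1
            previous = body
    return counts


# A few lines taken from the log above, used as sample input.
sample = [
    "Oct 26 22:01:34 lucifer kernel: ad6: WARNING - READ_DMA interrupt was seen but timeout fired LBA=147166427",
    "Oct 26 22:02:09 lucifer last message repeated 7 times",
    "Oct 26 22:03:59 lucifer last message repeated 22 times",
    "Oct 26 22:04:04 lucifer su: speijnik to root on /dev/ttyp4",
    "Oct 26 22:04:04 lucifer kernel: ad6: WARNING - READ_DMA interrupt was seen but timeout fired LBA=147166427",
]

for message, count in tally(sample).most_common():
    print(count, message)
```

To run it against the real log, pass an open file instead of the sample list, for example `tally(open("/var/log/messages"))`.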

As I wrote, if the error does not occur on the system HDD, the only problems it causes are a general slowdown of the system and the inability to unmount the affected drive/partition. It is also not an NTFS-only problem, as it happens with UFS partitions as well.

I am sure this is not a hardware problem, as everything works fine under other operating systems (including Fedora Core 2 x86_64, FreeBSD 5.x i386 and Windows XP 32/64-bit).

>How-To-Repeat:
Unknown; it happens randomly after some time.
However, it tends to happen under rather high CPU load (>50%) and/or during heavy HDD access. I got the errors above while copying about 20 files of 3-4M each.
>Fix:

>Release-Note:
>Audit-Trail:
>Unformatted:

