ichwd causes freeze instead of reset
Jeremy Chadwick
freebsd at jdc.parodius.com
Sat Aug 21 21:33:21 UTC 2010
On Sat, Aug 21, 2010 at 11:09:04PM +0200, Stefan Bethke wrote:
> Am 21.08.2010 um 23:02 schrieb Andriy Gapon:
>
> > on 21/08/2010 23:33 Stefan Bethke said the following:
> >> Hi,
> >>
> >> somewhat foolishly, I activated watchdogd and ichwd on a remote box, and
> >> while testing it (by suspending watchdogd), apparently the watchdog
> >> triggered. But instead of resetting, the machine does not react anymore on
> >> the serial console. I will have to wait until Monday to get physical access,
> >> so it might be hanging or just switched itself off; I have no way of telling
> >> right now.
> >>
> >> ichwd probes as: ichwd0: <Intel ICH7 watchdog timer> on isa0 ichwd0: Intel
> >> ICH7 watchdog timer (ICH7 or equivalent) ppc0: cannot reserve I/O port range
> >>
> >> (not sure why ppc0 is getting involved at that point.)
> >>
> >> FreeBSD lokschuppen.zs64.net 8.1-PRERELEASE FreeBSD 8.1-PRERELEASE #30: Thu
> >> Jul 15 12:58:20 UTC 2010
> >> root at lokschuppen.zs64.net:/usr/obj/usr/src/sys/EISENBOOT amd64
> >>
> >> Once the box is up again, is it worthwile trying ichwd again, should I try
> >> and use SW_WATCHDOG, or forget about it?
> >
> > Just test it more while having physical access before making any conclusions.
> > There could be a number of radically different possibilities ranging from
> > hardware peculiarities to configuration problems to pilot errors to etc.
>
> I guess what I'm looking for is some confirmation that ichwd is working properly on this particular hardware: Asus Pundit P4 P5G41 with a G41 chipset.
>
> Below are pciconv -lvb and dmesg:
>
> hostb0 at pci0:0:0:0: class=0x060000 card=0x836d1043 chip=0x2e308086 rev=0x03 hdr=0x00
> vendor = 'Intel Corporation'
> class = bridge
> subclass = HOST-PCI
> vgapci0 at pci0:0:2:0: class=0x030000 card=0x836d1043 chip=0x2e328086 rev=0x03 hdr=0x00
> vendor = 'Intel Corporation'
> device = 'Intel G41 express graphics (PCIVEN_8086&DEV_2E32&SUBSYS_31031565&REV_033&115)'
> class = display
> subclass = VGA
> bar [10] = type Memory, range 64, base 0xfe400000, size 4194304, enabled
> bar [18] = type Prefetchable Memory, range 64, base 0xe0000000, size 268435456, enabled
> bar [20] = type I/O Port, range 32, base 0xbc00, size 8, enabled
> vgapci1 at pci0:0:2:1: class=0x038000 card=0x836d1043 chip=0x2e338086 rev=0x03 hdr=0x00
> vendor = 'Intel Corporation'
> class = display
> bar [10] = type Memory, range 64, base 0xfe800000, size 1048576, enabled
> none0 at pci0:0:27:0: class=0x040300 card=0x82fe1043 chip=0x27d88086 rev=0x01 hdr=0x00
> vendor = 'Intel Corporation'
> device = 'IDT High Definition Audio Driver (BA101897)'
> class = multimedia
> subclass = HDA
> bar [10] = type Memory, range 64, base 0xfe3f8000, size 16384, enabled
> pcib1 at pci0:0:28:0: class=0x060400 card=0x81791043 chip=0x27d08086 rev=0x01 hdr=0x01
> vendor = 'Intel Corporation'
> device = '82801G (ICH7 Family) PCIe Root Port'
> class = bridge
> subclass = PCI-PCI
> pcib2 at pci0:0:28:2: class=0x060400 card=0x81791043 chip=0x27d48086 rev=0x01 hdr=0x01
> vendor = 'Intel Corporation'
> device = '82801G (ICH7 Family) PCIe Root Port'
> class = bridge
> subclass = PCI-PCI
> pcib3 at pci0:0:28:3: class=0x060400 card=0x81791043 chip=0x27d68086 rev=0x01 hdr=0x01
> vendor = 'Intel Corporation'
> device = '82801G (ICH7 Family) PCIe Root Port'
> class = bridge
> subclass = PCI-PCI
> uhci0 at pci0:0:29:0: class=0x0c0300 card=0x81791043 chip=0x27c88086 rev=0x01 hdr=0x00
> vendor = 'Intel Corporation'
> device = '82801G (ICH7 Family) USB Universal Host Controller'
> class = serial bus
> subclass = USB
> bar [20] = type I/O Port, range 32, base 0xb400, size 32, enabled
> uhci1 at pci0:0:29:1: class=0x0c0300 card=0x81791043 chip=0x27c98086 rev=0x01 hdr=0x00
> vendor = 'Intel Corporation'
> device = '82801G (ICH7 Family) USB Universal Host Controller'
> class = serial bus
> subclass = USB
> bar [20] = type I/O Port, range 32, base 0xb480, size 32, enabled
> uhci2 at pci0:0:29:2: class=0x0c0300 card=0x81791043 chip=0x27ca8086 rev=0x01 hdr=0x00
> vendor = 'Intel Corporation'
> device = '82801G (ICH7 Family) USB Universal Host Controller'
> class = serial bus
> subclass = USB
> bar [20] = type I/O Port, range 32, base 0xb800, size 32, enabled
> uhci3 at pci0:0:29:3: class=0x0c0300 card=0x81791043 chip=0x27cb8086 rev=0x01 hdr=0x00
> vendor = 'Intel Corporation'
> device = '82801G (ICH7 Family) USB Universal Host Controller'
> class = serial bus
> subclass = USB
> bar [20] = type I/O Port, range 32, base 0xb880, size 32, enabled
> ehci0 at pci0:0:29:7: class=0x0c0320 card=0x81791043 chip=0x27cc8086 rev=0x01 hdr=0x00
> vendor = 'Intel Corporation'
> device = '82801G (ICH7 Family) USB 2.0 Enhanced Host Controller'
> class = serial bus
> subclass = USB
> bar [10] = type Memory, range 32, base 0xfe3f7c00, size 1024, enabled
> pcib4 at pci0:0:30:0: class=0x060401 card=0x81791043 chip=0x244e8086 rev=0xe1 hdr=0x01
> vendor = 'Intel Corporation'
> device = '82801 Family (ICH2/3/4/5/6/7/8/9,63xxESB) Hub Interface to PCI Bridge'
> class = bridge
> subclass = PCI-PCI
> isab0 at pci0:0:31:0: class=0x060100 card=0x81791043 chip=0x27b88086 rev=0x01 hdr=0x00
> vendor = 'Intel Corporation'
> device = 'Intel 82801GB/GR (ICH7 Family) LPC Interface Controller - 27B8 (945GL)'
> class = bridge
> subclass = PCI-ISA
> atapci0 at pci0:0:31:1: class=0x01018a card=0x81791043 chip=0x27df8086 rev=0x01 hdr=0x00
> vendor = 'Intel Corporation'
> device = '82801G (ICH7 Family) Ultra ATA Storage Controller'
> class = mass storage
> subclass = ATA
> bar [10] = type I/O Port, range 32, base 0x1f0, size 8, enabled
> bar [14] = type I/O Port, range 32, base 0x3f4, size 1, enabled
> bar [18] = type I/O Port, range 32, base 0x170, size 8, enabled
> bar [1c] = type I/O Port, range 32, base 0x374, size 1, enabled
> bar [20] = type I/O Port, range 32, base 0xffa0, size 16, enabled
> atapci1 at pci0:0:31:2: class=0x01018f card=0x81791043 chip=0x27c08086 rev=0x01 hdr=0x00
> vendor = 'Intel Corporation'
> device = '82801GB/GR/GH (ICH7 Family) Serial ATA Storage Controller'
> class = mass storage
> subclass = ATA
> bar [10] = type I/O Port, range 32, base 0xb080, size 8, enabled
> bar [14] = type I/O Port, range 32, base 0xb000, size 4, enabled
> bar [18] = type I/O Port, range 32, base 0xac00, size 8, enabled
> bar [1c] = type I/O Port, range 32, base 0xa880, size 4, enabled
> bar [20] = type I/O Port, range 32, base 0xa800, size 16, enabled
> none1 at pci0:0:31:3: class=0x0c0500 card=0x81791043 chip=0x27da8086 rev=0x01 hdr=0x00
> vendor = 'Intel Corporation'
> device = 'Intel[R] 82801G (ICH7 Family) C- 27DA (82801G)'
> class = serial bus
> subclass = SMBus
> bar [20] = type I/O Port, range 32, base 0x400, size 32, enabled
> em0 at pci0:3:0:0: class=0x020000 card=0xa01f8086 chip=0x10d38086 rev=0x00 hdr=0x00
> vendor = 'Intel Corporation'
> device = 'Intel 82574L Gigabit Ethernet Controller (82574L)'
> class = network
> subclass = ethernet
> bar [10] = type Memory, range 32, base 0xfebe0000, size 131072, enabled
> bar [14] = type Memory, range 32, base 0xfeb00000, size 524288, enabled
> bar [18] = type I/O Port, range 32, base 0xec00, size 32, enabled
> bar [1c] = type Memory, range 32, base 0xfebdc000, size 16384, enabled
> re0 at pci0:2:0:0: class=0x020000 card=0x82c61043 chip=0x816810ec rev=0x02 hdr=0x00
> vendor = 'Realtek Semiconductor'
> device = 'Gigabit Ethernet NIC(NDIS 6.0) (RTL8168/8111/8111c)'
> class = network
> subclass = ethernet
> bar [10] = type I/O Port, range 32, base 0xd800, size 256, enabled
> bar [18] = type Prefetchable Memory, range 64, base 0xfdfff000, size 4096, enabled
> bar [20] = type Prefetchable Memory, range 64, base 0xfdfe0000, size 65536, enabled
> none2 at pci0:1:0:0: class=0x0c0010 card=0x34011106 chip=0x34011106 rev=0x00 hdr=0x00
> vendor = 'VIA Technologies, Inc.'
> class = serial bus
> subclass = FireWire
> bar [10] = type Memory, range 64, base 0xfe9fe800, size 2048, enabled
> bar [18] = type I/O Port, range 32, base 0xc800, size 256, enabled
> none3 at pci0:1:0:1: class=0x018000 card=0x401a1106 chip=0x401a1106 rev=0x00 hdr=0x00
> vendor = 'VIA Technologies, Inc.'
> class = mass storage
> bar [10] = type Memory, range 64, base 0xfe9ff000, size 2048, enabled
> none4 at pci0:1:0:2: class=0x080501 card=0x401b1106 chip=0x401b1106 rev=0x00 hdr=0x00
> vendor = 'VIA Technologies, Inc.'
> class = base peripheral
> subclass = SD host controller
> bar [10] = type Memory, range 64, base 0xfe9ffc00, size 256, enabled
>
> Copyright (c) 1992-2010 The FreeBSD Project.
> Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
> The Regents of the University of California. All rights reserved.
> FreeBSD is a registered trademark of The FreeBSD Foundation.
> FreeBSD 8.1-PRERELEASE #30: Thu Jul 15 12:58:20 UTC 2010
> root at lokschuppen.zs64.net:/usr/obj/usr/src/sys/EISENBOOT amd64
> Timecounter "i8254" frequency 1193182 Hz quality 0
> CPU: Intel(R) Core(TM)2 Duo CPU E7300 @ 2.66GHz (2666.65-MHz K8-class CPU)
> Origin = "GenuineIntel" Id = 0x10676 Family = 6 Model = 17 Stepping = 6
> Features=0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE>
> Features2=0x8e39d<SSE3,DTES64,MON,DS_CPL,EST,TM2,SSSE3,CX16,xTPR,PDCM,SSE4.1>
> AMD Features=0x20100800<SYSCALL,NX,LM>
> AMD Features2=0x1<LAHF>
> TSC: P-state invariant
> real memory = 4294967296 (4096 MB)
> avail memory = 4080877568 (3891 MB)
> ACPI APIC Table: <A_M_I_ OEMAPIC >
> FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
> FreeBSD/SMP: 1 package(s) x 2 core(s)
> cpu0 (BSP): APIC ID: 0
> cpu1 (AP): APIC ID: 1
> ioapic0 <Version 2.0> irqs 0-23 on motherboard
> kbd1 at kbdmux0
> acpi0: <A_M_I_ OEMXSDT> on motherboard
> acpi0: [ITHREAD]
> acpi0: Power Button (fixed)
> acpi0: reservation of fed1c000, 4000 (3) failed
> acpi0: reservation of fed20000, 70000 (3) failed
> acpi0: reservation of ffc00000, 300000 (3) failed
> acpi0: reservation of fec00000, 1000 (3) failed
> acpi0: reservation of fee00000, 1000 (3) failed
> acpi0: reservation of f0000000, 4000000 (3) failed
> acpi0: reservation of 0, a0000 (3) failed
> acpi0: reservation of 100000, ddd00000 (3) failed
> Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
> acpi_timer0: <24-bit timer at 3.579545MHz> port 0x808-0x80b on acpi0
> cpu0: <ACPI CPU> on acpi0
> ACPI Warning: Incorrect checksum in table [OEMB] - 0xCC, should be 0xCB (20100331/tbutils-354)
> cpu1: <ACPI CPU> on acpi0
> acpi_hpet0: <High Precision Event Timer> iomem 0xfed00000-0xfed003ff on acpi0
> Timecounter "HPET" frequency 14318180 Hz quality 900
> pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0
> pci0: <ACPI PCI bus> on pcib0
> vgapci0: <VGA-compatible display> port 0xbc00-0xbc07 mem 0xfe400000-0xfe7fffff,0xe0000000-0xefffffff irq 16 at device 2.0 on pci0
> agp0: <Intel G41 SVGA controller> on vgapci0
> agp0: detected 32764k stolen memory
> agp0: aperture size is 256M
> vgapci1: <VGA-compatible display> mem 0xfe800000-0xfe8fffff at device 2.1 on pci0
> pci0: <multimedia, HDA> at device 27.0 (no driver attached)
> pcib1: <ACPI PCI-PCI bridge> irq 16 at device 28.0 on pci0
> pci3: <ACPI PCI bus> on pcib1
> em0: <Intel(R) PRO/1000 Network Connection 7.0.5> port 0xec00-0xec1f mem 0xfebe0000-0xfebfffff,0xfeb00000-0xfeb7ffff,0xfebdc000-0xfebdffff irq 16 at device 0.0 on pci3
> em0: Using MSI interrupt
> em0: [FILTER]
> em0: Ethernet address: 00:1b:21:50:0b:f0
> pcib2: <ACPI PCI-PCI bridge> irq 18 at device 28.2 on pci0
> pci2: <ACPI PCI bus> on pcib2
> re0: <RealTek 8168/8111 B/C/CP/D/DP/E PCIe Gigabit Ethernet> port 0xd800-0xd8ff mem 0xfdfff000-0xfdffffff,0xfdfe0000-0xfdfeffff irq 18 at device 0.0 on pci2
> re0: Using 1 MSI messages
> re0: Chip rev. 0x3c000000
> re0: MAC rev. 0x00400000
> miibus0: <MII bus> on re0
> rgephy0: <RTL8169S/8110S/8211B media interface> PHY 1 on miibus0
> rgephy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
> re0: Ethernet address: 00:26:18:d5:2c:23
> re0: [FILTER]
> pcib3: <ACPI PCI-PCI bridge> irq 19 at device 28.3 on pci0
> pci1: <ACPI PCI bus> on pcib3
> pci1: <serial bus, FireWire> at device 0.0 (no driver attached)
> pci1: <mass storage> at device 0.1 (no driver attached)
> pci1: <base peripheral, SD host controller> at device 0.2 (no driver attached)
> uhci0: <Intel 82801G (ICH7) USB controller USB-A> port 0xb400-0xb41f irq 23 at device 29.0 on pci0
> uhci0: [ITHREAD]
> uhci0: LegSup = 0x2f00
> usbus0: <Intel 82801G (ICH7) USB controller USB-A> on uhci0
> uhci1: <Intel 82801G (ICH7) USB controller USB-B> port 0xb480-0xb49f irq 19 at device 29.1 on pci0
> uhci1: [ITHREAD]
> uhci1: LegSup = 0x2f00
> usbus1: <Intel 82801G (ICH7) USB controller USB-B> on uhci1
> uhci2: <Intel 82801G (ICH7) USB controller USB-C> port 0xb800-0xb81f irq 18 at device 29.2 on pci0
> uhci2: [ITHREAD]
> uhci2: LegSup = 0x2f00
> usbus2: <Intel 82801G (ICH7) USB controller USB-C> on uhci2
> uhci3: <Intel 82801G (ICH7) USB controller USB-D> port 0xb880-0xb89f irq 16 at device 29.3 on pci0
> uhci3: [ITHREAD]
> uhci3: LegSup = 0x2f00
> usbus3: <Intel 82801G (ICH7) USB controller USB-D> on uhci3
> ehci0: <Intel 82801GB/R (ICH7) USB 2.0 controller> mem 0xfe3f7c00-0xfe3f7fff irq 23 at device 29.7 on pci0
> ehci0: [ITHREAD]
> usbus4: EHCI version 1.0
> usbus4: <Intel 82801GB/R (ICH7) USB 2.0 controller> on ehci0
> pcib4: <ACPI PCI-PCI bridge> at device 30.0 on pci0
> pci4: <ACPI PCI bus> on pcib4
> isab0: <PCI-ISA bridge> at device 31.0 on pci0
> isa0: <ISA bus> on isab0
> atapci0: <Intel ICH7 UDMA100 controller> port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xffa0-0xffaf at device 31.1 on pci0
> ata0: <ATA channel 0> on atapci0
> ata0: [ITHREAD]
> atapci1: <Intel ICH7 SATA300 controller> port 0xb080-0xb087,0xb000-0xb003,0xac00-0xac07,0xa880-0xa883,0xa800-0xa80f irq 19 at device 31.2 on pci0
> atapci1: [ITHREAD]
> ata2: <ATA channel 0> on atapci1
> ata2: [ITHREAD]
> ata3: <ATA channel 1> on atapci1
> ata3: [ITHREAD]
> pci0: <serial bus, SMBus> at device 31.3 (no driver attached)
> acpi_button0: <Power Button> on acpi0
> atrtc0: <AT realtime clock> port 0x70-0x71 irq 8 on acpi0
> uart0: <16550 or compatible> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
> uart0: [FILTER]
> uart0: console (115200,n,8,1)
> orm0: <ISA Option ROM> at iomem 0xcc800-0xcd7ff on isa0
> sc0: <System console> at flags 0x100 on isa0
> sc0: VGA <16 virtual consoles, flags=0x300>
> vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
> atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
> atkbd0: <AT Keyboard> irq 1 on atkbdc0
> kbd0 at atkbd0
> atkbd0: [GIANT-LOCKED]
> atkbd0: [ITHREAD]
> ppc0: cannot reserve I/O port range
> coretemp0: <CPU On-Die Thermal Sensors> on cpu0
> est0: <Enhanced SpeedStep Frequency Control> on cpu0
> p4tcc0: <CPU Frequency Thermal Control> on cpu0
> coretemp1: <CPU On-Die Thermal Sensors> on cpu1
> est1: <Enhanced SpeedStep Frequency Control> on cpu1
> p4tcc1: <CPU Frequency Thermal Control> on cpu1
> ZFS filesystem version 3
> ZFS storage pool version 14
> Timecounters tick every 1.000 msec
> usbus0: 12Mbps Full Speed USB v1.0
> usbus1: 12Mbps Full Speed USB v1.0
> usbus2: 12Mbps Full Speed USB v1.0
> usbus3: 12Mbps Full Speed USB v1.0
> usbus4: 480Mbps High Speed USB v2.0
> ad4: 953869MB <SAMSUNG HD103UJ 1AA01113> at ata2-master UDMA100 SATA
> ugen0.1: <Intel> at usbus0
> uhub0: <Intel UHCI root HUB, class 9/0, rev 1.00/1.00, addr 1> on usbus0
> ugen1.1: <Intel> at usbus1
> uhub1: <Intel UHCI root HUB, class 9/0, rev 1.00/1.00, addr 1> on usbus1
> ugen2.1: <Intel> at usbus2
> uhub2: <Intel UHCI root HUB, class 9/0, rev 1.00/1.00, addr 1> on usbus2
> ugen3.1: <Intel> at usbus3
> uhub3: <Intel UHCI root HUB, class 9/0, rev 1.00/1.00, addr 1> on usbus3
> ugen4.1: <Intel> at usbus4
> uhub4: <Intel EHCI root HUB, class 9/0, rev 2.00/1.00, addr 1> on usbus4
> SMP: AP CPU #1 Launched!
> Root mount waiting for: usbus4 usbus3 usbus2 usbus1 usbus0
> uhub0: 2 ports with 2 removable, self powered
> uhub1: 2 ports with 2 removable, self powered
> uhub2: 2 ports with 2 removable, self powered
> uhub3: 2 ports with 2 removable, self powered
> Root mount waiting for: usbus4
> Root mount waiting for: usbus4
> Root mount waiting for: usbus4
> uhub4: 8 ports with 8 removable, self powered
> Trying to mount root from ufs:/dev/ad4s1a
> ugen0.2: <ftdi> at usbus0
> uftdi0: <usb serial converter> on usbus0
This dmesg + pciconf -lvc doesn't show any signs of ichwd in use.
I can confirm that ichwd(4) works fine on the following system types:
- Supermicro SuperServer 5015M-T+ (Intel ICH7R-based)
- Supermicro SuperServer 5015B-MT (Intel ICH9R-based)
- Supermicro X7SBA motherboard (Intel ICH9R-based)
- Supermicro X7SBL-LN2 motherboard (Intel ICH9R-based)
The Asus P5G41 looks like a workstation-class board[1]. I wouldn't be
surprised if certain configuration bits aren't being set by the system
BIOS or during the manufacturing process by Asus for this reason.
All in all, I really wouldn't worry about ichwd(4) not working for you.
If your system randomly locks up, you try to investigate the root cause
and solve it. Hardware watchdogs are usually a server-focused feature.
Regarding SW_WATCHDOG, it doesn't work on SMP systems, which yours is.
So don't bother.
[1]: http://commercial.asus.com/product/detail/18
--
| Jeremy Chadwick jdc at parodius.com |
| Parodius Networking http://www.parodius.com/ |
| UNIX Systems Administrator Mountain View, CA, USA |
| Making life hard for others since 1977. PGP: 4BD6C0CB |
More information about the freebsd-stable
mailing list