System Freezing -Again

Grant Peel gpeel at thenetnow.com
Sat Mar 11 02:12:58 UTC 2006


Hi all, 

As mentioned in the recent past, I am getting lockups on my Dell PE 1850 w/ FreeBSD 6.0.

Dmesg says ipv6 is initialized in the ipfw daemon... not sure if that is OK since the kernel is compiled without ipv6/

Also, just befor this last crasgh (today) I see the following in /var/log/messages....is this normal? (ipfw entries):

Mar 10 09:29:39 s1 kernel: 6:110 216.221.88.85:49447 out via em0
Mar 10 09:33:16 s1 kernel: em0
Mar 10 10:03:13 s1 kernel: 5:110 70.48.38.90:1305 out via em0
Mar 10 10:10:56 s1 kernel: .34.137.45:25 in via em0
Mar 10 10:10:56 s1 kernel: 81.25:58023 out via em0
Mar 10 10:10:56 s1 kernel: 34.137.45:25 in via em0
Mar 10 10:10:56 s1 kernel: 34.137.45:25 in via em0
Mar 10 10:10:56 s1 kernel: .34.137.45:25 in via em0
Mar 10 10:16:36 s1 kernel: <<110>ipfw: 50020 Accept TCP 209.226.175.185:63728 64.34.137.45:25 in via em0
Mar 10 10:16:36 s1 kernel: 110>ipfw: 50020 Accept TCP 64.34.137.45:25 209.226.175.185:63728 out via em0
Mar 10 10:20:04 s1 kernel:
Mar 10 10:36:42 s1 kernel: .25:63415 out via em0
Mar 10 10:36:42 s1 kernel: 5
Mar 10 10:36:42 s1 kernel: 63415 out via em0
Mar 10 10:36:42 s1 kernel:
Mar 10 10:36:42 s1 kernel: 7.45:25 in via em0
Mar 10 10:36:42 s1 kernel: 37.45:25 in via em0
Mar 10 10:36:42 s1 kernel:
Mar 10 11:20:47 s1 kernel:
Mar 10 11:24:51 s1 kernel: 50020 Accept TCP 64.34.137.45:110 70.48.38.90:1729 out via em0
Mar 10 11:36:14 s1 kernel: ipfw: 50020 Accept TCP 64.34.137.47:110 216.9.250.224:57245 out via em0
Mar 10 11:48:15 s1 kernel: 137.45:110 70.48.38.90:1585 out via em0


Also, I have the atkbd disabled so I can take advantage of the DRAC console, it needs ukbd, to connect. If the console (ukbd) dissconnects, is it possible the system console is disconnecting such that the machine can't even talk to itself?

As mentioned before, there is nothing anywhere in any log file that indicates any faults. Also, there is nothing in the sustems SEL fhat indicate any issues.

Immediately after today's crash, I had the NOC tech check the machine. The physical state, all LEDs, CPU fan, motherboard fan, power supply etc.. appeared to be that of a running machine. Even the NICs werer connected and blinking.

Perhaps I should reinstall the GENERIC kernel?

-Grant

----- Original Message ----- 
From: Grant Peel 
To: freebsd-questions at freebsd.org ; Derek Ragona 
Sent: Thursday, March 09, 2006 6:28 PM
Subject: Re: System Freezing -Again


Sorry Derek, I did forget the dmesg ... here it is ...

Also, Do you think the "^@" s not a hint to something?

root on s1# more dmesg.boot
Copyright (c) 1992-2005 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
        The Regents of the University of California. All rights reserved.
FreeBSD 6.0-RELEASE #1: Wed Jan 25 09:15:04 EST 2006
    gpeel at s1.fpm3.com:/usr/src/sys/i386/compile/DS9
MPTable: <DELL     PE 016C     >
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: Intel(R) Xeon(TM) CPU 3.00GHz (2992.71-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0xf43  Stepping = 3
  Features=0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SS
E2,SS,HTT,TM,PBE>
  Features2=0x641d<SSE3,RSVD2,MON,DS_CPL,CNTX-ID,CX16,<b14>>
  AMD Features=0x20100000<NX,LM>
  Hyperthreading: 2 logical CPUs
real memory  = 536608768 (511 MB)
avail memory = 515801088 (491 MB)
ioapic0: Changing APIC ID to 2
ioapic0: Assuming intbase of 0
ioapic1: Changing APIC ID to 3
ioapic1: Assuming intbase of 24
ioapic2: Changing APIC ID to 4
ioapic2: Assuming intbase of 48
ioapic0 <Version 2.0> irqs 0-23 on motherboard
ioapic1 <Version 2.0> irqs 24-47 on motherboard
ioapic2 <Version 2.0> irqs 48-71 on motherboard
npx0: [FAST]
npx0: <math processor> on motherboard
npx0: INT 16 interface
cpu0 on motherboard
pcib0: <MPTable Host-PCI bridge> pcibus 0 on motherboard
pci0: <PCI bus> on pcib0
pcib1: <PCI-PCI bridge> at device 2.0 on pci0
pci1: <PCI bus> on pcib1
pcib2: <MPTable PCI-PCI bridge> at device 0.0 on pci1
pci2: <PCI bus> on pcib2
mpt0: <LSILogic 1030 Ultra4 Adapter> port 0xec00-0xecff mem 0xdfdf0000-0xdfdfffff,0xdfde0000-0xdfdeffff irq 26 at device 5.0 o
n pci2
mpt0: [GIANT-LOCKED]
mpt0: MPI Version=1.2.12.0
mpt0: Unhandled Event Notify Frame. Event 0xa.
pcib3: <MPTable PCI-PCI bridge> at device 0.2 on pci1
pci3: <PCI bus> on pcib3
pcib4: <PCI-PCI bridge> at device 4.0 on pci0
pci4: <PCI bus> on pcib4
pcib5: <PCI-PCI bridge> at device 5.0 on pci0
pci5: <PCI bus> on pcib5
pcib6: <MPTable PCI-PCI bridge> at device 0.0 on pci5
pci6: <PCI bus> on pcib6
em0: <Intel(R) PRO/1000 Network Connection, Version - 2.1.7> port 0xdcc0-0xdcff mem 0xdfae0000-0xdfafffff irq 48 at device 7.0
 on pci6
em0: Ethernet address: 00:14:22:1c:d5:7e
em0:  Speed:N/A  Duplex:N/A
pcib7: <MPTable PCI-PCI bridge> at device 0.2 on pci5
pci7: <PCI bus> on pcib7
em1: <Intel(R) PRO/1000 Network Connection, Version - 2.1.7> port 0xccc0-0xccff mem 0xdf8e0000-0xdf8fffff irq 49 at device 8.0
 on pci7
em1: Ethernet address: 00:14:22:1c:d5:7f
em1:  Speed:N/A  Duplex:N/A
pcib8: <PCI-PCI bridge> at device 6.0 on pci0
pci8: <PCI bus> on pcib8
uhci0: <Intel 82801EB (ICH5) USB controller USB-A> port 0xace0-0xacff irq 16 at device 29.0 on pci0
uhci0: [GIANT-LOCKED]
usb0: <Intel 82801EB (ICH5) USB controller USB-A> on uhci0
usb0: USB revision 1.0
uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
uhci1: <Intel 82801EB (ICH5) USB controller USB-B> port 0xacc0-0xacdf irq 19 at device 29.1 on pci0
uhci1: [GIANT-LOCKED]
usb1: <Intel 82801EB (ICH5) USB controller USB-B> on uhci1
usb1: USB revision 1.0
uhub1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
uhci2: <Intel 82801EB (ICH5) USB controller USB-C> port 0xaca0-0xacbf irq 18 at device 29.2 on pci0
uhci2: [GIANT-LOCKED]
usb2: <Intel 82801EB (ICH5) USB controller USB-C> on uhci2
usb2: USB revision 1.0
uhub2: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub2: 2 ports with 2 removable, self powered
ehci0: <EHCI (generic) USB 2.0 controller> mem 0xdff00000-0xdff003ff irq 23 at device 29.7 on pci0
ehci0: [GIANT-LOCKED]
usb3: EHCI version 1.0
usb3: companion controllers, 2 ports each: usb0 usb1 usb2
usb3: <EHCI (generic) USB 2.0 controller> on ehci0
usb3: USB revision 2.0
uhub3: Intel EHCI root hub, class 9/0, rev 2.00/1.00, addr 1
uhub3: 6 ports with 6 removable, self powered
uhub4: vendor 0x413c product 0xa001, class 9/0, rev 2.00/0.00, addr 2
uhub4: multiple transaction translators
uhub4: 2 ports with 2 removable, self powered
pcib9: <MPTable PCI-PCI bridge> at device 30.0 on pci0
pci9: <PCI bus> on pcib9
pci9: <unknown> at device 5.0 (no driver attached)
pci9: <unknown> at device 5.1 (no driver attached)
pci9: <unknown> at device 5.2 (no driver attached)
atapci0: <SiI 0680 UDMA133 controller> port 0xbcf0-0xbcf7,0xbce4-0xbce7,0xbcd8-0xbcdf,0xbcd0-0xbcd3,0xbc70-0xbc7f mem 0xdf5fec
00-0xdf5fecff irq 23 at device 6.0 on pci9
ata2: <ATA channel 0> on atapci0
ata3: <ATA channel 1> on atapci0
pci9: <display, VGA> at device 13.0 (no driver attached)
isab0: <PCI-ISA bridge> at device 31.0 on pci0
isa0: <ISA bus> on isab0
atapci1: <Intel ICH5 UDMA100 controller> port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xfc00-0xfc0f at device 31.1 on pci0
ata0: <ATA channel 0> on atapci1
ata1: <ATA channel 1> on atapci1
pmtimer0 on isa0
orm0: <ISA Option ROMs> at iomem 0xc0000-0xcafff,0xcb000-0xcefff,0xec000-0xeffff on isa0
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
ppc0: parallel port not found.
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A
sio1: configured irq 3 not in bitmap of probed irqs 0
sio1: port may not be enabled
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
unknown: <PNP0c02> can't assign resources (memory)
unknown: <PNP0501> can't assign resources (port)
unknown: <PNP0303> can't assign resources (port)
unknown: <PNP0c02> can't assign resources (port)
unknown: <PNP0c01> can't assign resources (port)
ukbd0: Dell DRAC4, rev 1.10/0.00, addr 2, iclass 3/1
kbd0 at ukbd0
Timecounter "TSC" frequency 2992710990 Hz quality 800
Timecounters tick every 1.000 msec
acd0: CDROM <TEAC CD-ROM CD-224E-N/3.AB> at ata0-master UDMA33
device_attach: afd0 attach returned 6
acd1: CDROM <VIRTUALCDROM DRIVE/> at ata2-slave PIO3
Waiting 5 seconds for SCSI devices to settle
ses0 at mpt0 bus 0 target 6 lun 0
ses0: <PE/PV 1x2 SCSI BP 1.0> Fixed Processor SCSI-2 device
ses0: 3.300MB/s transfers
ses0: SAF-TE Compliant Device
da0 at mpt0 bus 0 target 0 lun 0
da0: <SEAGATE ST373207LC D701> Fixed Direct Access SCSI-3 device
da0: 320.000MB/s transfers (160.000MHz, offset 63, 16bit), Tagged Queueing Enabled
da0: 70007MB (143374650 512 byte sectors: 255H 63S/T 8924C)
Trying to mount root from ufs:/dev/da0s1a
WARNING: / was not properly dismounted
WARNING: /home was not properly dismounted
WARNING: /usr was not properly dismounted
WARNING: /var was not properly dismounted
ipfw2 (+ipv6) initialized, divert loadable, rule-based forwarding disabled, default to deny, logging disabled
root on s1#
  ----- Original Message ----- 
  From: Derek Ragona 
  To: Grant Peel ; freebsd-questions at freebsd.org 
  Sent: Thursday, March 09, 2006 6:19 PM
  Subject: Re: System Freezing -Again


  A freeze like you describe, no logging sounds like a hardware problem.  You didn't include a current dmesg or other system inventory.  You may have shared irq's causing the problem, or a power supply, or a management board.

  Those are the three things I would look at.

          -Derek


  At 05:00 PM 3/9/2006, Grant Peel wrote:

    Hi all,

    I have been pestering everyone who will listen on this, and have yet to find a solution ... so here goes again!

    I have a FreeBSD system on a Dell PE 1850 that has been randomly freezing up from time to time in the past two weeks.

    Firewall is enabled, but is very solid as I am using the same rules on several other machines.

    I spent last Saturday and Sunday at the NOC, running Dell 32 bit diagnostics on it, along with memtest32, no errors found over many hours of testing.

    NO heat of note from the Power supply CPU or system fan.

    Nothing ever in log files, no core dumps. I have the DUMDEV="AUTO" set in rc.conf, but ran dumpon -v /dev/da0s1b today, the output showed it should be dumping to my swap partition.

    APIC is disabled now, but has been enabled in the past, with the same results. POwer management is shut off in the bios.

    No System Events are being recorded in BIOS, and, the logs were harvested last week with no abnormalitied showing.

    The only item of note, that I found after todays freeze, was in /var/log/maillog, and /var/log/exim/mainlog,   about 15 lines of ^@ recorded at about the exact tine of the freeze. Does anyone think this is significant, or is it simply a symptom of the crash?

    ANY help will be greatly appreciated.

    -GRant 

    _______________________________________________
    freebsd-questions at freebsd.org mailing list
    http://lists.freebsd.org/mailman/listinfo/freebsd-questions
    To unsubscribe, send any mail to "freebsd-questions-unsubscribe at freebsd.org"


More information about the freebsd-questions mailing list