FreeBSD 6.x CVSUP today crashes with zero load ...

Marc G. Fournier scrappy at hub.org
Sun Jun 25 02:12:27 UTC 2006


On Sat, 24 Jun 2006, Nate Lawson wrote:

> Marc G. Fournier wrote:
>> On Sat, 24 Jun 2006, Marc G. Fournier wrote:
>> 
>>> 
>>> 'k, looks like I'm going to have to back this out ... just upgraded 
>>> another server to 6.x, CVSup latest -STABLE, built, installed, rebooted 
>>> ... up fine ...
>>> 
>>> Running a single 'rsync' to copy files from another server over, it has 
>>> crashed twice in a row so far ...
>>> 
>>> I'm enabling dumpdev right now, and will see if I can a core dump out of 
>>> it, but, so far, there is nothing being reported in /var/log/messages to 
>>> indicate a problem ...
>>> 
>>> Does anyone know of any problems with current source tree that I should 
>>> avoid?  And, if so, can someone recommend a "stable date" to CVSup in and 
>>> try?  This server isn't production yet, and I'm not panic'd right now to 
>>> make it so (basically, I've got a couple of days if I need it) ...
>> 
>> 
>> Just found this in my /var/log/messages file after the last reboot to 
>> enable savecore/dumpdev:
>> 
>> Jun 25 00:19:59 jupiter kernel: ACPI-0356: *** Error: Region SystemIO(1) 
>> has no handler
>> Jun 25 00:19:59 jupiter kernel: ACPI-1304: *** Error: Method execution 
>> failed [\_SB_.LN02._STA] (Node 0xc9071920), AE_NOT_EXIST
>> Jun 25 00:19:59 jupiter kernel: ACPI-0239: *** Error: Method execution 
>> failed [\_SB_.LN02._STA] (Node 0xc9071920), AE_NOT_EXIST
>> 
>> For those on the -acpi list, this machine is an Intel Dual-PIII motherboard 
>> ...
>
> What changes if acpi is disabled?  Are you running a custom AML?

'k, this server is, unfortunately, a remote server, so disabling ACPI 
isn't something I can easily do ... :(  At least not until Monday ...

re: custom AML ... stupid question, but what is an AML? :(

> The SystemIO thing seems troubling.  Is there an earlier message that 
> explains why?

Nothing:

Jun 25 00:19:24 jupiter kernel: Trying to mount root from ufs:/dev/da0s1a
Jun 25 00:19:24 jupiter savecore: no dumps found
Jun 25 00:19:24 jupiter named[381]: starting BIND 9.3.2 -t /var/named -u bind
Jun 25 00:19:24 jupiter named[381]: command channel listening on 127.0.0.1#953
Jun 25 00:19:24 jupiter named[381]: zone 0.0.127.IN-ADDR.ARPA/IN: loading master file master/localhost.rev: file not found
Jun 25 00:19:24 jupiter named[381]: zone 1.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.IP6.ARPA/IN: loading master file master/localhost-v6.rev: file not found
Jun 25 00:19:24 jupiter named[381]: running
Jun 25 00:19:24 jupiter kernel: fxp0: promiscuous mode enabled
Jun 25 00:19:59 jupiter kernel: ACPI-0356: *** Error: Region SystemIO(1) has no handler
Jun 25 00:19:59 jupiter kernel: ACPI-1304: *** Error: Method execution failed [\_SB_.LN02._STA] (Node 0xc9071920), AE_NOT_EXIST
Jun 25 00:19:59 jupiter kernel: ACPI-0239: *** Error: Method execution failed [\_SB_.LN02._STA] (Node 0xc9071920), AE_NOT_EXIST

> These are new messages?

Most definitely ... but, I've attached my dmesg.boot, which includes a 
bunch of ACPI Warnings ... from searching Google, it was my understanding 
that they are/were benign though ... I get the same Warnings on my other 
Dual-PIII box, but its a duplicate of this one, so that is to be expected 
...

> Have you passed the memtest x86 CD?

Just as an appendum to this report ... I have rebooted the server using 
the 6.1-RC1 kernel that was originally installed, before I upgraded to 
-STABLE, and am now running both an rsync that *so far* hasn't giving me 
any SegFaults, and am running a 'make buildworld' with a RELENG_6_1 source 
tree, that also hasn't cause any problems ... in fact, tail of my 
/var/log/messages shows:

# tail /var/log/messages 
Jun 25 01:23:27 jupiter named[391]: zone 1.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.IP6.ARPA/IN: loading master file master/localhost-v6.rev: file not found
Jun 25 01:23:27 jupiter named[391]: running
Jun 25 01:23:27 jupiter kernel: fxp0: promiscuous mode enabled
Jun 25 01:24:44 jupiter fsck: /dev/da0s1d: 11 files, 7 used, 63344 free (40 frags, 7913 blocks, 0.1% fragmentation)
Jun 25 01:25:47 jupiter fsck: /dev/da0s1e: 191421 files, 833872 used, 688631 free (18623 frags, 83751 blocks, 1.2% fragmentation)
Jun 25 01:25:48 jupiter fsck: /dev/da0s1f: Reclaimed: 0 directories, -1 files, -1 fragments
Jun 25 01:25:48 jupiter fsck: /dev/da0s1f: 217 files, 817 used, 62535 free (71 frags, 7808 blocks, 0.1% fragmentation)
Jun 25 01:31:12 jupiter fsck: /dev/da0s1g: 429378 files, 3425842 used, 60096474 free (6314 frags, 7511270 blocks, 0.0% fragmentation)
Jun 25 02:00:00 jupiter kernel: fxp0: promiscuous mode disabled
Jun 25 02:00:00 jupiter kernel: fxp0: promiscuous mode enabled
jupiter# uptime
  2:06AM  up 43 mins, 3 users, load averages: 1.38, 1.36, 1.22
jupiter#

With -STABLE of today, I couldn't even run a complete cvsup without it 
giving me an error:

--------
  Checkout src/sys/kern/uipc_socket2.c


***
*** runtime error:
***    Attempt to dereference NIL
***    file "/vm/ports/usr/ports/lang/ezm3/work/ezm3-1.2/libs/libm3/src/rw/Common/RdImpl.m3", line 39
***

   use option @M3stackdump to get a stack trace
-----------

----
Marc G. Fournier           Hub.Org Networking Services (http://www.hub.org)
Email . scrappy at hub.org                              MSN . scrappy at hub.org
Yahoo . yscrappy               Skype: hub.org        ICQ . 7615664
-------------- next part --------------
Copyright (c) 1992-2006 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
	The Regents of the University of California. All rights reserved.
FreeBSD 6.1-RC1 #0: Mon Apr 10 17:03:22 UTC 2006
    root at opus.cse.buffalo.edu:/usr/obj/usr/src/sys/SMP
acpi_alloc_wakeup_handler: can't alloc wake memory
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: Intel(R) Pentium(R) III CPU - S         1266MHz (1263.45-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0x6b4  Stepping = 4
  Features=0x383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE>
real memory  = 4227792896 (4031 MB)
avail memory = 4140093440 (3948 MB)
ACPI APIC Table: <INTEL  SCB20   >
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
 cpu0 (BSP): APIC ID:  3
 cpu1 (AP): APIC ID:  0
    ACPI-0698: *** Warning: Type override - [DEB_] had invalid type (Integer) for Scope operator, changed to (Scope)
    ACPI-0698: *** Warning: Type override - [MLIB] had invalid type (Integer) for Scope operator, changed to (Scope)
    ACPI-0698: *** Warning: Type override - [DATA] had invalid type (String) for Scope operator, changed to (Scope)
    ACPI-0698: *** Warning: Type override - [SIO_] had invalid type (String) for Scope operator, changed to (Scope)
    ACPI-0698: *** Warning: Type override - [SB__] had invalid type (String) for Scope operator, changed to (Scope)
    ACPI-0698: *** Warning: Type override - [PM__] had invalid type (String) for Scope operator, changed to (Scope)
    ACPI-0698: *** Warning: Type override - [ICNT] had invalid type (String) for Scope operator, changed to (Scope)
    ACPI-0698: *** Warning: Type override - [ACPI] had invalid type (String) for Scope operator, changed to (Scope)
    ACPI-0698: *** Warning: Type override - [LEDP] had invalid type (String) for Scope operator, changed to (Scope)
    ACPI-0698: *** Warning: Type override - [WUES] had invalid type (String) for Scope operator, changed to (Scope)
    ACPI-0698: *** Warning: Type override - [WUSE] had invalid type (String) for Scope operator, changed to (Scope)
    ACPI-0698: *** Warning: Type override - [CSB5] had invalid type (String) for Scope operator, changed to (Scope)
    ACPI-0698: *** Warning: Type override - [PM__] had invalid type (String) for Scope operator, changed to (Scope)
    ACPI-0698: *** Warning: Type override - [BIOS] had invalid type (Integer) for Scope operator, changed to (Scope)
    ACPI-0698: *** Warning: Type override - [CMOS] had invalid type (Integer) for Scope operator, changed to (Scope)
MADT: Forcing active-low polarity and level trigger for SCI
ioapic0 <Version 1.1> irqs 0-15 on motherboard
ioapic1 <Version 1.1> irqs 16-31 on motherboard
lapic3: Forcing LINT1 to edge trigger
kbd1 at kbdmux0
npx0: [FAST]
npx0: <math processor> on motherboard
npx0: INT 16 interface
acpi0: <INTEL SCB20> on motherboard
acpi0: Power Button (fixed)
Timecounter "ACPI-safe" frequency 3579545 Hz quality 1000
acpi_timer0: <32-bit timer at 3.579545MHz> port 0x508-0x50b on acpi0
cpu0: <ACPI CPU> on acpi0
cpu1: <ACPI CPU> on acpi0
pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0
pci0: <ACPI PCI bus> on pcib0
fxp0: <Intel 82550 Pro/100 Ethernet> port 0x1400-0x143f mem 0xfeae0000-0xfeae0fff,0xfeaa0000-0xfeabffff irq 21 at device 3.0 on pci0
miibus0: <MII bus> on fxp0
inphy0: <i82555 10/100 media interface> on miibus0
inphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fxp0: Ethernet address: 00:03:47:30:a7:1b
fxp1: <Intel 82550 Pro/100 Ethernet> port 0x1440-0x147f mem 0xfea80000-0xfea80fff,0xfea60000-0xfea7ffff irq 20 at device 4.0 on pci0
miibus1: <MII bus> on fxp1
inphy1: <i82555 10/100 media interface> on miibus1
inphy1:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fxp1: Ethernet address: 00:03:47:30:a7:1c
pci0: <display, VGA> at device 12.0 (no driver attached)
isab0: <PCI-ISA bridge> at device 15.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <ServerWorks CSB5 UDMA100 controller> port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0x3a0-0x3af,0x410-0x413 at device 15.1 on pci0
ata0: <ATA channel 0> on atapci0
ata1: <ATA channel 1> on atapci0
ohci0: <OHCI (generic) USB controller> mem 0xfea40000-0xfea40fff irq 10 at device 15.2 on pci0
ohci0: [GIANT-LOCKED]
usb0: OHCI version 1.0, legacy support
usb0: SMM does not respond, resetting
usb0: <OHCI (generic) USB controller> on ohci0
usb0: USB revision 1.0
uhub0: (0x1166) OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 4 ports with 4 removable, self powered
pcib1: <ACPI Host-PCI bridge> on acpi0
pci1: <ACPI PCI bus> on pcib1
iir0: <Intel Integrated RAID Controller> mem 0xfc8f0000-0xfc8f3fff irq 30 at device 9.0 on pci1
iir0: [GIANT-LOCKED]
pcib2: <ACPI Host-PCI bridge> on acpi0
pci2: <ACPI PCI bus> on pcib2
fdc0: <floppy drive controller (FDE)> port 0x3f2-0x3f3,0x3f4-0x3f5,0x3f7 irq 6 drq 2 on acpi0
fdc0: [FAST]
sio0: configured irq 4 not in bitmap of probed irqs 0
sio0: port may not be enabled
sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
sio0: type 16550A
sio1: configured irq 3 not in bitmap of probed irqs 0
sio1: port may not be enabled
sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0
sio1: type 16550A
pmtimer0 on isa0
orm0: <ISA Option ROMs> at iomem 0xc0000-0xc7fff,0xcd800-0xcefff,0xcf000-0xd07ff,0xe4000-0xe7fff on isa0
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
ppc0: parallel port not found.
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
ukbd0: DELL DELL USB Keyboard, rev 1.10/1.05, addr 2, iclass 3/1
kbd2 at ukbd0
Timecounters tick every 1.000 msec
acd0: CDROM <SAMSUNG CD-ROM SN-124/QM15> at ata0-master PIO4
Waiting 5 seconds for SCSI devices to settle
ses0 at iir0 bus 1 target 6 lun 0
ses0: <ESG-SHV SCA HSBP M16 0.05> Fixed Processor SCSI-2 device 
ses0: SAF-TE Compliant Device
da0 at iir0 bus 2 target 0 lun 0
da0: <Intel Host Drive   #00 > Fixed Direct Access SCSI-2 device 
da0: Tagged Queueing Enabled
da0: 139878MB (286471080 512 byte sectors: 255H 63S/T 17832C)
lapic0: Forcing LINT1 to edge trigger
SMP: AP CPU #1 Launched!
Trying to mount root from ufs:/dev/da0s1a
WARNING: / was not properly dismounted
WARNING: /tmp was not properly dismounted
WARNING: /usr was not properly dismounted
WARNING: /var was not properly dismounted
WARNING: /vm was not properly dismounted


More information about the freebsd-stable mailing list