Xserve G4 stability (random processes crashing)

Kevin Day toasty at dragondata.com
Mon Mar 29 17:33:04 UTC 2010


Thanks to some help, we've got 8.0-STABLE running on several Xserve G4 boxes now, in both UP and SMP configurations.

However, all of them are showing weird stability problems. Running OS X Server, they were completely stable for years doing pretty hard work (video encoding) with no errors. They all pass Apple's hardware burn-in, too. But, doing a "buildworld" or "buildkernel" will result in random segfaults, invalid .o files being created, or ICEs that go away after immediately retrying. (i.e. it doesn't appear to be data from the disks being cached incorrectly, I don't have to force a re-read to fix) Pure CPU tasks (like memtester from ports) work fine for days. 

Are there any known issues with 8.0 on an XServe G4?

-- Kevin




FreeBSD 8.0-STABLE #1: Sun Mar 28 01:04:25 UTC 2010
    root@:/usr/src/sys/powerpc/compile/SERVER powerpc
cpu0: Motorola PowerPC 7455 revision 3.3, 1333.41 MHz
cpu0: Features 9c000000<PPC32,ALTIVEC,FPU,MMU>
cpu0: HID0 8450c0bc<EMCP,TBEN,NAP,DPM,ICE,DCE,SGE,BTIC,LRSTK,FOLD,BHT>
real memory  = 1329098752 (1267 MB)
avail memory = 1278320640 (1219 MB)
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
cpu0: dev=ff883c10 (BSP)
cpu1: dev=ff885050
ispfw: registered firmware <isp_1040>
ispfw: registered firmware <isp_1040_it>
ispfw: registered firmware <isp_1080>
ispfw: registered firmware <isp_1080_it>
ispfw: registered firmware <isp_12160>
ispfw: registered firmware <isp_12160_it>
ispfw: registered firmware <isp_2100>
ispfw: registered firmware <isp_2200>
ispfw: registered firmware <isp_2300>
ispfw: registered firmware <isp_2322>
ispfw: registered firmware <isp_2400>
ispfw: registered firmware <isp_2400_multi>
ispfw: registered firmware <isp_2500>
ispfw: registered firmware <isp_2500_multi>
kbd0 at kbdmux0
nexus0: <Open Firmware Nexus device>
cpulist0: <Open Firmware CPU Group> on nexus0
cpu0: <Open Firmware CPU> on cpulist0
cpu1: <Open Firmware CPU> on cpulist0
unin0: <Apple UniNorth System Controller> on nexus0
unin0: Version 36
pcib0: <Apple UniNorth Host-PCI bridge> on nexus0
pci0: <OFW PCI bus> on pcib0
bge0: <Broadcom BCM5703 A2, ASIC rev. 0x001002> mem 0xa0000000-0xa000ffff irq 48 at device 16.0 on pci0
miibus0: <MII bus> on bge0
brgphy0: <BCM5703 10/100/1000baseTX PHY> PHY 1 on miibus0
brgphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
bge0: Ethernet address: 00:03:93:f4:4d:62
bge0: [ITHREAD]
pcib1: <Apple UniNorth Host-PCI bridge> on nexus0
pci1: <OFW PCI bus> on pcib1
pcib2: <OFW PCI-PCI bridge> at device 13.0 on pci1
pci2: <OFW PCI bus> on pcib2
macio0: <KeyLargo I/O Controller> mem 0x80000000-0x8007ffff at device 7.0 on pci2
openpic0: <OpenPIC Interrupt Controller> mem 0x40000-0x7ffff on macio0
macgpio0: <MacIO GPIO Controller> mem 0x50-0x7f on macio0
pmuextint0: <Apple PMU99 External Interrupt> extint-gpio 1 irq 47 on macgpio0
scc0: <Zilog Z8530 dual channel SCC> mem 0x13000-0x13fff,0x8400-0x84ff,0x8500-0x85ff,0x8600-0x86ff,0x8700-0x87ff irq 22,5,6,23,7,8 on macio0
scc0: [FILTER]
scc0: [FILTER]
uart0: <z8530, channel A> on scc0
uart0: [FILTER]
uart1: <z8530, channel B> on scc0
uart1: [FILTER]
pmu0: <Apple PMU99 Controller> mem 0x16000-0x17fff irq 25 on macio0
pmu0: [ITHREAD]
iichb0: <Keywest I2C controller> mem 0x18000-0x18fff irq 26 on macio0
iichb0: [ITHREAD]
iicbus0: <OFW I2C bus> on iichb0
iicbus0: <unknown card> at addr 0x1c0
ata0: <Apple MacIO Ultra ATA Controller> mem 0x1f000-0x1ffff,0x8a00-0x8aff irq 19,11 on macio0
ata0: [ITHREAD]
ohci0: <Apple KeyLargo USB controller> mem 0x80081000-0x80081fff irq 27 at device 8.0 on pci2
ohci0: [ITHREAD]
usbus0: <Apple KeyLargo USB controller> on ohci0
ohci1: <Apple KeyLargo USB controller> mem 0x80080000-0x80080fff irq 28 at device 9.0 on pci2
ohci1: [ITHREAD]
usbus1: <Apple KeyLargo USB controller> on ohci1
pcib3: <OFW PCI-PCI bridge> at device 17.0 on pci1
pci3: <OFW PCI bus> on pcib3
pcib1: failed to reserve resource for (null)
vgapci0: <VGA-compatible display> mem 0x88000000-0x8fffffff,0x80100000-0x8010ffff irq 52 at device 2.0 on pci3
atapci0: <Promise PDC20271 UDMA133 controller> port 0x1090-0x1097,0x1080-0x1083,0x1070-0x1077,0x1060-0x1063,0x1050-0x105f mem 0x90030000-0x9003ffff irq 58 at device 21.0 on pci1
atapci0: [ITHREAD]
ata2: <ATA channel 0> on atapci0
ata2: [ITHREAD]
ata3: <ATA channel 1> on atapci0
ata3: [ITHREAD]
atapci1: <Promise PDC20271 UDMA133 controller> port 0x1040-0x1047,0x1030-0x1033,0x1020-0x1027,0x1010-0x1013,0x1000-0x100f mem 0x90010000-0x9001ffff irq 63 at device 27.0 on pci1
atapci1: [ITHREAD]
ata4: <ATA channel 0> on atapci1
ata4: [ITHREAD]
ata5: <ATA channel 1> on atapci1
ata5: [ITHREAD]
pcib4: <Apple UniNorth Host-PCI bridge> on nexus0
pci4: <OFW PCI bus> on pcib4
fwohci0: <Apple UniNorth> mem 0xf5000000-0xf5000fff irq 40 at device 14.0 on pci4
fwohci0: [ITHREAD]
fwohci0: OHCI version 1.10 (ROM=0)
fwohci0: No. of Isochronous channels is 8.
fwohci0: EUI64 00:0a:95:ff:fe:75:6e:d0
fwohci0: invalid speed 7 (fixed to 3).
fwohci0: Phy 1394a available S800, 3 ports.
fwohci0: Link S800, max_rec 4096 bytes.
firewire0: <IEEE1394(FireWire) bus> on fwohci0
fwe0: <Ethernet over FireWire> on firewire0
if_fwe0: Fake Ethernet address: 02:0a:95:75:6e:d0
fwe0: Ethernet address: 02:0a:95:75:6e:d0
fwohci0: Initiate bus reset
fwohci0: fwohci_intr_core: BUS reset
fwohci0: fwohci_intr_core: node_id=0x00000001, SelfID Count=2, CYCLEMASTER mode
gem0: <Apple UniNorth2 GMAC Ethernet> mem 0xf5200000-0xf53fffff irq 41 at device 15.0 on pci4
miibus1: <MII bus> on gem0
brgphy1: <BCM5754 10/100/1000baseTX PHY> PHY 0 on miibus1
brgphy1:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
gem0: 10kB RX FIFO, 4kB TX FIFO
gem0: Ethernet address: 00:0a:95:75:6e:d0
gem0: [ITHREAD]
ata1: <Uninorth2 Kauai ATA Controller> mem 0xf5004000-0xf5007fff irq 39 at device 13.0 on pci4
ata1: [ITHREAD]
sc0: <System console> on nexus0
sc0: Unknown <16 virtual consoles, flags=0x300>
Timecounter "decrementer" frequency 41537897 Hz quality 0
Timecounters tick every 10.000 msec
firewire0: 2 nodes, maxhop <= 1 cable IRM irm(1)  (me) 
firewire0: bus manager 1 
usbus0: 12Mbps Full Speed USB v1.0
usbus1: 12Mbps Full Speed USB v1.0
ugen0.1: <Apple> at usbus0
uhub0: <Apple OHCI root HUB, class 9/0, rev 1.00/1.00, addr 1> on usbus0
ugen1.1: <Apple> at usbus1
uhub1: <Apple OHCI root HUB, class 9/0, rev 1.00/1.00, addr 1> on usbus1
acd0: CDROM <QSI CD-ROM TCR-241/WL11> at ata0-master PIO4 
ad0: 57241MB <Seagate ST360015A 3.31> at ata2-master UDMA133 
ad1: 57241MB <Seagate ST360015A 3.31> at ata3-master UDMA133 
ad2: 57241MB <Seagate ST360015A 3.31> at ata4-master UDMA133 
ad3: 57241MB <Seagate ST360015A 3.31> at ata5-master UDMA133 
SMP: AP CPU #1 launched
uhub0: 2 ports with 2 removable, self powered
uhub1: 2 ports with 2 removable, self powered
ugen0.2: <Mitsumi Electric> at usbus0
uhub2: <Mitsumi Electric Hub in Apple Extended USB Keyboard, class 9/0, rev 1.10/1.22, addr 2> on usbus0
uhub2: 3 ports with 2 removable, bus powered
ugen0.3: <Mitsumi Electric> at usbus0
ukbd0: <Mitsumi Electric Apple Extended USB Keyboard, class 0/0, rev 1.10/1.22, addr 3> on usbus0
kbd1 at ukbd0
uhid0: <Mitsumi Electric Apple Extended USB Keyboard, class 0/0, rev 1.10/1.22, addr 3> on usbus0
Trying to mount root from ufs:/dev/ad0s3
bge0: link state changed to UP



More information about the freebsd-ppc mailing list