correctable DMA error AFAR

Chris Jackman cjacknospamthanks at klatsch.org
Mon Jul 21 12:44:38 PDT 2003



Hola senores!

Error messages:

pcib0: correctable DMA error AFAR 0x476d6140 AFSR 0x40e600003f800000
and
pcib0: correctable DMA error AFAR 0x40adbc40 AFSR 0x40c400003f800000

My e250 has locked up twice in the last few weeks with these
error messages.  The error gets repeated over and over
again on the serial console, and I can't do anything to the
box except power cycle it.  The first time it happened, I was
transferring about 10 5gig files from another machine
on the same switch.  The second time, the machine was idle.

I see the error message in /u/s/sys/sparc64/pci/psycho.c, 
in psycho_ce() at line 751.  My world and kernel are 
from ~July 10th, and I have the latest psycho.c (1.41).

How come this error is correctable, but the other functions 
around this one are all uncorrectable?  Perhaps this function 
should also panic, since my machine is unusable when this 
error occurs.  

Also, is there a way to send a break over the serial console? 
I can send it with cu using ~#, but the e250 doesn't respond
to it.  My guess is that it is a solaris feature to catch
the break signal, and drop to the Openboot Firmware.  

I'll hook up a keyboard to this machine with boot.conf
settings to get the console output to the serial port, 
and if the error happens again I'll try ctrl-alt-escape 
on the keyboard to try and get to the debugger.


Thanks!



More information about the freebsd-sparc64 mailing list