freebsd panic on HP Proliant DL360
Ernest Natiello
enatiello at broadviewnet.net
Thu Oct 12 15:56:42 UTC 2006
here we go:
(kgdb) frame 7
#7 0xc072191d in ip_ctloutput (so=0x1, sopt=0xe9226c90)
at /usr/src/sys/netinet/ip_output.c:1184
1184 INP_LOCK(inp);
(kgdb) p *sopt
$1 = {sopt_dir = SOPT_SET, sopt_level = 0, sopt_name = 1, sopt_val =
0x0,
sopt_valsize = 0, sopt_td = 0xc73add80}
(kgdb) frame 11
#11 0xc06c3ce6 in setsockopt (td=0xc73add80, uap=0x1)
at /usr/src/sys/kern/uipc_syscalls.c:1307
1307 return (kern_setsockopt(td, uap->s, uap->level,
uap->name,
(kgdb) p td->td_proc->p_comm
$2 = "tcpserver\000\000\000\000\000\000\000\000\000\000"
(kgdb)
On Thu, 2006-10-12 at 19:48 +0400, Gleb Smirnoff wrote:
> On Thu, Oct 12, 2006 at 11:18:03AM -0400, Ernest Natiello wrote:
> E> Hello,
> E> Thank you very much for all of the help. I am trying to understand
> E> this issue, as it has been plaguing me for quite some time.
> E> So, extrapolating from the below kgdb output, am I to assume that
> E> the process causing the error is tcpserver?
>
> Probably it is. However, can you run the gdb commands I mentioned
> in previous post, to make us sure.
>
> E> And should I further infer
> E> that tcpserver would cause this issue on all instances of FreeBSD
> E> RELENG_6, regardless of hardware?
>
> I think so. A tcpserver(8) in given configuration.
>
> E> I have three other servers HP Proliant DL380s (2u) which are
> E> operating in a _similar_ capacity, (incoming vs. outgoing mailservers)
> E> running the exact same software, which have never had a problem.
> E> These three servers are running: FreeBSD unix29 6.1-PRERELEASE
> E> FreeBSD 6.1-PRERELEASE #0: Mon Mar 27 10:42:56 EST 2006
> E> root at unix34.broadviewnet.net:/usr/obj/usr/src/sys/UNIX34 i386
> E> The operating system on this machine was rsync'd from one of the
> E> servers that is having the panic issue, yet it continues to operate
> E> flawlessly.
>
> The discussed problem is a race between remote client closing TCP
> connection (may be resetting?), and local software performing
> setsockopt() system call on the same socket.
>
> It may happen that this particulat server has to deal with clients
> that drop the connection randomly, and other servers don't. That's
> why other servers are stable.
>
> E> I guess I could try swapping the services between two of the
> E> servers and see if the behavior follows the move. Does that sound
> E> viable?
>
> You can try it.
>
> And don't forget to run gdb commands, and see what is the actual
> socket option that causes the problem.
>
More information about the freebsd-stable
mailing list