bsd-unix at embarqmail.com
Sun Nov 18 13:41:56 PST 2007
On Sun, 18 Nov 2007 22:12:34 +0100
"n j" <nino80 at gmail.com> wrote:
> > Has it happened before, or does it happen every day at 3 am, or is this the first time your box has shut down without explanation?
> No, this is the first time this has occurred, that is what makes it
> completely unexpected.
> > If this is the first time, I would say there are many possibilities. Say, an accidental quick push on the power button or - humor me - the cleaning lady is with the conserve-energy movement and thought your box was just another forgotten-to-shutdown desktop; that alone could explain your mysterious shutdown incident.
> The machine is located in a server room within a server rack with a
> (detachable) panel on the front side of the machine (Dell Poweredge)
> that is covering the power-off button. No cleaning lady is entering
> the room, especially at 3 AM. Due to all the circumstances I had
> described, I ruled out a (physical) human factor as the cause of
> the shutdown.
> The box has two independent AC power supplies, no hardware error is
> found in the RAC card logs, and no other server (in the same rack/room)
> shut down at that time. That is what leads me to believe that the problem
> is software-related.
> I know there are many possibilities out there, but I have been pondering
> this for the whole day and have ruled out everything that came to mind. So, any
> other ideas - even humorous - are welcome.
A few months ago I started having random, mysterious lockups: no
panics, no messages, no hints, no keyboard and no ssh. Each one forced
me to power-cycle the machine to get the system back.
After playing the RAM swap game, updating sources, and other such
dead-ends, I felt the hard drives (Maxtor 7200RPM 250G type) and
they were quite warm. I did a little hardware re-arranging so that the
hard drives got more air and I've not had a lockup since. I had also
been monitoring the temperature but didn't see any indication that it
was the CPU or motherboard components.
This is all anecdotal: I don't have any hard evidence pointing to
exactly one thing, since I also swapped out a fan and reseated
connectors in the process. My feeling is that it was hard drive
heat-related, so my suggestion is to do some poking around for hot
spots, clogged fan filters and any other factors affecting temperatures.
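For the drive side of that poking around, something like the rough
Python sketch below could log drive temperatures. It assumes
smartmontools is installed and that the drive reports its temperature
as SMART attribute 194, which many but not all drives do. The function
names are made up for illustration, and the device paths you pass in
are whatever your system uses.

```python
import re
import subprocess
import sys


def parse_drive_temp(smartctl_output):
    """Return the temperature in Celsius found in `smartctl -A` text, or None.

    Assumption: the drive exposes temperature as attribute ID 194 and the
    raw value (10th column) starts with the temperature in degrees Celsius.
    """
    for line in smartctl_output.splitlines():
        fields = line.split()
        # Attribute rows begin with the numeric attribute ID.
        if len(fields) >= 10 and fields[0] == "194":
            match = re.match(r"\d+", fields[9])
            if match:
                return int(match.group())
    return None


def read_drive_temp(device):
    """Run `smartctl -A` on one device and parse the result.

    smartctl uses its exit status as a bitmask, so a non-zero status is
    not treated as a failure here.
    """
    result = subprocess.run(
        ["smartctl", "-A", device], capture_output=True, text=True
    )
    return parse_drive_temp(result.stdout)


if __name__ == "__main__":
    # Usage: pass one or more device paths on the command line.
    for device in sys.argv[1:]:
        print(device, read_drive_temp(device))
```

Run periodically (from cron, say), this would at least show whether
the drives are running hot around the time of a lockup or shutdown.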
In any case, in the grand scheme of things, *all* hardware will
fail ... eventually ;-)