Disc lock up on 8.0-STABLE

Florian Smeets flo at smeets.im
Tue Dec 22 14:00:21 UTC 2009


On 12/22/09 1:16 PM, Pete French wrote:
> I've been gradually testing 8.0 on several machines propr to deploying it
> live, but I currently have a machine which appears to lock-up at 3am
> every day. The symptoms are that the machine is still pingable, but doing
> anything which requires access to the disc just freezes (so you cant login for
> example). I've seen simiilar behaviour behore on machines when the disc
> syste has locked up for some reason, so am ttentatively guessing that
> this is the cause.
> 
> The machine is an HP DL360 G5 with a ciss0 controller for the drives.
> I have upgraded to the latest STABLE but the freeze still happens.
> Am including a dmesg below, and will compile it with KDB, DDB to
> see what happens.
> 
> The machine is booting from a UFS partition, but is using ZFS for everything
> else. The fcat it deadlocks at 3am makes me thing this is something to
> do with scheduled jobs maybe ? Then again, I have an almost identical DL360
> which is running 8.0 and is rock solid.
> 

Hi Pete,

i'm trying to track down the same problem. The box in question has
everything on UFS (mirrored ataraid) and a backup disk with ZFS on it
attached to USB.

The freeze happens at 3am too, i have these log messages:
Dec 20 03:00:00 XXX newsyslog[2810]: logfile turned over due to size>100K
Dec 20 03:03:21 XXX kernel: Approaching the limit on PV entries,
consider increasing either the vm.pmap.shpgperproc or the
vm.pmap.pv_entry_max sysctl.
Dec 20 03:03:22 XXX kernel: maxproc limit exceeded by uid 0, please see
tuning(7) and login.conf(5).
Dec 20 03:03:53 XXX last message repeated 31 times

I had increased vm.pmap.pv_entry_max and vm.pmap.shpgperproc then only
the maxproc limit exceeded message remained.

As the box is remote with only ssh access, it's a little difficult to
debug this. During the weekend i waited till 3 o'clock with a top
running, and saw that hundreds/thousands of /bin/sh processes were
started. After that i commented out periodic daily in /etc/crontab, that
"solved" the problem for me.

I was not able to debug this any further yet, i have one other box with
all UFS and a ZFS backup disc also running latest 8-STABLE but it does
not exhibit the problem.

Cheers,
Florian


More information about the freebsd-stable mailing list