Help! two machines ran out of swap and corrupted their zpools!

Mark Saad nonesuch at longcount.org
Mon Nov 21 17:59:00 UTC 2016


On Mon, Nov 21, 2016 at 12:50 PM, Jan Bramkamp <crest at rlwinm.de> wrote:
> On 21/11/2016 18:47, Pete French wrote:
>>
>> So, I am off sick and my colleagues decided to load test our set of five
>> servers excesively. All ran out of swap. So far so irritating, but whats
>> has
>> happened is that twoof them now will not boot, as it appears the ZFS pool
>> they are booting from has become corrupted.
>>
>> One starts to boot, then crases importing the root pool. The other doenst
>> even get that far with gptzfsboot saying it can't find the pool to boot
>> from!
>>
>> Now I can recover these, but I am a bit worried, that it got like this at
>> all, as I havent ever seen ZFS corrupt a pool like this. Anyone got any
>> insights,
>> or suggstions as to how to stop it happening again ?
>>
>> We are swapping to a separate partition, not to the pool by theway.
>
>
> How much trust do you put in your hardware? Have you ever put the hardware
> under full load for extended periods before e.g. run poudriere to build pkg
> repos?
>
> -- Jan Bramkamp
>
> _________



Pete
   I am thinking like Jan, that this points to a hardware issue. What
can you tell us about the servers ? What sort of cpu, do you have ecc
ram ? How much ram what was running at the time ? Do you use a slog,
l2arc , whats
the zpool status -v show?


______________________________________
> freebsd-stable at freebsd.org mailing list
> https://lists.freebsd.org/mailman/listinfo/freebsd-stable
> To unsubscribe, send any mail to "freebsd-stable-unsubscribe at freebsd.org"



-- 
mark saad | nonesuch at longcount.org


More information about the freebsd-stable mailing list