[Bug 257890] Storage controller lockup on zfs scrub [smartpqi][zfs]

From: <bugzilla-noreply_at_freebsd.org>
Date: Thu, 09 Dec 2021 12:15:17 UTC
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=257890

Nils Beyer <nbe@renzel.net> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |nbe@renzel.net

--- Comment #5 from Nils Beyer <nbe@renzel.net> ---
Hi,

any updates on this? I'm using three Adaptec 1100-4i HBAs each connected to a
seperate SuperMicro BPN-SAS3-216EL1 backplane for a total of 72 bays.

My zpool is created with 67 SSDs in a simple "RAID0"-config:

        zpool create atime=off mountpoint=none test da0 [..] da66

and each time I realiably can lockup a random controller by creating enough
load using:

        dd if=/dev/zero of=/mnt/test.dat bs=100M

and after a time of five minutes a parallel

        zpool scrub test

with following kernel messages:

        [...heartbeat...] controller is offline
        [...take_ctrl-offline...] Controller FW is not runniung. Lockup code =
1403a

The Adaptec HBA shows after reboot:

        1719-Slot 10 A controller failure event occurred prior to this power-up
          Previous lock up code=0001403A
        POST Messages Ended. Press any key to continue.

I even tried only one Adaptec 1100 HBA and the three backplanes as a cascade;
but the controller locks up using this config as well...



TIA and BR,
Nils

-- 
You are receiving this mail because:
You are the assignee for the bug.