[Bug 246279] ciss device driver not allowing more than 48 drives to be detected by the CAM layer

From: <bugzilla-noreply_at_freebsd.org>
Date: Mon, 08 May 2023 20:00:33 UTC
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=246279

--- Comment #25 from Peter Eriksson <pen@lysator.liu.se> ---
Rebooted and tried with hw.ciss.nop_message_heartbeat=1, then logged in via ssh
and ran an "sesutil show" (before zfs had started importing pools), then it
panic:ed with:

login: (da116:ciss2:32:56:0): READ(6). CDB: 08 00 01 48 08 00 
(da116:ciss2:32:56:0): CAM status: CCB request completed with an error
ciss2: *** Hot-plug drive removed, Port=2E Box=1 Bay=6 SN=            5PGTSWYC
(da116:ciss2:32:56:0): Retrying command, 3 more tries remain
ciss2: *** Physical drive failure, Port=2E Box=1 Bay=6 reason=0x14
(da116:ciss2:32:56:0): READ(16). CDB: 88 00 00 00 00 05 74 ff ff 00 00 00 01 00
00 00 
(da116:ciss2:32:56:0): CAM status: CCB request completed with an error
(da116:ciss2:32:56:0): Retrying command, 3 more tries remain
(da116:ciss2:32:56:0): READ(6). CDB: 08 00 01 50 b0 00 
(da116:ciss2:32:56:0): CAM status: CCB request completed with an error
(da116:ciss2:32:56:0): Retrying command, 3 more tries remain
(da116:ciss2:32:56:0): READ(10). CDB: 28 00 00 00 03 00 00 01 00 00 
(da116:ciss2:32:56:0): CAM status: CCB request completed with an error
(da116:ciss2:32:56:0): Retrying command, 3 more tries remain
(da116:ciss2:32:56:0): READ(16). CDB: 88 00 00 00 00 05 74 ff fd 00 00 00 01 00
00 00 
(da116:ciss2:32:56:0): CAM status: CCB request completed with an error
(da116:ciss2:32:56:0): Retrying command, 3 more tries remain
(da116:ciss2:32:56:0): READ(6). CDB: 08 00 01 40 08 00 
(da116:ciss2:32:56:0): CAM status: CCB request completed with an error
(da116:ciss2:32:56:0): Retrying command, 3 more tries remain
(da116:ciss2:32:56:0): READ(6). CDB: 08 00 01 38 08 00 
(da116:ciss2:32:56:0): CAM status: CCB request completed with an error
(da116:ciss2:32:56:0): Retrying command, 3 more tries remain
(da116:ciss2:32:56:0): READ(6). CDB: 08 00 01 28 08 00 
(da116:ciss2:32:56:0): CAM status: CCB request completed with an error
(da116:ciss2:32:56:0): Retrying command, 3 more tries remain
(da116:ciss2:32:56:0): READ(6). CDB: 08 00 01 30 08 00 
(da116:ciss2:32:56:0): CAM status: CCB request completed with an error
(da116:ciss2:32:56:0): Retrying command, 3 more tries remain
(da116:ciss2:32:56:0): READ(6). CDB: 08 00 01 20 08 00 
(da116:ciss2:32:56:0): CAM status: CCB request completed with an error
(da116:ciss2:32:56:0): Retrying command, 3 more tries remain
(da116:ciss2:32:56:0): READ(6)/WRITE(6) not supported, increasing
minimum_cmd_size to 10.
(da116:ciss2:32:56:0): READ(16). CDB: 88 00 00 00 00 05 74 ff ff 00 00 00 01 00
00 00 
(da116:ciss2:32:56:0): CAM status: SCSI Status Error
(da116:ciss2:32:56:0): SCSI status: Check Condition
(da116:ciss2:32:56:0): SCSI sense: ILLEGAL REQUEST asc:25,0 (Logical unit not
supported)
(da116:ciss2:32:56:0): Error 6, Unretryable error
(da116:ciss2:32:56:0): Invalidating pack
(da116:ciss2:32:56:0): READ(6)/WRITE(6) not supported, increasing
minimum_cmd_size to 10.
(da116:ciss2:32:56:0): READ(10). CDB: 28 00 00 00 03 00 00 01 00 00 
(da116:ciss2:32:56:0): CAM status: SCSI Status Error
(da116:ciss2:32:56:0): SCSI status: Check Condition
(da116:ciss2:32:56:0): SCSI sense: ILLEGAL REQUEST asc:25,0 (Logical unit not
supported)
(da116:ciss2:32:56:0): Error 6, Unretryable error
(da116:ciss2:32:56:0): READ(16). CDB: 88 00 00 00 00 05 74 ff fd 00 00 00 01 00
00 00 
(da116:ciss2:32:56:0): CAM status: SCSI Status Error
(da116:ciss2:32:56:0): SCSI status: Check Condition
(da116:ciss2:32:56:0): SCSI sense: ILLEGAL REQUEST asc:25,0 (Logical unit not
supported)
(da116:ciss2:32:56:0): Error 6, Unretryable error
(da116:ciss2:32:56:0): READ(6)/WRITE(6) not supported, increasing
minimum_cmd_size to 10.
(da116:ciss2:32:56:0): READ(6)/WRITE(6) not supported, increasing
minimum_cmd_size to 10.
(da116:ciss2:32:56:0): READ(6)/WRITE(6) not supported, increasing
minimum_cmd_size to 10.
(da116:ciss2:32:56:0): READ(6)/WRITE(6) not supported, increasing
minimum_cmd_size to 10.
da116 at ciss2 bus 32 scbus7 target 56 lun 0
da116: <HP MB012000JWDFD HPD2>  s/n 5PGTSWYC detached
May  8 21:56:32 balur03 ZFS[4858]: vdev probe failure, zpool=$FILUR02
path=$/dev/diskid/DISK-5PGTSWYC
(da116:ciss2:32:56:0): READ(6). CDB: 08 00 01 20 08 00 
(da116:ciss2:32:56:0): CAM status: CCB request completed with an error
(da116:ciss2:32:56:0): Error 5, Periph was invalidated
(da116:ciss2:32:56:0): READ(10). CDB: 28 00 00 00 01 48 00 00 08 00 
(da116:ciss2:32:56:0): CAM status: CCB request completed with an error
(da116:ciss2:32:56:0): Error 5, Periph was invalidated
(da116:ciss2:32:56:0): READ(10). CDB: 28 00 00 00 01 28 00 00 08 00 
(da116:ciss2:32:56:0): CAM status: CCB request completed with an error
(da116:ciss2:32:56:0): Error 5, Periph was invalidated
(da116:ciss2:32:56:0): READ(10). CDB: 28 00 00 00 01 30 00 00 08 00 
(da116:ciss2:32:56:0): CAM status: CCB request completed with an error
(da116:ciss2:32:56:0): Error 5, Periph was invalidated
(da116:ciss2:32:56:0): READ(10). CDB: 28 00 00 00 01 38 00 00 08 00 
(da116:ciss2:32:56:0): CAM status: CCB request completed with an error
(da116:ciss2:32:56:0): Error 5, Periph was invalidated
(da116:ciss2:32:56:0): READ(10). CDB: 28 00 00 00 01 40 00 00 08 00 
(da116:ciss2:32:56:0): CAM status: CCB request completed with an error
(da116:ciss2:32:56:0): Error 5, Periph was invalidated
(da116:ciss2:32:56:0): READ(10). CDB: 28 00 00 00 01 50 00 00 b0 00 
(da116:ciss2:32:56:0): CAM status: CCB request completed with an error
(da116:ciss2:32:56:0): Error 5, Periph was invalidated


Fatal trap 12: page fault while in kernel mode
cpuid = 3; apic id = 03
fault virtual address   = 0x180
fault code              = supervisor read data, page not present
(da116:ciss2:32:56:0): Periph destroyed
instruction pointer     = 0x20:0xffffffff8256e51d
stack pointer           = 0x28:0xfffffe058a1d59d0
frame pointer           = 0x28:0xfffffe058a1d5a40
code segment            = base 0x0, limit 0xfffff, type 0x1b
                        = DPL 0, pres 1, long 1, def32 0, gran 1
processor eflags        = interrupt enabled, resume, IOPL = 0
current process         = 45 (solthread 0xfffffff)
trap number             = 12
panic: page fault
cpuid = 3
time = 1683575795
KDB: stack backtrace:
#0 0xffffffff80c325f5 at kdb_backtrace+0x65
#1 0xffffffff80be89c8 at vpanic+0x178
#2 0xffffffff80be8843 at panic+0x43
#3 0xffffffff8110408f at trap_fatal+0x38f
#4 0xffffffff811040df at trap_pfault+0x4f
#5 0xffffffff810dbd58 at calltrap+0x8
#6 0xffffffff8256e49a at vdev_dtl_reassess+0x5a
#7 0xffffffff8256e49a at vdev_dtl_reassess+0x5a
#8 0xffffffff825639e2 at spa_vdev_state_exit+0x42
#9 0xffffffff8255d04f at spa_async_thread_vd+0x17f
#10 0xffffffff80ba969e at fork_exit+0x7e
#11 0xffffffff810dcd8e at fork_trampoline+0xe
System is going down.
Uptime: 4m40s

I've seen things you people wouldn't believe.
Attack ships on fire off the shoulder of Orion.
I watched C-beams glitter in the dark near the
Tannhäuser Gate. All those moments will be lost
in time, like tears in rain.

Time to die.

-- 
You are receiving this mail because:
You are on the CC list for the bug.