[Bug 277499] panic in doneq0 xpt_done_td xpt_done_process after HDD falling off the bus (Periph destroyed)

From: <bugzilla-noreply_at_freebsd.org>
Date: Tue, 05 Mar 2024 09:59:02 UTC
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=277499

            Bug ID: 277499
           Summary: panic in doneq0 xpt_done_td xpt_done_process after HDD
                    falling off the bus (Periph destroyed)
           Product: Base System
           Version: 15.0-CURRENT
          Hardware: amd64
                OS: Any
            Status: New
          Severity: Affects Only Me
          Priority: ---
         Component: kern
          Assignee: bugs@FreeBSD.org
          Reporter: bsd@kobyla.org

Created attachment 248946
  --> https://bugs.freebsd.org/bugzilla/attachment.cgi?id=248946&action=edit
detailed kgdb trace log

The only active geom there is the single vdev zfs partition (without a
replication).
gmirror "efi" one is idle and such usually do not cause any problems, only
active zfs matters

FreeBSD 15.0-CURRENT #2 main-n268534-32fdcff8703d: Mon Feb 26 01:08:14 EET 2024

(ada3:ahcich3:0:0:0): READ_FPDMA_QUEUED. ACB: 60 b0 88 e0 81 40 57 00 00 07 00
00
(ada3:ahcich3:0:0:0): CAM status: Command timeout
(ada3:ahcich3:0:0:0): Retrying command, 2 more tries remain
ahcich3: SATA connect time=100us status=00000133
ahcich3: AHCI reset: device found
ahcich3: AHCI reset: device ready after 100ms
ahcich3: Timeout on slot 26 port 0
ahcich3: is 00000000 cs 00000000 ss 3c000000 rs 3c000000 tfd 50 serr 00000000
cmd 0000dd17
ahcich3: AHCI reset...
(ada3:ahcich3:0:0:0): READ_FPDMA_QUEUED. ACB: 60 b0 88 e0 81 40 57 00 00 07 00
00
(ada3:ahcich3:0:0:0): CAM status: Command timeout
(ada3:ahcich3:0:0:0): Retrying command, 1 more tries remain
ahcich3: SATA connect time=100us status=00000133
ahcich3: AHCI reset: device found
ahcich3: AHCI reset: device ready after 100ms
ipfw: 24111 Nat TCP 10.0.11.17:39346 10.0.6.178:10050 in via em0
ahcich3: Timeout on slot 2 port 0
ahcich3: is 00000000 cs 00000000 ss 0000003c rs 0000003c tfd 50 serr 00000000
cmd 0000c517
ahcich3: AHCI reset...
(ada3:ahcich3:0:0:0): READ_FPDMA_QUEUED. ACB: 60 b0 88 e0 81 40 57 00 00 07 00
00
(ada3:ahcich3:0:0:0): CAM status: Command timeout
(ada3:ahcich3:0:0:0): Retrying command, 0 more tries remain
ahcich3: SATA connect time=100us status=00000133
ahcich3: AHCI reset: device found
ahcich3: AHCI reset: device ready after 100ms
(ada3:ahcich3:0:0:0): Periph destroyed
GEOM_MIRROR: Device efi: provider ada3p99 disconnected.


Fatal trap 12: page fault while in kernel mode
cpuid = 1; apic id = 02
fault virtual address   = 0x18
fault code              = supervisor write data, page not present
instruction pointer     = 0x20:0xffffffff80325322
stack pointer           = 0x28:0xfffffe0121bf5e80
frame pointer           = 0x28:0xfffffe0121bf5ea0
code segment            = base 0x0, limit 0xfffff, type 0x1b
                        = DPL 0, pres 1, long 1, def32 0, gran 1
processor eflags        = interrupt enabled, resume, IOPL = 0
current process         = 4 (doneq0)
rdi: fffff80001e2b660 rsi: 0000000000000001 rdx: 00000000000013f6
rcx: 0000000000000000  r8: ffffffff80c87ff0  r9: 0000000000526415
rax: 0000000000000000 rbx: fffff80001e2b660 rbp: fffffe0121bf5ea0
r10: 0000000000001388 r11: 000000008059e074 r12: 0000000000000000
r13: fffffe0121bf5eb8 r14: 0000000000000000 r15: fffff80001f42740
trap number             = 12
panic: page fault
cpuid = 1
time = 1709613907
KDB: stack backtrace:
db_trace_self_wrapper() at 0xffffffff8045142b =
db_trace_self_wrapper+0x2b/frame 0xfffffe0121bf5b60
vpanic() at 0xffffffff8074c855 = vpanic+0x135/frame 0xfffffe0121bf5c90
panic() at 0xffffffff8074c713 = panic+0x43/frame 0xfffffe0121bf5cf0
trap_fatal() at 0xffffffff80bb43dc = trap_fatal+0x40c/frame 0xfffffe0121bf5d50
trap_pfault() at 0xffffffff80bb442f = trap_pfault+0x4f/frame 0xfffffe0121bf5db0
calltrap() at 0xffffffff80b8c238 = calltrap+0x8/frame 0xfffffe0121bf5db0
--- trap 0xc, rip = 0xffffffff80325322, rsp = 0xfffffe0121bf5e80, rbp =
0xfffffe0121bf5ea0 ---
xpt_done_process() at 0xffffffff80325322 = xpt_done_process+0x222/frame
0xfffffe0121bf5ea0
xpt_done_td() at 0xffffffff80327485 = xpt_done_td+0xf5/frame 0xfffffe0121bf5ef0
fork_exit() at 0xffffffff80705acf = fork_exit+0x7f/frame 0xfffffe0121bf5f30
fork_trampoline() at 0xffffffff80b8d29e = fork_trampoline+0xe/frame
0xfffffe0121bf5f30
--- trap 0, rip = 0, rsp = 0, rbp = 0 ---
KDB: enter: panic

-- 
You are receiving this mail because:
You are the assignee for the bug.