Re: [External] Re: ZFS - reboot during resilver doesn't work

From: David Christensen <dpchrist_at_holgerdanske.com>
Date: Mon, 08 Sep 2025 04:08:32 UTC
On 9/7/25 17:00, Frank Leonhardt wrote:
> ... I did suspect SMR 
> had something to do with it, which I why I threw that in.
> 
> ... I'm sitting at the data centre watching it now 
> (it's well past midnight). I've shut everything else down apart from 
> sshd to give it the best chance.
> 
> <snip>
> 
> pool: zr
> state: DEGRADED
> status: One or more devices is currently being resilvered. The pool will continue to function, possibly in a degraded state.
> action: Wait for the resilver to complete.
> scan: resilver in progress since Sun Sep 7 21:30:58 2025 322G / 922G scanned at 196B/s, 206G / 922G issued at 125B/s 206G resilvered, 22.32% done, no estimated completion time 
> config: NAME STATE READ WRITE CKSUM
> zr DEGRADED 0 0 0 
> mirror-0 DEGRADED 0 0 0 
> replacing-0 DEGRADED 3 0 0 ada0p3/old UNAVAIL 0 0 0
> cannot open ada0p3 ONLINE 0 0 5 (resilvering)
> ada1p3 ONLINE 0 0 0 
>
> This after a hard reset two hours ago - before this it was estimating 
> five hours. Oh, hang on, six hours. No, wait, seven hours...
> 
> The SMR drive is now at 37C (the other is at 30C). I'm thinking of 
> pulling it and putting a spare six-year-old Constellation in its place, 
> which I brought along just-in-case. Regards, Frank.


The few times I have replaced and resilvered a HDD, the process took 
about 4 hours per TB of data.  The disks were 3 TB SATA Seagate 
Barracuda or Constellation ES.2.


If you are wondering if the existing drive and/or the replacement drive 
are having issues, perhaps running `smartctl -x ...` could confirm or 
refute.  And, look at /var/log/messages for errors.


David