Re: Sudden zpool checksums errors
- In reply to: Andrea Venturoli : "Re: Sudden zpool checksums errors"
- Go to: [ bottom of page ] [ top of archives ] [ this month ]
Date: Mon, 07 Apr 2025 16:36:31 UTC
On Mon, 7 Apr 2025, at 15:15, Andrea Venturoli wrote: > On 4/7/25 15:07, mike tancsa wrote: > All "non-error" drives report: > SCT Error Recovery Control: > Read: Disabled > > Write: Disabled > > All "error" drives report: > SCT Error Recovery Control: > Read: 655 (65.5 seconds) > Write: 670 (67.0 seconds) > > I wonder if this could be the culprit... > I guess I should enable or disable it on all drives; however I've been > reading mixed opinions on whether this is good or bad for ZFS. > > Any suggestion? I would have a short timeout and rely on zfs to handle cleanup. The thinking is that it is better for latency to return (failed) fast, and let zfs give the correct data, then clean up afterwards, than potentially have the entire drive be marked offline by zfs because of the longer delay time. Does this seem reasonable? https://github.com/AMDmi3/scterc-rc.d & https://forums.truenas.com/t/checking-for-tler-erc-etc-support-on-a-drive/1497 may be useful. A+ Dave