From nobody Fri Nov 05 16:39:35 2021
X-Original-To: freebsd-fs@mlmmj.nyi.freebsd.org
Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1])
    by mlmmj.nyi.freebsd.org (Postfix) with ESMTP id 6BCEF1831543
    for ; Fri, 5 Nov 2021 16:40:12 +0000 (UTC)
    (envelope-from freebsd-listen@fabiankeil.de)
Received: from smtprelay03.ispgateway.de (smtprelay03.ispgateway.de [80.67.18.15])
    (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
    (Client did not present a certificate)
    by mx1.freebsd.org (Postfix) with ESMTPS id 4Hm5pN2fKxz3p09
    for ; Fri, 5 Nov 2021 16:40:12 +0000 (UTC)
    (envelope-from freebsd-listen@fabiankeil.de)
Received: from [91.20.68.89] (helo=fabiankeil.de)
    by smtprelay03.ispgateway.de with esmtpsa (TLS1.2) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384
    (Exim 4.94.2) (envelope-from )
    id 1mj2GO-0004hu-Kj
    for freebsd-fs@freebsd.org; Fri, 05 Nov 2021 17:40:24 +0100
Date: Fri, 5 Nov 2021 17:39:35 +0100
From: Fabian Keil
To: freebsd-fs@freebsd.org
Subject: Re: ZFS operations hanging, but no visible errors?
Message-ID: <20211105173935.7aa53269@fabiankeil.de>
In-Reply-To:
References:
Reply-To: freebsd-fs@freebsd.org
List-Id: Filesystems
List-Archive: https://lists.freebsd.org/archives/freebsd-fs
List-Help:
List-Post:
List-Subscribe:
List-Unsubscribe:
Sender: owner-freebsd-fs@freebsd.org
MIME-Version: 1.0
Content-Type: multipart/signed; boundary="Sig_/tBj+BYLZ16DGO2OLxlmvMTu";
    protocol="application/pgp-signature"; micalg=pgp-sha1
X-Df-Sender: Nzc1MDY3
X-Rspamd-Queue-Id: 4Hm5pN2fKxz3p09
X-Spamd-Bar: ----
Authentication-Results: mx1.freebsd.org; none
X-Spamd-Result: default: False [-4.00 / 15.00]; REPLY(-4.00)[]
X-ThisMailContainsUnwantedMimeParts: N

--Sig_/tBj+BYLZ16DGO2OLxlmvMTu
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

Chris Ross wrote on 2021-11-05:

> Hey there. I have a server running FreeBSD 13.0-RELEASE,
> with a large ZFS zpool.
> I have a UFS mirror on hardware
> raid, then a bunch of JBOD disks in a pool. I recently
> added a new vdev to this pool, which may or may not be related.
>
> Today, I started an rsync of a large (100GB) file from
> the pool to another host. After a while (7%), it seemed
> no progress was being made. I tried to kill the rsync,
> which didn’t exit, or suspend. Now anything that touches
> the pool seems to hang. But, the system is otherwise
> functional, console shows no issues, the controller
> (via out-of-band management interface) shows all disks
> as having no errors or issues.
>
> Any idea what I should be looking for, and if there’s
> any way to recover it without reboot?

You can check with "procstat -kk" where the hanging
processes are stuck.

The output of:

    dtrace -qn 'zfs-dbgmsg {printf("%Y: %s\n", walltimestamp, stringof(arg0))}'

could be useful as well.

Fabian

--Sig_/tBj+BYLZ16DGO2OLxlmvMTu
Content-Type: application/pgp-signature
Content-Description: OpenPGP digital signature

-----BEGIN PGP SIGNATURE-----

iF0EARECAB0WIQTKUNd6H/m3+ByGULIFiohV/3dUnQUCYYVeSAAKCRAFiohV/3dU
nR5+AKDKNghhEaIH8ksYjE82yz/+NpyQ9wCgiAEkOv6otYFHdrbBgNkZlDx2/j8=
=Krpn
-----END PGP SIGNATURE-----

--Sig_/tBj+BYLZ16DGO2OLxlmvMTu--
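
A rough sketch of how the "procstat -kk" suggestion above might be applied when it is not obvious which processes are hung: pick out everything that ps(1) reports in uninterruptible wait (state "D") and dump the kernel stack of each. The function names stuck_pids and dump_stuck_stacks are made up for illustration, and filtering on "D" is an assumption about what "stuck" looks like here. Idle kernel threads normally sit in that state too, so the output will likely include some noise.

```shell
#!/bin/sh
# Hypothetical helper: read `ps -axo pid,state,wchan,command` output on
# stdin and print the PID of every process whose state starts with "D"
# (uninterruptible wait). The first line is the ps header and is skipped.
stuck_pids() {
    awk 'NR > 1 && $2 ~ /^D/ { print $1 }'
}

# Hypothetical wrapper: dump kernel stacks for each such process.
# FreeBSD only; run as root to see processes owned by other users.
dump_stuck_stacks() {
    ps -axo pid,state,wchan,command | stuck_pids | while read -r pid; do
        procstat -kk "$pid"
    done
}
```

Running dump_stuck_stacks as root while the pool is hung and looking for stack frames in the zfs module may help narrow down where the rsync and the other processes are waiting.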