From: bugzilla-noreply@freebsd.org
To: bugs@FreeBSD.org
Subject: [Bug 257094] SSD detach / reattach 2-3 times a day, WRITE(10).
 CDB: 2a 00 06 15 ec 28 00 00 08 00 / CAM status: SCSI Status Error
Date: Sat, 10 Jul 2021 10:51:19 +0000

https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=257094

            Bug ID: 257094
           Summary: SSD detach / reattach 2-3 times a day, WRITE(10). CDB: 2a
                    00 06 15 ec 28 00 00 08 00 / CAM status: SCSI Status Error
           Product: Base System
           Version: 12.2-RELEASE
          Hardware: Any
                OS: Any
            Status: New
          Severity: Affects Only Me
          Priority: ---
         Component: kern
          Assignee: bugs@FreeBSD.org
          Reporter: nerozero@gmail.com

Hello,

I have been facing this issue for quite a while: an SSD detaches and then
reattaches again within 3-10 seconds. Two identical SSD drives are in a ZFS
mirror used as the rootfs. I had similar issues on another machine with
generic hard drives, which were fixed by disabling the drives' built-in
EPC/APM. But this SSD doesn't seem to support EPC, and disabling APM or
setting it to 254 (maximum performance) has little to no effect.

The system is literally 4-5 months old. Long SMART tests on the SSD drives
show no errors. Replacing the SSD drives with new ones of the same model
produces the same result...
I tested the SSD in a Linux desktop PC, basically on stand-by, for a week:
no issues observed.

I also tried reading 100 MB blocks from the drive with dd every 2 hours,
which reduced the detach / reattach frequency from 3-5 times per 24 hours
to 1-2.

Also, camcontrol fails to set APM values, while smartctl has no issues doing
the same:

# camcontrol apm /dev/da1 -l 254
camcontrol: ATA SETFEATURES ENABLE APM via pass_16 failed

So I have a strong feeling that this looks like a bug...

logs:

----- 8< -----
/boot/loader.conf
mrsas_load=YES
hw.mfi.mrsas_enable=1

Hardware:
Platform: Dell PowerEdge R540
Storage Controller: PERC H730P Adapter / JBOD
Drive - SSDx2: KINGSTON SA400S3 120GB
Drive - SASx4: TOSHIBA MG04SCA40ENY

Kernel messages:
Jul 8 08:17:56 vmhost kernel: (da1:mrsas0:1:1:0): ATA COMMAND PASS THROUGH(16). CDB: 85 0d 06 00 01 00 01 00 00 00 00 00 00 40 06 00
Jul 8 08:17:56 vmhost kernel: (da1:mrsas0:1:1:0): CAM status: SCSI Status Error
Jul 8 08:17:56 vmhost kernel: (da1:mrsas0:1:1:0): SCSI status: OK
Jul 8 08:17:56 vmhost kernel: (da1:mrsas0:1:1:0): Invalidating pack
Jul 8 08:17:56 vmhost kernel: da1 at mrsas0 bus 1 scbus17 target 1 lun 0
Jul 8 08:17:56 vmhost kernel: da1: s/n xxxxxxxxxxxxxxxxxx detached
Jul 8 08:17:56 vmhost kernel: mrsas0: System PD deleted target ID: 0x1
Jul 8 08:17:56 vmhost kernel: (da1:mrsas0:1:1:0): Periph destroyed
Jul 8 08:18:13 vmhost kernel: mrsas0: System PD created target ID: 0x1
Jul 8 08:18:13 vmhost kernel: ses2: pass3,da1 in 'Drive Slot 1', SAS Slot: 2 phys at slot 1
Jul 8 08:18:13 vmhost kernel: ses2: phy 0: SATA device
Jul 8 08:18:13 vmhost kernel: ses2: phy 0: parent 500056b36d81e5ff addr 500056b36d81e5c1
Jul 8 08:18:13 vmhost kernel: ses2: phy 1: SAS device type 0 phy 0
Jul 8 08:18:13 vmhost kernel: ses2: phy 1: parent 0 addr 0
Jul 8 08:18:13 vmhost kernel: da1 at mrsas0 bus 1 scbus17 target 1 lun 0
Jul 8 08:18:13 vmhost kernel: da1: Fixed Direct Access SPC-4 SCSI device
Jul 8 08:18:13 vmhost kernel: da1: Serial Number xxxxxxxxxxxxxxxxxx
Jul 8 08:18:13 vmhost kernel: da1: 150.000MB/s transfers
Jul 8 08:18:13 vmhost kernel: da1: 114473MB (234441648 512 byte sectors)
Jul 8 19:29:39 vmhost kernel: (da1:mrsas0:1:1:0): ATA COMMAND PASS THROUGH(16). CDB: 85 0d 06 00 01 00 01 00 00 00 00 00 00 40 06 00
Jul 8 19:29:39 vmhost kernel: (da1:mrsas0:1:1:0): CAM status: SCSI Status Error
Jul 8 19:29:39 vmhost kernel: (da1:mrsas0:1:1:0): SCSI status: OK
Jul 8 19:29:39 vmhost kernel: (da1:mrsas0:1:1:0): Invalidating pack
Jul 8 19:29:39 vmhost kernel: da1 at mrsas0 bus 1 scbus17 target 1 lun 0
Jul 8 19:29:39 vmhost kernel: da1: s/n xxxxxxxxxxxxxxxxxx detached
Jul 8 19:29:39 vmhost kernel: mrsas0: System PD deleted target ID: 0x1
Jul 8 19:29:39 vmhost kernel: (da1:mrsas0:1:1:0): Periph destroyed
Jul 8 19:29:54 vmhost kernel: mrsas0: System PD created target ID: 0x1
Jul 8 19:29:54 vmhost kernel: ses2: pass3,da1 in 'Drive Slot 1', SAS Slot: 2 phys at slot 1
Jul 8 19:29:54 vmhost kernel: ses2: phy 0: SATA device
Jul 8 19:29:54 vmhost kernel: ses2: phy 0: parent 500056b36d81e5ff addr 500056b36d81e5c1
Jul 8 19:29:54 vmhost kernel: ses2: phy 1: SAS device type 0 phy 0
Jul 8 19:29:54 vmhost kernel: ses2: phy 1: parent 0 addr 0
Jul 8 19:29:54 vmhost kernel: da1 at mrsas0 bus 1 scbus17 target 1 lun 0
Jul 8 19:29:54 vmhost kernel: da1: Fixed Direct Access SPC-4 SCSI device
Jul 8 19:29:54 vmhost kernel: da1: Serial Number xxxxxxxxxxxxxxxxxx
Jul 8 19:29:54 vmhost kernel: da1: 150.000MB/s transfers
Jul 8 19:29:54 vmhost kernel: da1: 114473MB (234441648 512 byte sectors)
Jul 9 09:47:07 vmhost kernel: (da1:mrsas0:1:1:0): WRITE(10). CDB: 2a 00 06 1c 38 58 00 00 08 00
Jul 9 09:47:07 vmhost kernel: (da1:mrsas0:1:1:0): CAM status: SCSI Status Error
Jul 9 09:47:07 vmhost kernel: (da1:mrsas0:1:1:0): SCSI status: OK
Jul 9 09:47:07 vmhost kernel: (da1:mrsas0:1:1:0): Invalidating pack
Jul 9 09:47:07 vmhost kernel: da1 at mrsas0 bus 1 scbus17 target 1 lun 0
Jul 9 09:47:07 vmhost kernel: da1: s/n xxxxxxxxxxxxxxxxxx detached
Jul 9 09:47:07 vmhost kernel: mrsas0: System PD deleted target ID: 0x1
Jul 9 09:47:07 vmhost kernel: (da1:mrsas0:1:1:0): Periph destroyed
Jul 9 09:47:23 vmhost kernel: mrsas0: System PD created target ID: 0x1
Jul 9 09:47:23 vmhost kernel: da1 at mrsas0 bus 1 scbus17 target 1 lun 0
Jul 9 09:47:23 vmhost kernel: da1: Fixed Direct Access SPC-4 SCSI device
Jul 9 09:47:23 vmhost kernel: da1: Serial Number xxxxxxxxxxxxxxxxxx
Jul 9 09:47:23 vmhost kernel: da1: 150.000MB/s transfers
Jul 9 09:47:23 vmhost kernel: da1: 114473MB (234441648 512 byte sectors)
Jul 9 09:47:23 vmhost kernel: ses2: pass3,da1 in 'Drive Slot 1', SAS Slot: 2 phys at slot 1
Jul 9 09:47:23 vmhost kernel: ses2: phy 0: SATA device
Jul 9 09:47:23 vmhost kernel: ses2: phy 0: parent 500056b36d81e5ff addr 500056b36d81e5c1
Jul 9 09:47:23 vmhost kernel: ses2: phy 1: SAS device type 0 phy 0
Jul 9 09:47:23 vmhost kernel: ses2: phy 1: parent 0 addr 0
Jul 9 10:35:42 vmhost kernel: (da0:mrsas0:1:0:0): ATA COMMAND PASS THROUGH(16). CDB: 85 0d 06 00 01 00 01 00 00 00 00 00 00 40 06 00
Jul 9 10:35:42 vmhost kernel: (da0:mrsas0:1:0:0): CAM status: SCSI Status Error
Jul 9 10:35:42 vmhost kernel: (da0:mrsas0:1:0:0): SCSI status: OK
Jul 9 10:35:42 vmhost kernel: (da0:mrsas0:1:0:0): Invalidating pack
Jul 9 10:35:42 vmhost kernel: da0 at mrsas0 bus 1 scbus17 target 0 lun 0
Jul 9 10:35:42 vmhost kernel: da0: s/n xxxxxxxxxxxxxxxxxx detached
Jul 9 10:35:42 vmhost kernel: mrsas0:
Jul 9 10:35:42 vmhost kernel:
Jul 9 10:35:42 vmhost kernel: System PD deleted target ID: 0x0
Jul 9 10:35:42 vmhost kernel: (da0:mrsas0:1:0:0): Periph destroyed
Jul 9 10:35:57 vmhost kernel: mrsas0: System PD created target ID: 0x0
Jul 9 10:35:57 vmhost kernel: ses2: pass2,da0 in 'Drive Slot 0', SAS Slot: 2 phys at slot 0
Jul 9 10:35:57 vmhost kernel: ses2: phy 0: SATA device
Jul 9 10:35:57 vmhost kernel: ses2: phy 0: parent 500056b36d81e5ff addr 500056b36d81e5c0
Jul 9 10:35:57 vmhost kernel: ses2: phy 1: SAS device type 0 phy 0
Jul 9 10:35:57 vmhost kernel: ses2: phy 1: parent 0 addr 0
Jul 9 10:35:57 vmhost kernel: da0 at mrsas0 bus 1 scbus17 target 0 lun 0
Jul 9 10:35:57 vmhost kernel: da0: Fixed Direct Access SPC-4 SCSI device
Jul 9 10:35:57 vmhost kernel: da0: Serial Number xxxxxxxxxxxxxxxxxx
Jul 9 10:35:57 vmhost kernel: da0: 150.000MB/s transfers
Jul 9 10:35:57 vmhost kernel: da0: 114473MB (234441648 512 byte sectors)
Jul 9 14:49:31 vmhost kernel: (da1:mrsas0:1:1:0): ATA COMMAND PASS THROUGH(16). CDB: 85 0d 06 00 01 00 01 00 00 00 00 00 00 40 06 00
Jul 9 14:49:31 vmhost kernel: (da1:mrsas0:1:1:0): CAM status: SCSI Status Error
Jul 9 14:49:31 vmhost kernel: (da1:mrsas0:1:1:0): SCSI status: OK
Jul 9 14:49:31 vmhost kernel: (da1:mrsas0:1:1:0): Invalidating pack
Jul 9 14:49:31 vmhost kernel: da1 at mrsas0 bus 1 scbus17 target 1 lun 0
Jul 9 14:49:31 vmhost kernel: da1: s/n xxxxxxxxxxxxxxxxxx detached
Jul 9 14:49:32 vmhost kernel: mrsas0: System PD deleted target ID: 0x1
Jul 9 14:49:32 vmhost kernel: (da1:mrsas0:1:1:0): Periph destroyed
Jul 9 14:49:44 vmhost kernel: mrsas0: System PD created target ID: 0x1
Jul 9 14:49:45 vmhost kernel: ses2: pass3,da1 in 'Drive Slot 1', SAS Slot: 2 phys at slot 1
Jul 9 14:49:45 vmhost kernel: ses2: phy 0: SATA device
Jul 9 14:49:45 vmhost kernel: ses2: phy 0: parent 500056b36d81e5ff addr 500056b36d81e5c1
Jul 9 14:49:45 vmhost kernel: ses2: phy 1: SAS device type 0 phy 0
Jul 9 14:49:45 vmhost kernel: ses2: phy 1: parent 0 addr 0
Jul 9 14:49:45 vmhost kernel: da1 at mrsas0 bus 1 scbus17 target 1 lun 0
Jul 9 14:49:45 vmhost kernel: da1: Fixed Direct Access SPC-4 SCSI device
Jul 9 14:49:45 vmhost kernel: da1: Serial Number xxxxxxxxxxxxxxxxxx
Jul 9 14:49:45 vmhost kernel: da1: 150.000MB/s transfers
Jul 9 14:49:45 vmhost kernel: da1: 114473MB (234441648 512 byte sectors)
Jul 9 19:54:58 vmhost kernel: (da1:mrsas0:1:1:0): ATA COMMAND PASS THROUGH(16). CDB: 85 0d 06 00 01 00 01 00 00 00 00 00 00 40 06 00
Jul 9 19:54:58 vmhost kernel: (da1:mrsas0:1:1:0): CAM status: SCSI Status Error
Jul 9 19:54:58 vmhost kernel: (da1:mrsas0:1:1:0): SCSI status: OK
Jul 9 19:54:58 vmhost kernel: (da1:mrsas0:1:1:0): Invalidating pack
Jul 9 19:54:58 vmhost kernel: da1 at mrsas0 bus 1 scbus17 target 1 lun 0
Jul 9 19:54:58 vmhost kernel: da1: s/n xxxxxxxxxxxxxxxxxx detached
Jul 9 19:54:58 vmhost kernel: mrsas0:
Jul 9 19:54:58 vmhost kernel:
Jul 9 19:54:58 vmhost kernel: System PD deleted target ID: 0x1
Jul 9 19:54:58 vmhost kernel: (da1:mrsas0:1:1:0): Periph destroyed
Jul 9 19:55:13 vmhost kernel: mrsas0: System PD created target ID: 0x1
Jul 9 19:55:13 vmhost kernel: ses2: pass3,da1 in 'Drive Slot 1', SAS Slot: 2 phys at slot 1
Jul 9 19:55:13 vmhost kernel: ses2: phy 0: SATA device
Jul 9 19:55:13 vmhost kernel: ses2: phy 0: parent 500056b36d81e5ff addr 500056b36d81e5c1
Jul 9 19:55:13 vmhost kernel: ses2: phy 1: SAS device type 0 phy 0
Jul 9 19:55:13 vmhost kernel: ses2: phy 1: parent 0 addr 0
Jul 9 19:55:13 vmhost kernel: da1 at mrsas0 bus 1 scbus17 target 1 lun 0
Jul 9 19:55:13 vmhost kernel: da1: Fixed Direct Access SPC-4 SCSI device
Jul 9 19:55:13 vmhost kernel: da1: Serial Number xxxxxxxxxxxxxxxxxx
Jul 9 19:55:13 vmhost kernel: da1: 150.000MB/s transfers
Jul 9 19:55:13 vmhost kernel: da1: 114473MB (234441648 512 byte sectors)
Jul 9 21:49:13 vmhost kernel: (da1:mrsas0:1:1:0): WRITE(10). CDB: 2a 00 07 19 f9 78 00 00 08 00
Jul 9 21:49:13 vmhost kernel: (da1:mrsas0:1:1:0): CAM status: SCSI Status Error
Jul 9 21:49:13 vmhost kernel: (da1:mrsas0:1:1:0): SCSI status: OK
Jul 10 04:01:18 vmhost kernel: (da1:mrsas0:1:1:0): WRITE(10). CDB: 2a 00 06 0f 73 88 00 00 08 00
Jul 10 04:01:18 vmhost kernel: (da1:mrsas0:1:1:0): CAM status: SCSI Status Error
Jul 10 04:01:18 vmhost kernel: (da1:mrsas0:1:1:0): SCSI status: OK
Jul 10 12:58:21 vmhost kernel: (da1:mrsas0:1:1:0): WRITE(10). CDB: 2a 00 06 15 ec 28 00 00 08 00
Jul 10 12:58:21 vmhost kernel: (da1:mrsas0:1:1:0): CAM status: SCSI Status Error
Jul 10 12:58:21 vmhost kernel: (da1:mrsas0:1:1:0): SCSI status: OK
Jul 10 12:58:21 vmhost kernel: (da1:mrsas0:1:1:0): Invalidating pack
Jul 10 12:58:21 vmhost kernel: da1 at mrsas0 bus 1 scbus17 target 1 lun 0
Jul 10 12:58:21 vmhost kernel: da1: s/n xxxxxxxxxxxxxxxxxx detached
Jul 10 12:58:21 vmhost kernel: mrsas0: System PD deleted target ID: 0x1
Jul 10 12:58:21 vmhost kernel: (da1:mrsas0:1:1:0): Periph destroyed
Jul 10 12:58:41 vmhost kernel: mrsas0: System PD created target ID: 0x1
Jul 10 12:58:41 vmhost kernel: ses2: pass3,da1 in 'Drive Slot 1', SAS Slot: 2 phys at slot 1
Jul 10 12:58:41 vmhost kernel: ses2: phy 0: SATA device
Jul 10 12:58:41 vmhost kernel: ses2: phy 0: parent 500056b36d81e5ff addr 500056b36d81e5c1
Jul 10 12:58:41 vmhost kernel: ses2: phy 1: SAS device type 0 phy 0
Jul 10 12:58:41 vmhost kernel: ses2: phy 1: parent 0 addr 0
Jul 10 12:58:41 vmhost kernel: da1 at mrsas0 bus 1 scbus17 target 1 lun 0
Jul 10 12:58:41 vmhost kernel: da1: Fixed Direct Access SPC-4 SCSI device
Jul 10 12:58:41 vmhost kernel: da1: Serial Number xxxxxxxxxxxxxxxxxx
Jul 10 12:58:41 vmhost kernel: da1: 150.000MB/s transfers
Jul 10 12:58:41 vmhost kernel: da1: 114473MB (234441648 512 byte sectors)

Thanks
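
For anyone reading the kernel messages above, the two kinds of CDB that fail can be decoded by hand. Below is a minimal sketch in Python, assuming the standard SBC layout for WRITE(10) and the SAT layout for ATA PASS-THROUGH(16). The helper names (`parse_cdb`, `decode_write10`, `decode_ata_pass_through16`) are made up for illustration and are not part of camcontrol or any FreeBSD tool.

```python
# Sketch only: helper names are hypothetical, not a FreeBSD or CAM API.


def parse_cdb(text):
    """Convert a hex dump such as '2a 00 06 15' into a bytes object."""
    return bytes.fromhex(text)


def decode_write10(cdb):
    """Decode a WRITE(10) CDB (opcode 0x2a), assuming the SBC layout."""
    if len(cdb) != 10 or cdb[0] != 0x2A:
        raise ValueError("not a WRITE(10) CDB")
    return {
        "lba": int.from_bytes(cdb[2:6], "big"),     # bytes 2-5, big-endian
        "blocks": int.from_bytes(cdb[7:9], "big"),  # bytes 7-8, big-endian
    }


def decode_ata_pass_through16(cdb):
    """Decode an ATA PASS-THROUGH(16) CDB (opcode 0x85), assuming the SAT layout."""
    if len(cdb) != 16 or cdb[0] != 0x85:
        raise ValueError("not an ATA PASS-THROUGH(16) CDB")
    return {
        "protocol": (cdb[1] >> 1) & 0x0F,        # bits 4-1 of byte 1
        "extend": cdb[1] & 0x01,                 # 1 means a 48-bit command
        "features": (cdb[3] << 8) | cdb[4],      # bytes 3-4
        "sector_count": (cdb[5] << 8) | cdb[6],  # bytes 5-6
        "device": cdb[13],
        "command": cdb[14],                      # ATA command opcode
    }


if __name__ == "__main__":
    # The CDBs below are copied from the kernel messages in this report.
    print(decode_write10(parse_cdb("2a 00 06 15 ec 28 00 00 08 00")))
    print(decode_ata_pass_through16(
        parse_cdb("85 0d 06 00 01 00 01 00 00 00 00 00 00 40 06 00")))
```

Under those assumptions, the WRITE(10) from the summary is an 8-block write at LBA 0x0615ec28, and the pass-through CDB carries ATA command 0x06 with features 0x0001, which the ATA command set defines as DATA SET MANAGEMENT with the TRIM bit set. This is only a reading aid for the log, not a diagnosis.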