Re: WD Blue 510 SSD and strange write performance

From: Karl Denninger <karl_at_denninger.net>
Date: Fri, 15 Mar 2024 19:09:56 UTC
I've got both the Kingston and Micron versions of these in production 
use and have seen nothing like this at all; they get hit pretty hard 
too, including release building (both direct and cross-builds) and 
similar stuff.

On 3/15/2024 2:17 PM, mike tancsa wrote:
> On 3/14/2024 4:58 PM, Frank Leonhardt wrote:
>> Sorry - not that deeply into modern SSD (never written a driver for 
>> one), but based on my understanding your TRIM theory makes sense to 
>> me. I'd try turning it off. It does seem to be an ongoing source of 
>> snafus.
>>
>> I did use WD Blue SSDs but I suspect they vary quite a bit. I've had 
>> rather too many early failures. I wouldn't use them in production but 
>> okay for Windoze. We all know deep down there's a reason the 
>> enterprise SSDs cost what they do :-)
>>
>> I'll keep thinking
>
> Thanks for the input. I think these drives are just kinda broken :( I 
> noticed we had the 2TB versions of this line, but they seem rather 
> different and I am not able to trigger the errors with them 
> thankfully.   Even stranger, I have a 1TB version of this drive I 
> bought from a while back that has the same firmware, but does NOT have 
> this issue. However, the output of the identifier is slightly 
> different.  Who knows, it could be some component WD uses that *should 
> be* the same but is not and is causing some subtle pathology.
>
>
> I tried turning off NCQ on the controller and it didnt seem to make a 
> difference. Then I turned off autotrim and did a manual trim of the 
> pool, then did the tests and same sorts of errors.  I think I am just 
> gonna cut my losses with these disks :(   Even if I figured out some 
> work around at this point, I would not deploy them into production.  I 
> doubt I will be able to get anywhere with WD. Farewell my 400 bucks :(
>
> (da6:mpr0:0:16:0): READ(10). CDB: 28 00 6d e0 ae 28 00 00 08 00
> (da6:mpr0:0:16:0): CAM status: CCB request completed with an error
> (da6:mpr0:0:16:0): Retrying command, 3 more tries remain
> (da6:mpr0:0:16:0): WRITE(10). CDB: 2a 00 0c cb 3f 00 00 00 e8 00
> (da6:mpr0:0:16:0): CAM status: CCB request completed with an error
> (da6:mpr0:0:16:0): Retrying command, 3 more tries remain
> (da6:mpr0:0:16:0): READ(10). CDB: 28 00 6d e0 ad 28 00 01 00 00
> (da6:mpr0:0:16:0): CAM status: CCB request completed with an error
> (da6:mpr0:0:16:0): Retrying command, 3 more tries remain
> (da6:mpr0:0:16:0): READ(10). CDB: 28 00 6d e0 ac 28 00 00 f8 00
> (da6:mpr0:0:16:0): CAM status: CCB request completed with an error
> (da6:mpr0:0:16:0): Retrying command, 3 more tries remain
> (da6:mpr0:0:16:0): WRITE(10). CDB: 2a 00 40 07 df 88 00 01 00 00
> (da6:mpr0:0:16:0): CAM status: CCB request completed with an error
> (da6:mpr0:0:16:0): Retrying command, 3 more tries remain
> (da6:mpr0:0:16:0): WRITE(10). CDB: 2a 00 3f 48 72 08 00 01 00 00
> (da6:mpr0:0:16:0): CAM status: SCSI Status Error
> (da6:mpr0:0:16:0): SCSI status: Check Condition
> (da6:mpr0:0:16:0): SCSI sense: UNIT ATTENTION asc:29,0 (Power on, 
> reset, or bus device reset occurred)
> (da6:mpr0:0:16:0): Retrying command (per sense data)
> mpr0: Controller reported scsi ioc terminated tgt 15 SMID 2036 loginfo 
> 31110f00
> mpr0: Controller reported scsi ioc terminated tgt 15 SMID 637 loginfo 
> 31110f00
> (da5:mpr0:0:15:0): WRITE(10). CDB: 2a 00 41 98 42 00 00 01 00 00
> mpr0: Controller reported scsi ioc terminated tgt 15 SMID 1242 loginfo 
> 31110f00
> mpr0: Controller reported scsi ioc terminated tgt 15 SMID 979 loginfo 
> 31110f00
> mpr0: Controller reported scsi ioc terminated tgt 15 SMID 1243 loginfo 
> 31110f00
> mpr0: Controller reported scsi ioc terminated tgt 15 SMID 2091 loginfo 
> 31110f00
> mpr0: Controller reported scsi ioc terminated tgt 15 SMID 1612 loginfo 
> 31110f00
> mpr0: Controller reported scsi ioc terminated tgt 15 SMID 2093 loginfo 
> 31110f00
> mpr0: Controller reported scsi ioc terminated tgt 15 SMID 152 loginfo 
> 31110f00
> mpr0: Controller reported scsi ioc terminated tgt 15 SMID 2132 loginfo 
> 31110f00
> (da5:mpr0:0:15:0): CAM status: CCB request completed with an error
> (da5:mpr0:0:15:0): Retrying command, 3 more tries remain
> (da5:mpr0:0:15:0): WRITE(10). CDB: 2a 00 43 17 dc 88 00 01 00 00
> (da5:mpr0:0:15:0): CAM status: CCB request completed with an error
> (da5:mpr0:0:15:0): Retrying command, 3 more tries remain
> (da5:mpr0:0:15:0): WRITE(10). CDB: 2a 00 41 98 43 00 00 00 50 00
> (da5:mpr0:0:15:0): CAM status: CCB request completed with an error
> (da5:mpr0:0:15:0): Retrying command, 3 more tries remain
> (da5:mpr0:0:15:0): WRITE(10). CDB: 2a 00 0c d4 f6 80 00 00 68 00
> (da5:mpr0:0:15:0): CAM status: CCB request completed with an error
> (da5:mpr0:0:15:0): Retrying command, 3 more tries remain
> (da5:mpr0:0:15:0): WRITE(10). CDB: 2a 00 0c d4 f5 80 00 01 00 00
> (da5:mpr0:0:15:0): CAM status: CCB request completed with an error
> (da5:mpr0:0:15:0): Retrying command, 3 more tries remain
> (da5:mpr0:0:15:0): READ(10). CDB: 28 00 05 dc 12 28 00 00 f8 00
> (da5:mpr0:0:15:0): CAM status: CCB request completed with an error
> (da5:mpr0:0:15:0): Retrying command, 3 more tries remain
> (da5:mpr0:0:15:0): READ(10). CDB: 28 00 05 dc 0f b0 00 00 88 00
> (da5:mpr0:0:15:0): CAM status: CCB request completed with an error
> (da5:mpr0:0:15:0): Retrying command, 3 more tries remain
> (da5:mpr0:0:15:0): WRITE(10). CDB: 2a 00 02 96 7e 80 00 00 10 00
> (da5:mpr0:0:15:0): CAM status: CCB request completed with an error
> (da5:mpr0:0:15:0): Retrying command, 3 more tries remain
> (da5:mpr0:0:15:0): READ(10). CDB: 28 00 6f 5b 8d 68 00 01 00 00
> (da5:mpr0:0:15:0): CAM status: CCB request completed with an error
> (da5:mpr0:0:15:0): Retrying command, 3 more tries remain
> (da5:mpr0:0:15:0): WRITE(10). CDB: 2a 00 41 98 42 00 00 01 00 00
> (da5:mpr0:0:15:0): CAM status: SCSI Status Error
> (da5:mpr0:0:15:0): SCSI status: Check Condition
> (da5:mpr0:0:15:0): SCSI sense: UNIT ATTENTION asc:29,0 (Power on, 
> reset, or bus device reset occurred)
> (da5:mpr0:0:15:0): Retrying command (per sense data)
>
>
>
>
-- 
-- Karl Denninger
/The Market-Ticker/
S/MIME Email accepted and preferred