smart(8) Call for Testing
Charles Sprickman
spork at bway.net
Thu Mar 29 17:38:02 UTC 2018
--
Charles Sprickman
NetEng/SysAdmin
Bway.net - New York's Best Internet www.bway.net
spork at bway.net - 212.982.9800
> On Mar 29, 2018, at 9:43 AM, Lev Serebryakov <lev at freebsd.org> wrote:
>
> On 29.03.2018 16:27, Chuck Tuffli wrote:
>
>>>> Maybe one of the vendors who sells FreeBSD as part of an appliance has shown some interest in this?
>>>>
>>>> If you’re hardware is well-defined and thus the output is consistent, I could imagine it’s not too difficult to parse this.
>>> smartd is very important part of smartmontools, smartctl is not so.
>>>
>>> And periodic self-test triggering & check is most important feature of
>>> smartd, IMHO.
>>>
>>> Modern HDDs are liers in SMART. And only regular self-test discover
>>> real errors on surfaces in my experience.
>>>
>>> So, tool without support for HDD self-tests is of little usage for
>>> appliances, IMHO.
>
>> Thank you for the feedback! As I don't have any experience with
>> smartd, can you help me better understand which parts of it are most
>> useful to you? Is it just periodically triggering the self test or
>> are there other features as well? For example, logging the SMART
>> values, emailing / triggering notifications when certain criteria are
>> met, monitoring the self tests, reading the error logs, etc.?
>
> Triggering of short and full self-tests and alerting (via e-mail) when
> test failed.
>
> Monitoring of values and alerting is VERY important (number of
> Relocations is main indicator of spinning HDD health and when it raises
> it must be known ASAP), but it could be implemented with simple smart(8)
> utility and some scripting, so this is not problem.
It would be nice if we could grab these values via snmp… Right now I use
either nrpe or check_by_ssh in nagios to run scripts to parse smartctl output
and it would be weird to have SMART functions in base but not have that
tied to the stock snmpd.
> But all my dead HDDs were replaced on self-test fail — it is what
> allows me to replace them BEFORE data were lost.
Yep, lots of folks claim the data is useless, but generally I see some signs of
failure before the drive dies, and sometimes those signs are spotted because
smartd is triggering regular self-tests. And on SSDs, watching the MWI seems
to work very well - these drives are much smarter (no pun intended) than spinny
disks.
Charles
>
> --
> // Lev Serebryakov
More information about the freebsd-fs
mailing list