HAST - detect failure and restore avoiding an outage?

Pawel Jakub Dawidek pjd at FreeBSD.org
Sat Feb 23 20:49:55 UTC 2013


On Fri, Feb 22, 2013 at 12:00:43AM +0200, Mikolaj Golub wrote:
> It looks like logs are currently the only way to detect errors from the
> hastd side. Here is a patch that adds local i/o error statistics,
> accessible via hastctl:
> 
> http://people.freebsd.org/~trociny/hast.stat_error.1.patch
> 
> hastctl output:
> 
>   role: secondary
>   provname: test
>   localpath: /dev/md102
>   extentsize: 2097152 (2.0MB)
>   keepdirty: 0
>   remoteaddr: kopusha:7771
>   replication: memsync
>   status: complete
>   dirty: 0 (0B)
>   statistics:
>     reads: 0
>     writes: 366
>     deletes: 0
>     flushes: 0
>     activemap updates: 0
>     local i/o errors: 269
> 
> Pawel, what do you think about this patch?


I'm fine with the patch except for the missing breaks in the switch added
to hastd/primary.c.
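
To illustrate why the breaks matter, here is a minimal sketch of a
per-type error counter. The enum, struct and function names are
hypothetical and are not taken from the patch or from hastd/primary.c.
Without a break after each case, control falls through and a single
read error would increment every counter below it as well.

```c
#include <stdint.h>

/* Hypothetical request types; not the identifiers used in hastd. */
enum io_cmd { IO_READ, IO_WRITE, IO_DELETE, IO_FLUSH };

/* Hypothetical per-type local i/o error counters. */
struct io_errors {
	uint64_t read_err;
	uint64_t write_err;
	uint64_t delete_err;
	uint64_t flush_err;
};

/* Count one failed local request of the given type. */
static void
count_error(struct io_errors *st, enum io_cmd cmd)
{

	switch (cmd) {
	case IO_READ:
		st->read_err++;
		break;	/* Omitting this would also bump write_err. */
	case IO_WRITE:
		st->write_err++;
		break;
	case IO_DELETE:
		st->delete_err++;
		break;
	case IO_FLUSH:
		st->flush_err++;
		break;
	}
}
```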

I'm also wondering... You count all those errors separately just to
print them as one number. If we already do that, let's print them
separately, e.g.:

	local i/o errors: read(0), write(3), delete(5), flush(9)
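
Producing that line could look roughly like the sketch below. It assumes
four separate 64-bit counters kept in a hypothetical struct; the struct
and function names are illustrative only and are not what hastctl
actually uses.

```c
#include <inttypes.h>
#include <stdint.h>
#include <stdio.h>

/* Hypothetical per-type local i/o error counters. */
struct io_errors {
	uint64_t read_err;
	uint64_t write_err;
	uint64_t delete_err;
	uint64_t flush_err;
};

/*
 * Format the counters as a single status line into buf.
 * Returns what snprintf() returns: the length the full line needs,
 * which exceeds size - 1 if the output was truncated.
 */
static int
format_io_errors(char *buf, size_t size, const struct io_errors *st)
{

	return (snprintf(buf, size,
	    "local i/o errors: read(%" PRIu64 "), write(%" PRIu64
	    "), delete(%" PRIu64 "), flush(%" PRIu64 ")",
	    st->read_err, st->write_err, st->delete_err, st->flush_err));
}
```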

BTW, why not count activemap update errors as write and flush errors?

-- 
Pawel Jakub Dawidek                       http://www.wheelsystems.com
FreeBSD committer                         http://www.FreeBSD.org
Am I Evil? Yes, I Am!                     http://tupytaj.pl