Issue with hast replication
Phil Regnauld
regnauld at x0.dk
Tue Mar 13 21:19:31 UTC 2012
Mikolaj Golub (to.my.trociny) writes:
>
> Ok. So it is send(2). I suppose the network driver could generate the
> error. Did you tell what network adaptor you had?
Not yet.
bce0: <HP NC382i DP Multifunction Gigabit Server Adapter (C0)> mem 0xf4000000-0xf5ffffff irq 16 at device 0.0 on pci2
bce0: ASIC (0x57092003); Rev (C0); Bus (PCIe x2, 2.5Gbps); B/C (4.6.4); Bufs (RX:2;TX:2;PG:8); Flags (SPLT|MSI|MFW); MFW (NCSI 1.0.3)
> PR> No obvious errors there either, but again what should I look out for ?
>
> I would look at sysctl -a dev.<nic> statistics and try to find if there is correlation
> between ENOMEM failures and growing of error counters.
0 errors:
dev.bce.0.l2fhdr_error_count: 0
dev.bce.0.stat_emac_tx_stat_dot3statsinternalmactransmiterrors: 0
dev.bce.0.stat_Dot3StatsCarrierSenseErrors: 0
dev.bce.0.stat_Dot3StatsFCSErrors: 0
dev.bce.0.stat_Dot3StatsAlignmentErrors: 0
> Looking at buffer usage from 'netstat -nax' output ran during synchronization
> (on both hosts) could provide useful info where the bottleneck is. top -HS
> output might be useful too.
Good point.
I'll have to attempt to recreate the problem, as the volume has replicated
without errors. Typical.
Cheers,
Phil
More information about the freebsd-stable
mailing list