ib1: timing out; N sends not completed

Julian Stecklina jsteckli at os.inf.tu-dresden.de
Mon Jun 10 17:05:21 UTC 2013


On 06/10/2013 06:03 PM, Anthony Cornehl wrote:
> 
> On Jun 10, 2013 6:41 AM, "Julian Stecklina"
> <jsteckli at os.inf.tu-dresden.de>
> wrote:
>>
>> Hello,
>>
>> I have two machines connected back-to-back via InfiniBand with Mellanox
>> InfiniHost III adapters. One machine runs Linux (Fedora 19) and the
>> other FreeBSD 9-STABLE.
>>
>> I sometimes get:
>>
>> ib1: timing out; 47 sends not completed
>> ib1: timing out; 1 sends not completed
>> ib1: timing out; 56 sends not completed
>>
>> or similar, and TCP connections stall for a while after each timeout.
>> It is relatively easy to reproduce this behavior with NetPIPE.
>>
>> Any advice?
>>
>> Julian
>>
> 
> Hey Julian,
> 
> Just some questions to try and clarify the issue...
> 
> - which machine is the OpenSM master running on?

The Linux box: opensm-3.3.15

> - what does your qkey violation count look like when you run a portinfo
> on the ports?

Is there a way to do this from the command line? I can only find the
corresponding C function.


> - does the issue persist when a switch is added between the hosts?

I can't tell you, because I don't have one available right now.

Julian



