[lustre-discuss] strange errors on Lustre servers

Lixin Liu liu at sfu.ca
Sat Aug 11 18:04:14 PDT 2018


Hi,

I am getting these errors on all our MDS and OSS servers (Lustre 2.10.1):

Aug 11 11:45:52 ndc-oss5b kernel: LNet: 24727:0:(o2iblnd_cb.c:2410:kiblnd_passive_connect()) Conn stale 172.19.142.119 at o2ib version 12/12 incarnation 1533927051163335/1533998625080752
Aug 11 11:55:52 ndc-oss5b kernel: LNet: 105990:0:(o2iblnd_cb.c:2410:kiblnd_passive_connect()) Conn stale 172.19.142.119 at o2ib version 12/12 incarnation 1533927051163335/1533998625080752
Aug 11 12:05:52 ndc-oss5b kernel: LNet: 105990:0:(o2iblnd_cb.c:2410:kiblnd_passive_connect()) Conn stale 172.19.142.119 at o2ib version 12/12 incarnation 1533927051163335/1533998625080752
Aug 11 12:15:52 ndc-oss5b kernel: LNet: 105990:0:(o2iblnd_cb.c:2410:kiblnd_passive_connect()) Conn stale 172.19.142.119 at o2ib version 12/12 incarnation 1533927051163335/1533998625080752
Aug 11 12:25:52 ndc-oss5b kernel: LNet: 105990:0:(o2iblnd_cb.c:2410:kiblnd_passive_connect()) Conn stale 172.19.142.119 at o2ib version 12/12 incarnation 1533927051163335/1533998625080752

This is a new node we brought online recently. Is it an indication that we have problem with
it OPA interface on the node? This machine has a 8160F CPU (OPA interface on chip).

Thanks,

Lixin Liu
High Performance Computing
Simon Fraser University

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.lustre.org/pipermail/lustre-discuss-lustre.org/attachments/20180812/bf92af9b/attachment.html>


More information about the lustre-discuss mailing list