[lustre-discuss] strange errors on Lustre servers

Zeeshan Ali Shah javaclinic at gmail.com
Sat Aug 11 23:14:05 PDT 2018


What is output of opainfo ?

Sent from my iPhone

> On 12 Aug 2018, at 04:04, Lixin Liu <liu at sfu.ca> wrote:
> 
> Hi,
>  
> I am getting these errors on all our MDS and OSS servers (Lustre 2.10.1):
>  
> Aug 11 11:45:52 ndc-oss5b kernel: LNet: 24727:0:(o2iblnd_cb.c:2410:kiblnd_passive_connect()) Conn stale 172.19.142.119 at o2ib version 12/12 incarnation 1533927051163335/1533998625080752
> Aug 11 11:55:52 ndc-oss5b kernel: LNet: 105990:0:(o2iblnd_cb.c:2410:kiblnd_passive_connect()) Conn stale 172.19.142.119 at o2ib version 12/12 incarnation 1533927051163335/1533998625080752
> Aug 11 12:05:52 ndc-oss5b kernel: LNet: 105990:0:(o2iblnd_cb.c:2410:kiblnd_passive_connect()) Conn stale 172.19.142.119 at o2ib version 12/12 incarnation 1533927051163335/1533998625080752
> Aug 11 12:15:52 ndc-oss5b kernel: LNet: 105990:0:(o2iblnd_cb.c:2410:kiblnd_passive_connect()) Conn stale 172.19.142.119 at o2ib version 12/12 incarnation 1533927051163335/1533998625080752
> Aug 11 12:25:52 ndc-oss5b kernel: LNet: 105990:0:(o2iblnd_cb.c:2410:kiblnd_passive_connect()) Conn stale 172.19.142.119 at o2ib version 12/12 incarnation 1533927051163335/1533998625080752
>  
> This is a new node we brought online recently. Is it an indication that we have problem with
> it OPA interface on the node? This machine has a 8160F CPU (OPA interface on chip).
>  
> Thanks,
>  
> Lixin Liu
> High Performance Computing
> Simon Fraser University
>  
> _______________________________________________
> lustre-discuss mailing list
> lustre-discuss at lists.lustre.org
> http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.lustre.org/pipermail/lustre-discuss-lustre.org/attachments/20180812/2ed6f38e/attachment.html>


More information about the lustre-discuss mailing list