[lustre-discuss] mounting over ipoib via opa (was: mount issue and ecmp?)

Amir Shehata amir.shehata.whamcloud at gmail.com
Mon Feb 11 12:15:17 PST 2019


If your routers have multiple OPA/MLX interfaces, we found that Linux
routing can return the wrong HW address for an interface, which causes
address resolution errors.

You can try the following linux routing config to see if it helps:
https://wiki.whamcloud.com/display/LNet/MR+Cluster+Setup
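
For a rough idea of what that kind of config involves, here is a minimal
sketch, not a copy of the wiki page. It assumes the usual multi-interface
approach: tighten the ARP sysctls per interface so each one only answers
for its own address, then add a per-interface routing table plus a
source-based rule. The interface names, addresses, and table IDs are
hypothetical placeholders, and the helper only builds the command strings
without running anything. Check the page above for the exact settings.

```python
from ipaddress import ip_interface


def policy_routing_commands(interfaces, first_table_id=200):
    """Build ARP and source-routing commands for each interface.

    interfaces: list of (ifname, "address/prefix") pairs. These are
    hypothetical examples, e.g. ("ib0", "192.168.1.2/16").
    Returns a list of shell command strings; nothing is executed.
    """
    # Relax reverse-path filtering globally so replies may leave on
    # the interface chosen by the source-based rules below.
    cmds = ["sysctl -w net.ipv4.conf.all.rp_filter=0"]
    for offset, (ifname, cidr) in enumerate(interfaces):
        iface = ip_interface(cidr)
        table_id = first_table_id + offset  # numeric routing table per interface
        cmds += [
            # Answer ARP only for addresses configured on this interface.
            f"sysctl -w net.ipv4.conf.{ifname}.arp_ignore=1",
            f"sysctl -w net.ipv4.conf.{ifname}.arp_filter=0",
            # Prefer the best local source address in outgoing ARP requests.
            f"sysctl -w net.ipv4.conf.{ifname}.arp_announce=2",
            f"sysctl -w net.ipv4.conf.{ifname}.rp_filter=0",
            # Drop stale neighbour entries learned before the change.
            f"ip neigh flush dev {ifname}",
            # Route the local subnet out of this interface in its own table.
            f"ip route add {iface.network} dev {ifname} proto kernel "
            f"scope link src {iface.ip} table {table_id}",
            # Traffic sourced from this address uses that table.
            f"ip rule add from {iface.ip} table {table_id}",
        ]
    cmds.append("ip route flush cache")
    return cmds


if __name__ == "__main__":
    # Hypothetical router with two interfaces on the same subnet.
    for cmd in policy_routing_commands(
        [("ib0", "192.168.1.2/16"), ("ib1", "192.168.1.3/16")]
    ):
        print(cmd)
```

Review the printed commands before running them as root on a router; they
are not persistent across reboots.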

On Mon, 11 Feb 2019 at 12:04, Michael Di Domenico <mdidomenico4 at gmail.com>
wrote:

> i've narrowed down that my issue seems to stem from running over ipoib
> on an opa network
>
> i managed to pull all the routing and other things around so the only
> difference was whether i rode the ipoib or not
>
> when i mount via ethernet, it works fine
>
> when i try the same mount via ipoib running on top of opa it gets
> "input/output error".  i can however lctl ping the storage and i see
> connections from the client to the MGS.  so some of the connectivity
> is working, but it's breaking down somewhere else
>
> is anyone else running over ipoib on an opa network?  if so, do you
> have lnet routing?
>
> some particulars
>
> rhel 7.6 clients
> 2.10.5 clients
> 2.5.x lustre servers (cray)
> lnet routing between storage and other networks
> currently running tcp ethernet, qdr infinipath, and fdr10 mellanox to
> the storage through routers
> no other machines are having mount issues
>
>
> On Fri, Feb 8, 2019 at 9:33 AM Michael Di Domenico
> <mdidomenico4 at gmail.com> wrote:
> > poking at this further, it doesn't look like it's an ECMP issue.
> >
> > Are there any known reports of issues when running Lustre over ipoib
> > over an opa fabric?  seems a stretch, but it's the only difference in
> > the network at this point.
> >
> > can anyone suggest somewhere to look for more debug info?
> > /var/log/messages and dmesg don't reveal much info
> >
> > On Mon, Feb 4, 2019 at 9:19 AM Michael Di Domenico
> > <mdidomenico4 at gmail.com> wrote:
> > >
> > > Has anyone heard of lustre having trouble mounting when ECMP is used
> > > on the compute nodes' default gateway?
> > >
> > > I'm trying to mount an existing lustre filesystem on a new cluster,
> > > where the connections ride over OPA IPoIB, which is then converted to
> > > 10ge via four routers.  I'm using ECMP to distribute the packets over
> > > the four routers.
> > >
> > > I can mount lustre on other ethernet clients, but not the ones behind
> > > my ECMP gateways.  Changing the compute node gateway from ECMP to a
> > > single device doesn't change anything.  I'm not easily able to revert
> > > the network side from ECMP to a single route, so i haven't tried that.
> > >
> > > The output i get from mount is, "failed: Input/output error retries left: 0"
> > >
> > > syslog on the client and the MGS seem to show that the connection is
> > > being broken between the MGS and client during the mount with a "timed
> > > out for slow reply" message.