[Lustre-discuss] network failover with IB+eth?
Erich Focht
efocht at hpce.nec.com
Wed Apr 9 09:46:06 PDT 2008
Hi Cliff,
thanks for the answer. I guess I had the wrong picture of how this works.
I will get the system to fail over when a link breaks.
Regards,
Erich
On Tuesday 08 April 2008, Cliff White wrote:
> Erich Focht wrote:
> > Hello,
> >
> > on a setup with o2ib and ethernet configured on both the Lustre servers and
> > the clients, I'd expect that unplugging the InfiniBand cable on one of the
> > OSSes would lead the client to switch over to ethernet and continue I/O.
>
> No, unfortunately that's not how multiple interfaces work with LNET.
> When multiple interfaces are present at connection setup we pick the
> 'best' route.
> Once we establish a connection, we expect that connection to continue.
> Connections do not fail over if multiple interfaces are present.
>
> > Unfortunately this doesn't happen, the client I/O stalls and continues
> > only after the IB cable is plugged back.
>
> Yup, that's expected behaviour.
> >
> > Is there anything wrong with the setup? It uses pairwise failover servers,
> > so maybe that's part of the problem? Is the order of the failnode arguments
> > correct?
>
> The setup appears to be correct. All the failnode does is complicate the
> situation slightly, as the failnode is tried first instead of the
> connection just failing right away. You have a list of failover connections
> for each network type, and LNET will try the failover only on the common
> network. So a tcp connection would first retry the tcp address and, as you
> show, the IB side attempts its retry on the IB failnode.
> cliffw
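
[Editorial sketch] The behaviour Cliff describes above can be modelled in a few lines of Python. This is a toy illustration under stated assumptions, not LNET code: the `Nid` class, `NET_PREFERENCE`, `pick_network` and `retry_candidates` are hypothetical names, and the preference order (o2ib before tcp) is assumed. The NIDs are the MGS addresses from the original message.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Nid:
    """A network identifier of the form address@network (hypothetical helper)."""
    addr: str
    net: str

    @classmethod
    def parse(cls, text):
        addr, net = text.split("@", 1)
        return cls(addr, net)


# Assumed preference order: the "best" network is tried first.
NET_PREFERENCE = ["o2ib", "tcp"]


def pick_network(local_nets, server_nids):
    """Choose one network at connection setup: the best one both sides share."""
    for net in NET_PREFERENCE:
        if net in local_nets and any(n.net == net for n in server_nids):
            return net
    raise LookupError("no common network")


def retry_candidates(chosen_net, primary_nids, failover_nids):
    """After a failure, only NIDs on the already chosen network are retried.

    NIDs on other networks are never considered, which is why pulling the
    IB cable does not move the connection to ethernet.
    """
    return [n for n in primary_nids + failover_nids if n.net == chosen_net]


primary = [Nid.parse("10.3.0.226@o2ib"), Nid.parse("192.168.130.226@tcp")]
failover = [Nid.parse("10.3.0.227@o2ib"), Nid.parse("192.168.130.227@tcp")]

net = pick_network({"o2ib", "tcp"}, primary + failover)
for nid in retry_candidates(net, primary, failover):
    print(f"{nid.addr}@{nid.net}")
```

In this model the tcp addresses are never returned once o2ib has been chosen, which matches the stall seen when the IB link goes down: the client only ever retries the IB address of the failover partner.
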
> >
> > Here's what we have: (sorry for the many details...)
> >
> > MGS/MGT are mounted on the same node:
> > Target: MGS
> > Index: unassigned
> > Lustre FS: lustre
> > Mount type: ldiskfs
> > Flags: 0x174 (MGS needs_index first_time update writeconf )
> > Persistent mount opts: errors=remount-ro,iopen_nopriv,user_xattr
> > Parameters:
> > failover.node=10.3.0.227@o2ib,192.168.130.227@tcp,10.3.0.226@o2ib,192.168.130.226@tcp
> > mgsnode=10.3.0.227@o2ib,192.168.130.227@tcp,10.3.0.226@o2ib,192.168.130.226@tcp
> >
> > Target: lustre-MDT0000
> > Index: 0
> > Lustre FS: lustre
> > Mount type: ldiskfs
> > Flags: 0x1 (MDT )
> > Persistent mount opts: errors=remount-ro,iopen_nopriv,user_xattr
> > Parameters:
> > mgsnode=10.3.0.226@o2ib,192.168.130.226@tcp,10.3.0.227@o2ib,192.168.130.227@tcp
> > failover.node=10.3.0.227@o2ib,192.168.130.227@tcp
> > mdt.group_upcall=/usr/sbin/l_getgroups
> >
> > OST: parameters were rewritten with tunefs.lustre:
> > tunefs.lustre --ost --erase-param \
> >   --mgsnode=10.3.0.226@o2ib0,192.168.130.226@tcp0:10.3.0.227@o2ib0,192.168.130.227@tcp0 \
> >   --failnode=10.3.0.229@o2ib0,192.168.130.229@tcp0 --writeconf \
> >   /dev/mpath/ost100
> >
> >
> > Client notices the failed OST path:
> > # lfs check servers
> > lustre-MDT0000-mdc-ffff810007107000 active.
> > error: check 'lustre-OST0000-osc-ffff810007107000': Connection timed out (110)
> >
> > but tries to connect to the failover OSS partner instead of trying the
> > other network:
> > netptune121: LustreError: 11-0: an error occurred while communicating with 10.3.0.229@o2ib. The ost_connect operation failed with -19
> > doss2: LustreError: 137-5: UUID 'lustre-OST0000_UUID' is not available for connect (no target)
> >
> > Thanks in advance for any hint...
> >
> > Best regards,
> > Erich