[lustre-discuss] MGS failover problem

Mohr Jr, Richard Frank (Rick Mohr) rmohr at utk.edu
Wed Jan 11 11:17:19 PST 2017

> On Jan 11, 2017, at 12:39 PM, Vicker, Darby (JSC-EG311) <darby.vicker-1 at nasa.gov> wrote:
>>> Getting failover right over multiple separate networks can be a real
>>> hair-pulling experience.
>> Darby: Do you have the option of (at least temporarily) running the file system with only Infiniband configured?  If you could set up the file system to only use Infiniband, then >that would eliminate any complications from having two fabrics active at the same time.  Then you could see if the problem still persists.
> Yes, but is there any reason why you are choosing IB over Ethernet?  I think I'd prefer to try over the Ethernet is we are going to pick one.  

I just figured that if you had Infiniband, then you would prefer to run with the higher performance interconnect.  But you can try ethernet just as well.

Rick Mohr
Senior HPC System Administrator
National Institute for Computational Sciences

More information about the lustre-discuss mailing list