[lustre-discuss] MGS failover problem

Mohr Jr, Richard Frank (Rick Mohr) rmohr at utk.edu
Wed Jan 11 11:17:19 PST 2017


> On Jan 11, 2017, at 12:39 PM, Vicker, Darby (JSC-EG311) <darby.vicker-1 at nasa.gov> wrote:
> 
>>> Getting failover right over multiple separate networks can be a real
>>> hair-pulling experience.
>> 
>> Darby: Do you have the option of (at least temporarily) running the file system with only Infiniband configured?  If you could set up the file system to only use Infiniband, then >that would eliminate any complications from having two fabrics active at the same time.  Then you could see if the problem still persists.
> 
> Yes, but is there any reason why you are choosing IB over Ethernet?  I think I'd prefer to try over the Ethernet is we are going to pick one.  


I just figured that if you had Infiniband, then you would prefer to run with the higher performance interconnect.  But you can try ethernet just as well.

--
Rick Mohr
Senior HPC System Administrator
National Institute for Computational Sciences
http://www.nics.tennessee.edu



More information about the lustre-discuss mailing list