[lustre-discuss] MGS failover problem

Mohr Jr, Richard Frank (Rick Mohr) rmohr at utk.edu
Wed Jan 11 09:16:30 PST 2017


> On Jan 11, 2017, at 11:58 AM, Ben Evans <bevans at cray.com> wrote:
> 
> Getting failover right over multiple separate networks can be a real
> hair-pulling experience.

Darby: Do you have the option of (at least temporarily) running the file system with only InfiniBand configured?  If you could set up the file system to use only InfiniBand, that would eliminate any complications from having two fabrics active at the same time.  Then you could see whether the problem persists.

--
Rick Mohr
Senior HPC System Administrator
National Institute for Computational Sciences
http://www.nics.tennessee.edu


