[lustre-discuss] LNET Routing Question

Michael Di Domenico mdidomenico4 at gmail.com
Wed May 9 09:02:32 PDT 2018


On Wed, May 9, 2018 at 9:50 AM, Makia Minich
<makia at systemfabricworks.com> wrote:
>
> I have an LNET routing question. I’ve attached a quick diagram of the current setup; but basically I have two core networks (one infiniband and one ethernet) with a set of LNET routers in between. There is storage and clients on both sides of these routers and all clients need to see all/most storage. All connections, configurations, etc are all working.
>
> The question is, if an LNET router goes down (which does cause some amount of reconnect or remapping for any clients attempting to use those routes) would this cause any issues or delays for a client’s connection to non-routed storage? Put slightly different, if a job on the ethernet clients is actively using ethernet storage and the lnet routers go down, will job be affected? What about a new job just launching when that lnet router is down?

just for the sake of clarity when you say "routers down" do you mean
all routers or just one/two?


More information about the lustre-discuss mailing list