[Lustre-devel] faking LNET scale
Liang Zhen
Zhen.Liang at Sun.COM
Fri Apr 17 11:33:30 PDT 2009
Nic,
It's very late night for me now, my head is not clear enough for me to
make sure whether I'm saying something crazy, :)
LNet always thinks target is remote network(needs router) if it can't
find a NI with same network ID, for example, if local NI is (ptl0) and
caller wants to send message to (ptl1), then LNet will:
1. Try to find local NI for ptl1, and failed then:
2. try to find if ptl1 is a remote network and whether there is router
for this network (ptl1)
So if you want your server has only one NI instance and can talk with a
set of different networks, and at the same time, it can talk with other
remote networks via routers, I would suggest:
1. create a new command, for example: lctl add_local_net ptl0 ptl[1-N],
which means LNet should allow NI(ptl0) accessing networks( ptl[1-N] as
local networks.
2. add a new structure in LNet, i.e:
struct {
struct list_head ln_list;
__u32 ln_net;
lnet_ni_t *ln_localni;
......
}lnet_localnet_t;
As you see, it's very like current structure lnet_remotenet_t, which is
pending on lnet_t::ln_remote_nets; we can create a lnet_locallnet_t
object and add it to global list (i.e: lnet_t::ln_local_nets) by the
command we mentioned above: lctl add_local_net
3. once upper layer caller sending message, lnet_send() should check
lnet_t::ln_local_nets firstly (before thinking it's a remote network and
checking on lnet_t::ln_remote_nets), if it is on
lnet_t::ln_local_netsthen we can take the local NI. on
lnet_locanet_t::ln_localni;
4. We need add a new flag for LND, only LND with the flag can support
command lctl add_local_net.
5. make the LND wouldn't reject messages from different networks.
again, hope I'm answering what you are asking, :)
Regards
Liang
Nicholas Henke wrote:
> Greetings -
>
> I was looking into ways to simulate scale at the LNET level. It would allow us
> to test the LNDs better with less hardware, not to mention things like LNet
> SelfTest and friends.
>
> With the work in bug 15332 to add multiple nets per NIC, it seemed fairly close
> that we could use that to generate multiple LND connections from a single NIC.
> Ideally we'd have a server or router that would have just one LND instance
> (ptl0) and the client nodes with multiple interfaces (ptl1, ptl2, ...). This
> would increase the load on those server nodes to something interesting.
>
> However, to do this either hacking up lnet_ptlcompat_matchXXX to look at another
> flag besides the_lnet.ln_ptlcompat or some other way of allowing a server with a
> single NET (ptl0) to accept requests from a variety of nets (ptl1, ptl2, etc).
> One cannot use multiple interfaces for the same net type with ln_ptlcompat enabled.
>
> Is there a better way to do this ? What would be the least abusive of th e rules ?
>
> Cheers,
> Nic
> _______________________________________________
> Lustre-devel mailing list
> Lustre-devel at lists.lustre.org
> http://lists.lustre.org/mailman/listinfo/lustre-devel
>
More information about the lustre-devel
mailing list