[Lustre-discuss] Which NID to use?

Chan Ching Yu Patrick cychan at clustertech.com
Fri Feb 28 01:17:07 PST 2014


Hi Mohr,

The reason why I made this setup is I'm not sure how Lustre selects the 
interface in mult-rail environment.

Especially when all node have Infiniband and Ethernet, how can I ensure 
Infiniband is used between client and OSS?


Regards,
Patrick



On 02/27/2014 12:28 PM, Mohr Jr, Richard Frank (Rick Mohr) wrote:
> On Feb 26, 2014, at 7:14 PM, "Chan Ching Yu, Patrick"<cychan at clustertech.com>
>   wrote:
>
>> [root at mds1 ~]# lctl list_nids
>> 192.168.122.240 at tcp
>> 192.168.100.100 at tcp1
>>
>> [root at oss1 ~]# lctl list_nids
>> 192.168.122.194 at tcp
>> 192.168.100.101 at tcp1
>>
>> [root at client ~]# lctl list_nids
>> 192.168.122.70 at tcp
>> 192.168.100.102 at tcp1
>>
> <snip>
>> On Lustre client, I intentionally mount it with tcp1
>>
>> [root at client ~]# mount | grep lustre
>> 192.168.100.100 at tcp1:/data on /lustre type lustre (rw)
>>
>>
>> Now I dd a file on Lustre filesystem, you can see that tcp0 is used when writing on OST.
>> Why?
> I am not an expert on the inner workings of lustre, but as far as I understand it, when oss1 connects to the mgs, it will report the nids it has available.  When the client connects to mgs to get info about the oss1 server, it will receive a list of all the oss1 nids.  The client then steps through that list and compares the oss1 nids with its local nids to find a match (i.e. - nids that are on the same lnet network).  If it matches tcp0 first, then that is the connection it uses.  The lnet network used to connect to the mgs is irrelevant at that point.  However, I do not know if there are any guarantees about the ordering of the nids that the mgs will report (ie - will tcp0 always be the first nid?).
>
> If there is an error in my description, hopefully a lustre developer will point out the flaw.
>
> It is not clear what you are trying to accomplish with this multi rail setup.  Are you trying to force mds traffic over one client link and oss traffic over the other?  Or are you trying to utilize both links simultaneously for all traffic?
>





More information about the lustre-discuss mailing list