[lustre-devel] adding IOCTL for ping

Patrick Farrell paf at cray.com
Tue Sep 15 14:12:56 PDT 2015


Hmmm.  What about a proc interface instead?  This feels - to me - better suited to that, since it's a control that isn't part of the I/O path in any way.
________________________________________
From: lustre-devel [lustre-devel-bounces at lists.lustre.org] on behalf of Ben Evans [bevans at cray.com]
Sent: Tuesday, September 15, 2015 3:53 PM
To: Christopher J. Morrone; lustre-devel at lists.lustre.org
Subject: Re: [lustre-devel] adding IOCTL for ping

You¹re correct, targets and client Ids would probably be better.  I was
simply thinking that it would be a relatively straightforward piece of
code to write that could add some good flexibility into Lustre.  Combine
it with an "X is dead, evict it² IOCTL and you could probably do quite a
bit of good, especially if you¹ve got a management system that is also
monitoring aliveness, locations of targets, etc.

-Ben Evans

On 9/15/15, 4:37 PM, "lustre-devel on behalf of Christopher J. Morrone"
<lustre-devel-bounces at lists.lustre.org on behalf of morrone2 at llnl.gov>
wrote:

>Maybe there is just not enough detail in the proposal, but I am not
>seeing why associating this with NIDs is the right way to go.  I believe
>that the RAS work that would use the gossip protocol dealt, at least in
>part, with higher level concepts like targets.  The existing ptlrpc
>pinger pings targets, not NIDs.  That is one of the problematic design
>points of the existing system, that when N OSTs live on the same node
>clients need to send N pings to the same node.
>
>Chris
>
>On 09/15/2015 08:04 AM, Ben Evans wrote:
>> Would there be any interest in adding an IOCTL to update the ping
>> time/status for a particular NID?
>>
>> This should allow for implementation of a pinger in userspace which
>> updates the kernel on the status of various NIDs, and if I understand
>> the ping code well enough, would greatly curtail any pings that is sent
>> by the kernel.
>>
>> This might allow for things like Eric¹s gossip implementation to simply
>> bolt on top of Lustre without any internal kernel changes, or for
>> integration of external monitoring systems to tell Lustre that failovers
>> have occurred, etc.
>>
>> -Ben Evans
>>
>>
>> _______________________________________________
>> lustre-devel mailing list
>> lustre-devel at lists.lustre.org
>> http://lists.lustre.org/listinfo.cgi/lustre-devel-lustre.org
>>
>
>_______________________________________________
>lustre-devel mailing list
>lustre-devel at lists.lustre.org
>http://lists.lustre.org/listinfo.cgi/lustre-devel-lustre.org

_______________________________________________
lustre-devel mailing list
lustre-devel at lists.lustre.org
http://lists.lustre.org/listinfo.cgi/lustre-devel-lustre.org


More information about the lustre-devel mailing list