[Lustre-devel] imperative recovery

Nicholas Henke nic at cray.com
Fri Jan 9 11:43:42 PST 2009


Robert Read wrote:
> 
> On Jan 9, 2009, at 07:27 , Nicholas Henke wrote:

>>
>> I do think this will miss a significant case: combo MGS+MDS. A 
>> majority of our
>> customers are deploying with this configuration. Perhaps exposing this 
>> mechanism
>> on the clients via a /proc file would be enough - that way a failover 
>> framework
>> could manually trigger the timeout and/or nid switching.
> 
> Yes, exactly what I was thinking. Exposing this feature via proc (or 
> lctl) on the clients is the first step. It's has minimal impact, 
> requires no changes to the server, and should integrate well with 
> existing failover frameworks.  We also need to get the server to end 
> recovery sooner (without waiting for all the stale exports), but VBR 
> should help with that.
> 
> robert

FWIW: we'd prefer /proc. We don't ship lctl on our computes for memory 
(initramfs) usage reasons. Being in /proc makes it easy for someone to use the 
functionality from another kernel module as well; we can just call the .read or 
.write functions directly.

Nic



More information about the lustre-devel mailing list