[Lustre-devel] global epochs

Peter J. Braam peter.braam at gmail.com
Fri Jan 9 11:21:39 PST 2009

Hi -

Nikita's discussion is excellent.  I read through it and thought about it a
bit over the vacation.  Two important issues remain unclear in my mind:

1. The write-back cache mode does not scale well (e.g. for many nodes
creating files in one directory); for that case intents save a lot of RPCs.
The recovery correctness discussion doesn't cover this, and I've found that
this introduces new opportunities for cascading aborts that are not managed
by client locks.  I think it would be most beautiful if the epoch protocol
were somewhat enhanced to also handle intent-style processing (where there
is a dependence on RPC responses).  I think some of the response data (FIDs
notably) is not protected by locks (as it is immutable).

2. I'm nervous about broadcasting epoch messages to, for example, 1 million
clients (I think that is the current target).  You'd want a tree model for
this, along with careful calculations of how often failures will occur and
what their impact is on recovery time.
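
As a rough illustration of the kind of calculation meant in point 2, here is
a minimal Python sketch.  It assumes a complete tree in which the server
sends to `fanout` clients and each of those relays to `fanout` more, and it
assumes relays fail independently with the same probability.  The function
names, the fanout, and the failure probability are all hypothetical; none of
them come from Lustre or from the epoch proposal.

```python
def tree_depth(n_clients: int, fanout: int) -> int:
    """Hops needed for a fanout-ary relay tree, rooted at the server,
    to reach n_clients clients (clients themselves act as relays)."""
    if n_clients < 0 or fanout < 2:
        raise ValueError("need n_clients >= 0 and fanout >= 2")
    depth = 0
    reached = 0      # clients reached so far
    level_size = 1   # nodes on the current level (level 0 is the server)
    while reached < n_clients:
        level_size *= fanout
        reached += level_size
        depth += 1
    return depth


def path_ok_probability(depth: int, p_relay_fail: float) -> float:
    """Probability that one broadcast reaches a client at the given depth,
    assuming each of the depth - 1 relay clients on its path fails
    independently with probability p_relay_fail (an assumed model)."""
    if depth < 1:
        raise ValueError("a client is at least one hop from the server")
    return (1.0 - p_relay_fail) ** (depth - 1)
```

With an assumed fanout of 32, `tree_depth(1_000_000, 32)` gives 4 hops, so
the server sends 32 messages per epoch instead of a million, at the price of
up to three relay clients on each path whose failure delays delivery.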
