[Lustre-devel] async write and abort_recov
andreas.dilger at ORACLE.COM
Thu Jul 15 09:19:39 PDT 2010
On 2010-07-15, at 02:05, Aurelien Degremont wrote:
> Andreas Dilger wrote:
>> While I know Lustre will save errors from async write RPCs into the file descriptor (for later write calls or fsync), I don't know if we save any I/O error into the file descriptor if we discard pages due to eviction. I think only errors from RPCs that are in flight and get aborted due to client eviction are returned.
> Sounds like a bug to me? That means that if a process writes data on a client, the data goes to the page cache, and not yet to the OST if there is no local memory pressure. If the client is evicted at that moment, those pages are dropped. Then the client reconnects and the process writes other data. Those I/Os are successful, so the client has missed that some previous I/O failed?
I would agree.
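
For context, the application-side pattern that depends on errors being saved in the file descriptor looks roughly like the sketch below. It is only an illustration under assumptions: a generic POSIX client, a hypothetical helper name (durable_write), and Python's os module standing in for the raw system calls. It is not Lustre code, and it cannot detect the eviction case described above if no error was saved for the dropped pages.

```python
import os
import tempfile


def durable_write(path, data):
    """Write data and return only after fsync() has succeeded.

    Hypothetical helper for illustration. Raises OSError if write(),
    fsync(), or close() reports a failure.
    """
    fd = os.open(path, os.O_WRONLY | os.O_CREAT | os.O_TRUNC, 0o644)
    try:
        view = memoryview(data)
        while view:
            # A successful write() only means the data reached the page cache.
            written = os.write(fd, view)
            view = view[written:]
        # fsync() is the point where a saved write-back error can be reported.
        os.fsync(fd)
    finally:
        os.close(fd)
    return len(data)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        target = os.path.join(tmp, "out.dat")
        durable_write(target, b"payload")
```

If the dirty pages are discarded without an error being recorded against the descriptor, both os.write() and os.fsync() here would return normally, which is exactly the silent data loss in question.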
Lustre Technical Lead
Oracle Corporation Canada Inc.