[Lustre-discuss] very slow directory access (lstat taking >20s to return)
Frederik Ferner
frederik.ferner at diamond.ac.uk
Tue Nov 9 06:35:22 PST 2010
Thanks both for your replies.
In the mean time we've upgraded our file system to 1.8.3-ddn3.3, which
kept us busy, so sorry for the late reply. In our initial testing we've
not seen this problem but more testing is in progress.
Andreas Dilger wrote:
> On 2010-10-29, at 21:20, Daniel Kobras wrote:
>> On Fri, Oct 29, 2010 at 09:40:33AM +0100, Frederik Ferner wrote:
>>> Doing a 'strace -T -e file ls -n' on one directory with about 750
>>> files, while users were seeing the hanging ls, showed lstat calls
>>> taking seconds, up to 23s.
>> The (l)stat() calls determine the exact size of all files in the
>> displayed directory. This means that each OSTs needs to revoke
>> client write locks for all these files, ie. client-side write
>> caches for all files in the directory are flushed before the
>> (l)stat() returns. This can easily take several seconds if there is
>> heavy write activity on the file.
As far as I was able to see at that time, the actual files were not
written to anymore but additional files in the directory or one
directory below were still created.
> Actually, unlike most other cluster filesystems Lustre does not need
> to revoke the OST write locks in order to determine the file size.
> The OST extent locks are conditionally revoked if the client is no
> longer using them, but if they are in use the clients holding those
> locks only return a "glimpse" of the current file size to the OST,
> which in turn returns the size to the client doing the (l)stat()
> call.
I assume on a busy OST this could still take a while, though?
We'll monitor the situation after the upgrade and if it happens again,
we'll try more debugging again.
Thanks,
Frederik
--
Frederik Ferner
Computer Systems Administrator phone: +44 1235 77 8624
Diamond Light Source Ltd. mob: +44 7917 08 5110
(Apologies in advance for the lines below. Some bits are a legal
requirement and I have no control over them.)
More information about the lustre-discuss
mailing list