[Lustre-discuss] How to determine which lustre clients are loading filesystem.
Craig Prescott
prescott at hpc.ufl.edu
Thu Jul 8 11:52:13 PDT 2010
Hi Wojciech;
We run collectl on each compute node, and toss some interesting numbers
from it into ganglia (r/s, w/s, throughputs, etc).
collectl can be found here:
http://collectl.sourceforge.net/
There also are per-filesystem statistics on each client in the
directories underneath /proc/fs/lustre/llite, and per-OST stats
underneath /proc/fs/lustre/osc. You can feed the 'stats' files in these
dirs to the 'llstat' command to show stats in an interval of your choosing.
Cheers,
Craig Prescott
UF HPC Center
Wojciech Turek wrote:
> Hi,
>
> Our Lustre filesystem (Lustre 1.8.3, RHEL5) got recently very busy and
> users are noticing the slowness. The Lustre system consists of ~550
> clients and currently we have 50 different users running jobs. I can see
> that OSS servers have load oscillating between 100-300 and collectl
> shows that there are lots of I/O going on (mainly read). I would like to
> find a good method of finding out which Lustre clients are generating
> the I/O so I could pinpoint the high load to a particular jobs. I hope
> that some Lustre users can share their experience in that matter.
>
> Best regards,
>
> --
> --
> Wojciech Turek
>
>
>
> ------------------------------------------------------------------------
>
> _______________________________________________
> Lustre-discuss mailing list
> Lustre-discuss at lists.lustre.org
> http://lists.lustre.org/mailman/listinfo/lustre-discuss
More information about the lustre-discuss
mailing list