[Lustre-discuss] How to detect process owner on client

Ashley Pittman apittman at ddn.com
Wed Feb 16 00:56:28 PST 2011


On 15 Feb 2011, at 22:17, John Hammond wrote:
> I've written a utility called lltop which gathers I/O statistics from
> Lustre servers, along with job assignment data from cluster batch
> schedulers, to give a job-by-job accounting of filesystem load.  Here's
> its output with names changed to protect the innocent:
> 
>  $ sudo tacc_lltop work
>  JOBID     WR_MB  RD_MB  REQS  OWNER WORKDIR
>  1823815    2101      0  4176     al /work/000/al/job1
>  1823060     774      0  1570    bob /work/000/bob/fftw
>  1823634     323      3  3244   chas /work/000/chas/boltzeq
>  1823768     289      0  5108    deb /work/000/deb/mesh-08
>  1823085      55      0   110     ed /work/000/ed/jumble
>  login3       18      3  2961
> 
> We use it on several systems, only with SGE so far, but it's hookable to
> other schedulers.
> 
> See https://github.com/jhammond/lltop for source and documentation.

That looks very useful!  We won't be able to use this directly at DDN because we don't integrate with the right bits of the stack but I'll be sure to make sure our HPC customers hear about it if they are looking for this kind of data.

I also have some code which would work with other schedulers if people are interested.

Ashley,


More information about the lustre-discuss mailing list