[Lustre-discuss] MDS memory usage

Bernd Schubert bs_lists at aakef.fastmail.fm
Thu Aug 26 06:06:34 PDT 2010


Hello Frederik,


On Wednesday, August 25, 2010, Frederik Ferner wrote:
> Hi Bernd,
> 
> thanks for your reply.
> 
> Bernd Schubert wrote:
> > On Tuesday, August 24, 2010, Frederik Ferner wrote:
> >> on our MDS we noticed that all memory seems to be used. (And it's not
> >> just normal buffers/cache as far as I can tell.)
> >> 
> >> When we put load on the machine, for example by starting rsync
> >> on a few clients, generating file lists to copy data from Lustre to
> >> local disks or just running a MDT backup locally using dd/gzip to copy a
> >> LVM snapshot to a remote server, kswapd starts using a lot of CPU
> >> time, sometimes up to 100% of one CPU core.
> >> 
> >> This is on a Lustre 1.6.7.2.ddn3.5 based file system with about 200TB,
> >> the MDT is 800GB with 200M inodes, ACLs enabled.
> > 
> > Did you recompile it, or did you use the binaries from my home page (or
> > those you got from CV)?
> 
> This is a recompiled Lustre version to include the patch from bug   	
> 22820.
> 
> > Possibly it is a LRU auto-resize problem, but which has been disabled in
> > DDN builds. As our 1.6 releases didn't include a patch for that, you
> > would need to specify the correct command options if you recompiled it.
> 
> I guess it's likely that I have not specified the correct option. So the
>   binaries on your home page are compiled with '--disable-lru-resize'?
> Any other options that you used?

I always enable the health-write, which will help pacemaker to detect IO 
errors (by monitoring /proc/fs/lustre/health_check)

 --enable-health-write 

> 
> > Another reason might be bug 22771, although that should only come up on
> > MDS with more memory you have.
> 
> I had a look at that bug and while we have a default stripe count of 1
> so the stripe count should fit into the inode. On the other hand we use
> ACLs in quite a few places, so it seems we might hit this bug if we
> increase the memory from the 16GB currently, correct?

Yeah and I think 16GB should be sufficient for the MDS. 



-- 
Bernd Schubert
DataDirect Networks



More information about the lustre-discuss mailing list