[Lustre-discuss] Out of Memory on MDS

Roger Spellman roger at terascala.com
Wed Jun 24 07:20:13 PDT 2009


Andreas,

Thanks.  I've opened a bug,
https://bugzilla.lustre.org/show_bug.cgi?id=19950

In that bug I show that the number of ldlm_locks exceeds the limit
reported by lctl get_param ldlm.namespaces.*.pool.limit.

Do you agree that if we write to lru_size on every client, then that
will set a limit on ldlm_locks on the servers?

Do you know of a way to limit ldiskfs_inode_cache?

Thanks.

-Roger

> -----Original Message-----
> From: Andreas.Dilger at sun.com [mailto:Andreas.Dilger at sun.com] On Behalf
Of
> Andreas Dilger
> Sent: Tuesday, June 23, 2009 6:07 PM
> To: Roger Spellman
> Cc: Cliff.White at sun.com; lustre-discuss at lists.lustre.org
> Subject: Re: [Lustre-discuss] Out of Memory on MDS
> 
> On Jun 23, 2009  16:50 -0400, Roger Spellman wrote:
> > The servers are 1.6.7.1.  The clients are a mix of 1.6.7.1 and
1.6.6.
> >
> > Is lru_size an upper limit on the number of entries?
> >
> > Also, lru_max_age does not seem to be working.  I set it to 10
seconds,
> > and it did not clean anything up.
> 
> This is worth filing a bug on, if it isn't working.  The lock LRU size
> should be limited by the size of the RAM.  The upper limit on the
number
> of locks being granted can be gotten via:
> 
> 	lctl get_param ldlm.namespaces.*.pool.limit
> 
> The default limit should be 50 locks per 1MB of RAM.  In your case,
> 4GB is 4096MB, so the LRU limit should be 50 * 4096 = 204800 locks.
> 
> > > Roger Spellman wrote:
> > > > I have an MDS that is crashing with out-of-memory.
> > > >
> > > > Prior to the crash, I started collecting /proc/slabinfo.  I see
that
> > > > ldlm_locks is up to 4,500,000, and each one is 512 bytes, for a
> > > > total of 2.2GB, which is more than half my RAM.
> > > >
> > > > Is there a way to limit this?
> > >
> > > You don't mention the version of Lustre - lru_size might have an
> > > impact, I am not certain. I believe it is the only lock tuneable
> > > of note. (and is auto-sized in recent Lustre)
> 
> Cheers, Andreas
> --
> Andreas Dilger
> Sr. Staff Engineer, Lustre Group
> Sun Microsystems of Canada, Inc.




More information about the lustre-discuss mailing list