[Lustre-discuss] Out of Memory on MDS

Seger, Mark mark.seger at hp.com
Tue Jun 23 10:31:05 PDT 2009


If you're looking to collect slab info, collectl exceeds at this.  Just download it from http://collectl.sourceforge.net/, install and "/etc/init.d/collectl start" and will collectl MDS stats every 10 seconds and slab stats every minute (easily changeable) along with a ton of other stats.  You can then play back the recorded data showing slabs that changed during each interval or even report the 10-n slabs (default=10) sorted by a variety of fields (collectl -showtopopts for help on top options).
-mark

From: lustre-discuss-bounces at lists.lustre.org [mailto:lustre-discuss-bounces at lists.lustre.org] On Behalf Of Roger Spellman
Sent: Tuesday, June 23, 2009 12:56 PM
To: lustre-discuss at lists.lustre.org
Subject: [Lustre-discuss] Out of Memory on MDS

I have an MDS that is crashing with out-of-memory.

Prior to the crash, I started collecting /proc/slabinfo.  I see that ldlm_locks is up to 4,500,000, and each one is 512 bytes, for a total of 2.2GB, which is more than half my RAM.

Is there a way to limit this?

Other heavy memory users are ldisk_inode_cache (421 MB) and ldlm_resources (137 MB).  Is there a way to limit these too?

Thanks.

Roger Spellman
Staff Engineer
Terascala, Inc.
508-588-1501
www.terascala.com http://www.terascala.com/

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.lustre.org/pipermail/lustre-discuss-lustre.org/attachments/20090623/c8bb8134/attachment.htm>


More information about the lustre-discuss mailing list