[lustre-discuss] lustre OSC and system cache

John Bauer bauerj at iodoctors.com
Mon Dec 12 14:50:29 PST 2016


I'm observing some undesirable caching of OSC data in the system 
buffers.  This is a single node, single process application. There are 2 
files of interest, *SCRATCH *and *SCR300*, both are scratch files with 
stripeCount=4.  The system has 128GB of memory.  Lustre maxes out at 
about 59GB of memory used for caching.

*SCRATCH*,  About 22GB is written/read during the first 300 seconds of 
the run.  No further activity to the file ( but remains open ) until 
about 18,700 seconds into the run when another 22GB is written/read.  
Illustrated in the top frame of the first plot below.  In the bottom 
frame of the first plot is the amount of system cache used by each of 
the 4 OSC's associated with the file over the course of the run ( nearly 
identical, as would be expected ).  Note that each the OSC's retains its 
5.5GB of memory even though nothing is happening to the file.

*SCR300*,  A 110GB file, written and repeatedly read between the times 
of the above SCRATCH file's I/O.

What is of interest it that while SCR300 is doing all its I/O, and its 
associated OSC's are fighting each other for caching memory, the 4 OSC's 
for the inactive file(SCRATCH) retain their 22GB of memory.  Why are the 
4 OSC's for the inactive file exempt from giving up their memory?  It is 
very reproducible.

The application is MSC.Nastran, which has the capability to put the data 
for SCR300 inside of SCRATCH(increasing its size to 132GB).  If run in 
this mode, the caching behavior is much better behaved and the job runs 
in 11,500 seconds, versus 19,000. Illustrated in 3rd plot below.  While 
this is a solution for this case, it is not a general solution.

Thanks

John

Plots for *SCRATCH*



Plots for *SCR300*




Plots for *SCR300 *inside of *SCRATCH*


-- 
I/O Doctors, LLC
507-766-0378
bauerj at iodoctors.com

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.lustre.org/pipermail/lustre-discuss-lustre.org/attachments/20161212/9fe5bfd4/attachment-0001.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: bfoimgfaenjmgmii.png
Type: image/png
Size: 8780 bytes
Desc: not available
URL: <http://lists.lustre.org/pipermail/lustre-discuss-lustre.org/attachments/20161212/9fe5bfd4/attachment-0003.png>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: mncccijbfkiekmmn.png
Type: image/png
Size: 16468 bytes
Desc: not available
URL: <http://lists.lustre.org/pipermail/lustre-discuss-lustre.org/attachments/20161212/9fe5bfd4/attachment-0004.png>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: adnondhpelpohhjf.png
Type: image/png
Size: 14165 bytes
Desc: not available
URL: <http://lists.lustre.org/pipermail/lustre-discuss-lustre.org/attachments/20161212/9fe5bfd4/attachment-0005.png>


More information about the lustre-discuss mailing list