[lustre-discuss] sudden read performance drop on sequential forward read.

John Bauer bauerj at iodoctors.com
Thu Aug 31 17:52:02 PDT 2017


All,

I have an application that writes a 100 GB file forwards and then begins
a sequence of reading a 70 GB section of the file forwards and backwards.
At some point in the run, not always the same point, the read performance
degrades significantly.  The initial forward reads run at about 1.3 GB/s
and the backward reads at about 300 MB/s.  Then, in an instant, the
forward read performance drops to 2.8 MB/s.

From about 250 seconds on, this is the only file being read or written by
the application, which runs on a dedicated client node.  The file has a
stripe count of 4 and a stripe size of 512 KB.  If the stripe count is
changed to 1, the behavior does not occur.  CPU usage is minimal during
the period of degraded performance, and the LNET traffic is also about
2.8 MB/s during that period.  The system has 64 GB of memory, so Lustre
cannot cache the entire 70 GB active set of the file that is being read.
The Lustre client version is 2.9.0.
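
For reference, the layout arithmetic behind the stripe settings can be
sketched as follows.  This is a minimal illustration that assumes plain
round-robin (RAID-0 style) striping of a regular file; the function names
are made up for this note, and the stripe numbers are indices within the
file's layout, not actual OST indices.

```python
STRIPE_SIZE = 512 * 1024   # 512 KB stripe size set on the file
STRIPE_COUNT = 4           # stripe count set on the file
IO_SIZE = 512 * 1024       # size of each read/write issued by the application


def stripe_for_offset(offset, stripe_size=STRIPE_SIZE, stripe_count=STRIPE_COUNT):
    """Return (stripe index, offset within that stripe's object) for a file offset.

    Assumes round-robin striping: consecutive stripe_size chunks go to
    consecutive stripes, wrapping around after stripe_count chunks.
    """
    chunk = offset // stripe_size
    stripe_index = chunk % stripe_count
    object_offset = (chunk // stripe_count) * stripe_size + offset % stripe_size
    return stripe_index, object_offset


def stripes_touched(offset, length, stripe_size=STRIPE_SIZE, stripe_count=STRIPE_COUNT):
    """Return the set of stripe indices covered by one I/O request."""
    first = offset // stripe_size
    last = (offset + length - 1) // stripe_size
    return {chunk % stripe_count for chunk in range(first, last + 1)}
```

Under these assumptions, each aligned 512 KB request lands on exactly one
stripe, and consecutive forward requests cycle through stripes 0, 1, 2, 3.
With a stripe count of 1, every request goes to the same object.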

Any ideas what could be causing this?  What should I be watching under
/proc/fs/lustre to find some clues?
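
One way to look for clues is to snapshot a few files under /proc/fs/lustre
before and after the slowdown and see which ones changed.  The sketch below
does only that.  The file patterns listed (read_ahead_stats, stats, and
rpc_stats) are assumptions about what this client exposes, and the helper
names are hypothetical; adjust both for the node being examined.

```python
import glob
import os

# Assumed location and file patterns; not verified against this client.
LUSTRE_PROC_ROOT = "/proc/fs/lustre"
PATTERNS = ["llite/*/read_ahead_stats", "llite/*/stats", "osc/*/rpc_stats"]


def snapshot(root=LUSTRE_PROC_ROOT, patterns=PATTERNS):
    """Read every file matching the patterns and return {relative path: text}."""
    result = {}
    for pattern in patterns:
        for path in sorted(glob.glob(os.path.join(root, pattern))):
            try:
                with open(path) as handle:
                    result[os.path.relpath(path, root)] = handle.read()
            except OSError:
                # Some entries may not be readable by this user; skip them.
                continue
    return result


def changed_files(before, after):
    """Return the relative paths whose contents differ between two snapshots."""
    return sorted(
        name for name in set(before) | set(after)
        if before.get(name) != after.get(name)
    )
```

Taking one snapshot while the forward reads are still fast and another
after the drop, then passing both to changed_files(), narrows down which
counters are worth reading in detail.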

The behavior is depicted in the attached image, which shows the file
position as a function of wall-clock time.  The writes and reads are
each 512 KB in size.
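
The access pattern plotted in the image can be approximated by the sketch
below.  It is not the application's code.  It assumes that a "backward"
pass issues the same 512 KB requests in descending offset order, that
forward and backward passes alternate, and that the start of the section
being read is a free parameter (section_start), since the post does not
state it.

```python
IO_SIZE = 512 * 1024  # request size stated in the post


def forward_offsets(start, length, io_size=IO_SIZE):
    """Offsets for a forward pass over [start, start + length)."""
    return range(start, start + length, io_size)


def backward_offsets(start, length, io_size=IO_SIZE):
    """Offsets for a backward pass: the same requests, issued last to first."""
    return reversed(forward_offsets(start, length, io_size))


def access_trace(file_size, section_start, section_length, passes, io_size=IO_SIZE):
    """Yield (operation, offset) pairs: one forward write of the whole file,
    then alternating forward and backward read passes over the section."""
    for offset in forward_offsets(0, file_size, io_size):
        yield ("write", offset)
    for i in range(passes):
        offsets = forward_offsets if i % 2 == 0 else backward_offsets
        for offset in offsets(section_start, section_length, io_size):
            yield ("read", offset)
```

Feeding these (operation, offset) pairs to pwrite/pread calls on a test
file with the same stripe settings would give a small reproducer to try
against stripe counts of 1 and 4.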

Thanks,

John



-- 
I/O Doctors, LLC
507-766-0378
bauerj at iodoctors.com

-------------- next part --------------
A non-text attachment was scrubbed...
Name: amnaeacmkgigmlan.png
Type: image/png
Size: 15203 bytes
Desc: not available
URL: <http://lists.lustre.org/pipermail/lustre-discuss-lustre.org/attachments/20170831/0cb4a580/attachment.png>

