[Lustre-discuss] Performance drop (1.6.5 vs 1.6.4.3, OFED 1.2)?

Mike Bui mbui at datadirect.datadirectnet.com
Fri Oct 17 17:49:43 PDT 2008


Hello,

I am running Lustre 1.6.5.1 and want to downgrade to Lustre 1.6.4.3 for
better performance. How would I do this? Please advise.

Thanks,

Mike

Andrei Maslennikov wrote:
>
>   New performance numbers (1.6.5.1 vs 1.6.4.3):
>
>   ---------------------------------------------------------------------------
>   Client : Intel X5450 at 3.00GHz, 2 x quad core, 16GB RAM,
>            Infiniband, RHEL4 x86_64
>   Servers: Official 1.6.4.1
>   Single stream writing: (lmdd of=/lustre/tstfileXX bs=1M time=200 fsync=1)
>   ---------------------------------------------------------------------------
>
>   2.6.9-67.0.20.ELsmp unmodified, OFED 1.2,         319 MB/sec
>   Lustre 1.6.5.1 (with checksumming):
>   Client loads: lmdd - 100% (1 CPU), ptlrpcd - 5%, pdflush - 15%
>   On 2 OSS servers in use: circa 50% total sys (2 CPUs), circa 10% I/O wait.
>  
>   2.6.9-67.0.7.EL_lustre.1.6.5.1smp, OFED 1.3,      340 MB/sec
>   Lustre 1.6.5.1 (with checksumming):
>   Client loads: lmdd - 100% (1 CPU), ptlrpcd - 5%, pdflush - 15%
>   On 2 OSS servers in use: circa 50% total sys (2 CPUs), circa 12% I/O wait.
>
>   2.6.9-67.0.20.ELsmp unmodified, OFED 1.2,         671 MB/sec
>   Lustre 1.6.5.1 (no checksumming):
>   Client loads: lmdd - 100% (1 CPU), ptlrpcd - 15%, pdflush - 2-3%
>   On 2 OSS servers in use: circa 35% total sys (2 CPUs), circa 35% I/O wait.
>  
>   2.6.9-67.0.7.EL_lustre.1.6.5.1smp, OFED 1.3,      670 MB/sec
>   Lustre 1.6.5.1 (no checksumming):
>   Client loads: lmdd - 100% (1 CPU), ptlrpcd - 12%, pdflush - 2-3%
>   On 2 OSS servers in use: circa 32% total sys (2 CPUs), circa 32% I/O wait.
>
>   2.6.9-67.0.4.EL_lustre.1.6.4.3smp, OFED 1.2,      843 MB/sec
>   Lustre 1.6.4.3
>   Client loads: lmdd - 100% (1 CPU), ptlrpcd - 20%, pdflush - 1%
>   On 2 OSS servers in use: circa 33% total sys (2 CPUs), circa 30% I/O wait.
>
>   ---------------------------------------------------------------------------
>   Running several (2 or 4) simultaneous jobs on the same 1.6.4.3 client
>   does not improve the aggregate performance: I have seen 750 MB/sec
>   aggregate with 4 streams, and 806 MB/sec aggregate with 2 streams.
>
>   With the 1.6.5.1 client and no checksumming I can get up to 800 MB/sec
>   aggregate with 4 streams, and some 730 MB/sec with 2 streams.
>
>   But Lustre 1.6.5.1 is visibly (about 20%) slower on a single stream
>   than 1.6.4.3.
>
>   Andrei.
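
For anyone comparing the single-stream numbers quoted above, here is a small Python sketch of the arithmetic behind the "20%" figure. The throughput values are copied from Andrei's results; the dictionary labels and the percent_drop helper are names made up for this illustration, and the quoted message does not say whether checksumming was active in the 1.6.4.3 run.

```python
# Single-stream write throughput in MB/sec, copied from the quoted results.
# The labels are hypothetical shorthand, not names used in the original post.
RESULTS = {
    "1.6.5.1, OFED 1.2, checksums on": 319,
    "1.6.5.1, OFED 1.3, checksums on": 340,
    "1.6.5.1, OFED 1.2, checksums off": 671,
    "1.6.5.1, OFED 1.3, checksums off": 670,
    "1.6.4.3, OFED 1.2": 843,
}


def percent_drop(baseline, value):
    """Return how far `value` falls below `baseline`, as a percentage."""
    return 100.0 * (baseline - value) / baseline


# Use the 1.6.4.3 client result as the reference point.
baseline = RESULTS["1.6.4.3, OFED 1.2"]

for label, mb_per_sec in RESULTS.items():
    drop = percent_drop(baseline, mb_per_sec)
    print(f"{label:34s} {mb_per_sec:4d} MB/sec  {drop:5.1f}% below 1.6.4.3")
```

With these inputs the two no-checksumming runs come out roughly 20% below the 1.6.4.3 result, which matches the figure in the quoted summary, while the runs with checksumming come out roughly 60% below it.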