[Lustre-discuss] One or two OSS, no difference?

Jeffrey Bennett jab at sdsc.edu
Fri Mar 5 13:53:34 PST 2010


Andreas, if we are using 4kb blocks I understand we only transfer 1 page per RPC call, so are we limited to 10-15K RPC per second or what's the same, 10-15.000 IOPS?

jab


-----Original Message-----
From: Andreas.Dilger at sun.com [mailto:Andreas.Dilger at sun.com] On Behalf Of Andreas Dilger
Sent: Friday, March 05, 2010 2:05 AM
To: Jeffrey Bennett
Cc: Oleg.Drokin at Sun.COM; lustre-discuss at lists.lustre.org
Subject: Re: [Lustre-discuss] One or two OSS, no difference?

On 2010-03-04, at 14:18, Jeffrey Bennett wrote:
> I just noticed the sequential performance is ok, but the random IO  
> (which is what I am measuring) is not. Is there any way to increase  
> random IO performance on Lustre? We have LUNs that can provide  
> around 250.000 random read 4kb IOPS but we are only seeing 3.000 to  
> 10.000 on Lustre.

There is work currently underway to improve the SMP scaling  
performance for the RPC handling layer in Lustre.  Currently that  
limits the delivered RPC rate to 10-15k/sec or so.

> -----Original Message-----
> From: Oleg.Drokin at Sun.COM [mailto:Oleg.Drokin at Sun.COM]
> Sent: Thursday, March 04, 2010 12:49 PM
> To: Jeffrey Bennett
> Cc: lustre-discuss at lists.lustre.org
> Subject: Re: [Lustre-discuss] One or two OSS, no difference?
>
> Hello!
>
>   This is pretty strange. Are there any differences in network  
> topology that can explain this?
>   If you remove the first client, does the second one shows  
> performance
>   at the level of of the first, but as soon as you start the load on  
> the first again, the second
>   client performance drops?
>
> Bye,
>    Oleg
> On Mar 4, 2010, at 1:45 PM, Jeffrey Bennett wrote:
>
>> Hi Oleg, thanks for your reply
>>
>> I was actually testing with only one client. When adding a second  
>> client using a different file, one client gets all the performance  
>> and the other one gets very low performance, any recommendation?
>>
>> Thanks in advance
>>
>> jab
>>
>>
>> -----Original Message-----
>> From: Oleg.Drokin at Sun.COM [mailto:Oleg.Drokin at Sun.COM]
>> Sent: Wednesday, March 03, 2010 5:20 PM
>> To: Jeffrey Bennett
>> Cc: lustre-discuss at lists.lustre.org
>> Subject: Re: [Lustre-discuss] One or two OSS, no difference?
>>
>> Hello!
>>
>> On Mar 3, 2010, at 6:35 PM, Jeffrey Bennett wrote:
>>> We are building a very small Lustre cluster with 32 clients  
>>> (patchless) and two OSS servers. Each OSS server has 1 OST with 1  
>>> TB of Solid State Drives. All is connected using dual-port DDR IB.
>>>
>>> For testing purposes, I am enabling/disabling one of the OSS/OST  
>>> by using the "lfs setstripe" command. I am running XDD and vdbench  
>>> benchmarks.
>>>
>>> Does anybody have an idea why there is no difference in MB/sec or  
>>> random IOPS when using one OSS or two OSS? A quick test with "dd"  
>>> also shows the same MB/sec when using one or two OSTs.
>>
>> I wonder if you just don't saturate even one OST (both backend SSD  
>> and IB interconnect) with this number of clients? Does the total  
>> throughput decreases as you decrease
>> number of active clients and increases as you increase it even  
>> further?
>> Increasing maximum number of in-flight rpcs might help in that case.
>> Also are all of your clients writing to the same file or each  
>> client does io to a separate file (I hope)?
>>
>> Bye,
>>   Oleg
>
> _______________________________________________
> Lustre-discuss mailing list
> Lustre-discuss at lists.lustre.org
> http://lists.lustre.org/mailman/listinfo/lustre-discuss


Cheers, Andreas
--
Andreas Dilger
Sr. Staff Engineer, Lustre Group
Sun Microsystems of Canada, Inc.




More information about the lustre-discuss mailing list