[Lustre-discuss] Frequent OSS Crashes with heavy load

Wang lu wanglu at ihep.ac.cn
Mon Nov 10 09:01:00 PST 2008


Thanks a lot. I will go on tomorrow.

Brian J. Murrell 写:

> On Mon, 2008-11-10 at 16:42 +0000, Wang lu wrote:
>> I have already 512(max number) IO thread running. Some of them are of "Dead"
>> status. Is it safe to draw conclusion that the OSS is oversubscribed? 
> 
> Until you do some analysis of your storage with the iokit, one cannot
> really draw any conclusions, however if you are already at the maximum
> value of OST threads, it would not be difficult to believe that perhaps
> this is a possibility.
> 
> Try a simple experiment and half the number to 256 and see if you have
> any drop off in throughput to the storage devices.  If not, then you can
> easily assume that 512 was either too much or not necessary.  You can
> try doing this again if you wish.  If you get to a value of OST threads
> where your throughput is lower than it should be, you've gone too low.
> 
> But really, the iokit is the more efficient and accurate way to
> determine this.
> 
> b.
> 



More information about the lustre-discuss mailing list