[Lustre-discuss] High CPU load, only on 1 OSS

Wang Yibin wang.yibin at oracle.com
Tue Nov 16 07:53:19 PST 2010


Hello,

Normally, when stripe_offset is set to -1, the MDS balances load and space across the OSTs automatically.
What is your usage pattern on the filesystem?
It sounds like your applications are doing intensive I/O on that particular OSS.

To find out why the load on the OSS is so high, please:
- find which processes are hogging the CPUs, using top(1).
- get the stripe info of your in-use files to see whether most of them reside on the same OSS.

If the files in use are not distributed among the OSS servers, or your file usage pattern is one-OSS bound, you may want to consider tuning the stripe_count/stripe_size.
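
To make the stripe check above less manual, here is a rough sketch that tallies how many file objects land on each OST index. It assumes that `lfs getstripe` prints, for each file, a table whose header line starts with "obdidx" followed by one row per object, as the 1.8-era releases do; the function names are made up for illustration, and you still need to map OST indices to OSS nodes yourself.

```python
import subprocess
from collections import Counter


def count_objects_per_ost(getstripe_output):
    """Count objects per OST index in `lfs getstripe` text output.

    Assumption: each file's stripe table starts with a header line whose
    first field is "obdidx", and each following row begins with the
    integer OST index. Anything else ends the current table.
    """
    counts = Counter()
    in_table = False
    for line in getstripe_output.splitlines():
        fields = line.split()
        if fields and fields[0] == "obdidx":
            in_table = True
            continue
        if in_table and len(fields) >= 2 and fields[0].isdigit():
            counts[int(fields[0])] += 1
        else:
            in_table = False
    return counts


def stripe_report(paths):
    """Run `lfs getstripe` on the given paths (hypothetical helper)."""
    result = subprocess.run(
        ["lfs", "getstripe", *paths],
        capture_output=True, text=True, check=True,
    )
    return count_objects_per_ost(result.stdout)
```

On a client you could call stripe_report() with the paths of the files your jobs are currently using; if one OST index dominates the counts, and that OST lives on the busy OSS, the load is probably following the data.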

On 2010-11-16, at 10:38 PM, Ronald K Long wrote:

> 
> We recently set up a Lustre configuration: 1 MDS and 4 OSSs. Everything is running fine, except that the first OSS is experiencing very high CPU load, in the high 50s. The other 3 OSSs are steady at around 8. Everything is the same across all of the OSSs.
> 
> The stripe is set up as follows:
> 
> stripe_count:   1 stripe_offset:  -1 
> 
> Red Hat 5 64bit 
> 
> kernel-2.6.18-194.3.1.el5_lustre.1.8.4 
> kernel-devel-2.6.18-194.3.1.el5_lustre.1.8.4 
> lustre-ldiskfs-3.1.3-2.6.18_194.3.1.el5_lustre.1.8.4 
> lustre-1.8.4-2.6.18_194.3.1.el5_lustre.1.8.4 
> lustre-modules-1.8.4-2.6.18_194.3.1.el5_lustre.1.8.4 
> 
> 
> Is there anything I can check on the problem OSS to rectify this issue?
> 
> Thank you in advance
> 
> Rocky 
