[Lustre-discuss] Unbalanced OST--for discussion purposes

Andreas Dilger adilger at sun.com
Tue Mar 2 20:06:51 PST 2010


On 2010-03-02, at 13:45, Ms. Megan Larko wrote:
> I logged directly into the OSS (OSS4) and just ran a df (along with a
> periodic check of the log files).  I last looked about two weeks ago
> (I know it was after 17 Feb).   Anyway, the OST0007 is more full than
> any of the other OSTs.  The default lustre stripe (I believe that is
> set to 1) is used.    Can just one file shift the size used of one OST
> that significantly?

Sure, this is easy if the size of a single file is a large fraction
of the OST size.  This is one reason why we recommend people use
larger OSTs (up to 16TB in 1.8.2 with RHEL5.4) instead of e.g. the 1TB
or smaller OSTs that are sometimes reported here.
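
As a rough illustration of the arithmetic (a sketch only: the helper
name per_ost_pct and the 1.5T file size are hypothetical, while the
6.3T OST size comes from the df output quoted in this thread), with a
stripe count of 1 the whole file lands on one OST, and with a stripe
count of N each OST holds roughly 1/N of a large file:

```shell
#!/bin/sh
# per_ost_pct FILE_SIZE OST_SIZE [STRIPE_COUNT]
# Hypothetical helper: prints the percentage of one OST that a single
# file occupies, assuming its data is spread evenly over STRIPE_COUNT
# OSTs. FILE_SIZE and OST_SIZE must be given in the same unit.
per_ost_pct() {
    awk -v f="$1" -v o="$2" -v s="${3:-1}" \
        'BEGIN { printf "%.1f\n", (f / s) / o * 100 }'
}

# Example: a 1.5T file on 6.3T OSTs.
#   per_ost_pct 1.5 6.3 1    # stripe count 1
#   per_ost_pct 1.5 6.3 4    # stripe count 4
```

With these illustrative numbers a single unstriped file would account
for nearly a quarter of one OST, which is about the size of the
imbalance being asked about.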

> What other reasonable explanation for a difference on one OST in  
> comparison with the others?  Could this cause a lustre performance  
> hit at this point?

It is possible, if the filesystem is getting very full and it causes  
more seeking to do IO.  At the 84% you report below it is starting to  
get into that range - I wouldn't recommend running the filesystem  
beyond 90% full unless you are more concerned with space usage than  
performance.
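
A minimal sketch of how one might watch for this on an OSS, assuming
the OSTs are mounted under /srv/lustre/OST as in the df output quoted
in this thread. The function name full_osts is hypothetical; it only
parses POSIX-format df output and does not talk to Lustre itself:

```shell
#!/bin/sh
# full_osts [THRESHOLD]
# Hypothetical helper: reads `df -P`-style output on stdin and prints
# the mount point and Use% of every filesystem at or above THRESHOLD
# percent (default 90, the figure suggested above).
full_osts() {
    threshold="${1:-90}"
    awk -v t="$threshold" '
        NR > 1 {                      # skip the header line
            pct = $5
            sub(/%/, "", pct)         # "84%" -> "84"
            if (pct + 0 >= t) print $6, $5
        }'
}

# Usage on an OSS (the path is an assumption based on this thread):
#   df -P /srv/lustre/OST/* | full_osts 80
```

The -P flag keeps each filesystem on one line, so the field positions
stay fixed even for long mount point names.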

You can find the file(s) that are abnormally large on that particular  
OST by running (preferably on a client mountpoint on the MDS):

   lfs find --obd crew8-OST0007_UUID -size +10G /mnt/lustre
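
If that search returns many files, it may help to rank them by size.
The following is only a sketch: largest_first is a hypothetical helper,
it assumes GNU stat (as shipped with RHEL), and it assumes one path per
line with no embedded newlines:

```shell
#!/bin/sh
# largest_first
# Hypothetical helper: reads file paths on stdin (one per line) and
# prints "SIZE_IN_BYTES PATH" for each, largest file first.
# Assumes GNU coreutils stat (-c is not POSIX).
largest_first() {
    while IFS= read -r path; do
        stat -c '%s %n' "$path"
    done | sort -rn
}

# Usage on a client, where OST_UUID is a placeholder for the UUID of
# the OST being investigated:
#   lfs find --obd "$OST_UUID" -size +10G /mnt/lustre | largest_first
```

The file at the top of the list is the first candidate to check when
one OST is much fuller than the others.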

>   [root@oss4 ~]# df -h
> Filesystem            Size  Used Avail Use% Mounted on
>
> /dev/sdb1             6.3T  3.6T  2.5T  60% /srv/lustre/OST/crew8-OST0000
> /dev/sdb2             6.3T  4.1T  1.9T  69% /srv/lustre/OST/crew8-OST0001
> /dev/sdc1             6.3T  3.3T  2.8T  55% /srv/lustre/OST/crew8-OST0002
> /dev/sdc2             6.3T  3.3T  2.7T  56% /srv/lustre/OST/crew8-OST0003
> /dev/sdd1             6.3T  3.5T  2.6T  58% /srv/lustre/OST/crew8-OST0004
> /dev/sdd2             6.3T  4.1T  1.9T  69% /srv/lustre/OST/crew8-OST0005
> /dev/sdi1             6.3T  3.9T  2.2T  65% /srv/lustre/OST/crew8-OST0006
> /dev/sdi2             6.3T  5.0T 1015G  84% /srv/lustre/OST/crew8-OST0007     <----
> /dev/sdj1             6.3T  3.4T  2.7T  56% /srv/lustre/OST/crew8-OST0008
> /dev/sdj2             6.3T  3.3T  2.7T  56% /srv/lustre/OST/crew8-OST0009
> /dev/sdk1             6.3T  3.4T  2.7T  56% /srv/lustre/OST/crew8-OST0010
> /dev/sdk2             6.3T  3.8T  2.2T  64% /srv/lustre/OST/crew8-OST0011

Cheers, Andreas
--
Andreas Dilger
Sr. Staff Engineer, Lustre Group
Sun Microsystems of Canada, Inc.