[Lustre-devel] Wide striping

Nathan Rutman Nathan_Rutman at xyratex.com
Thu Oct 6 12:09:26 PDT 2011

On Oct 5, 2011, at 6:51 PM, Andreas Dilger wrote:

> On 2011-10-05, at 9:33 AM, David Dillow <dillowda at ornl.gov> wrote:
>> On Wed, 2011-10-05 at 11:06 -0400, Nathan Rutman wrote:
>>> Sorry if I'm being unclear.
>>> start_index is just an offset into the bitmap.  That's the OST where the first 
>>> stripe will be.  Next stripe will be on the next OST index (unless a hole). 
>>> When we get to the big hole at the end of the used OSTs, these OST index 
>>> locations are all skipped (since they are holes), and the next stripe will 
>>> be at OST index 0, then 1, etc, up to start_index-1 (again, unless holes).
>> Ok, so bitmap position 0 is always OST 0; thanks for clearing up my
>> misunderstanding.
> But this means that the table always needs to be as large as the maximum OST number. If the bitmap started at the starting OST index it would only need to be as large as the number of stripes. 
Yes, the table is as large as the maximum possible OST number.  32,000 stripes fit in a bitmap in the current (non-extended) EA size.  If you started at the starting OST index, you would need to record the last OST number also.  Either way I don't see as a problem.

> That said, the limitation if not being able to migrate objects with this layout is a major one. The ability to so online object migration is just arriving with the layout lock (from HSM), so I expect this to be useful to many users. 
Well, that's why we added the complication of embedding the OST index into the object FIDs that the clients would ask for. Then you could migrate that object to a new OST - but really only for exceptional cases.  General migration for e.g. space rebalancing would result in a bunch of additional overhead to figure out where all the stripes moved to.  So I agree - this is a weakness of the bitmap design, which really implies a fixed ordering. =
This email may contain privileged or confidential information, which should only be used for the purpose for which it was sent by Xyratex. No further rights or licenses are granted to use such information. If you are not the intended recipient of this message, please notify the sender by return and delete it. You may not use, copy, disclose or rely on the information contained in it.
Internet email is susceptible to data corruption, interception and unauthorised amendment for which Xyratex does not accept liability. While we have taken reasonable precautions to ensure that this email is free of viruses, Xyratex does not accept liability for the presence of any computer viruses in this email, nor for any losses caused as a result of viruses.
Xyratex Technology Limited (03134912), Registered in England & Wales, Registered Office, Langstone Road, Havant, Hampshire, PO9 1SA.
The Xyratex group of companies also includes, Xyratex Ltd, registered in Bermuda, Xyratex International Inc, registered in California, Xyratex (Malaysia) Sdn Bhd registered in Malaysia, Xyratex Technology (Wuxi) Co Ltd registered in The People's Republic of China and Xyratex Japan Limited registered in Japan.

More information about the lustre-devel mailing list