[Lustre-discuss] Can lustre be trusted to keep my data safe?

Mike Berg mike.berg at sun.com
Wed May 14 15:11:26 PDT 2008


I should added to my original response, that being a customer once  
many years ago, it is always a good idea to have a second copy of your  
most important/expensive data no matter what a vendor says :-). That  
said, I understand in the large data world, this can be costly.


On May 14, 2008, at 3:21 PM, jrs wrote:

> Thanks for the insight, Jim (and Mike and Aaron),
>
> Unfortunately, I've now gotten contradictory views (not terribly
> surprising: people have different views and experiences, etc...).
>
> Mike (who posted earlier) implied that, if the underlying storage
> and network were solid and if failover is done right that it
> can be trusted.
>
> Jim, would having a support contract change your view?  Or, might
> the progression toward finding that right version/right hardware
> be dangerous even with support?  Is this something related to
> the codes immaturity?  Or just a complex problem?
>
> thanks much,
> John
>
>
> Jim Garlick wrote:
>> John,
>>
>> Lustre can be damn robust if you get the right version on the right
>> hardware.  Also, I think the new engineering practices and future
>> architecture that uses ZFS on the back end will only improve this.
>>
>> That said, your predicament is troubling.  As a general rule I  
>> would not
>> trust any parallel file system that I know of with mission critical  
>> data.
>> Failures do happen; indeed we have lost data in Lustre on several  
>> occasions.
>>
>> In some sense we're in a similar position.  The data we put in Lustre
>> is important to our mission (well some of it anyway), costly to  
>> regenerate,
>> and impractical to back up with a general backup policy.
>>
>> What we do is basically advertise Lustre as temporary scratch space  
>> and
>> provide an HPSS tape archive for users to copy their most critical  
>> data to.
>> That may not work in your case, but if I were you I would at least  
>> have
>> some sort of disaster plan for recovering or regenerating your data.
>> In short, don't trust Lustre or any parallel file system as the sole
>> repository for your mission critical data.
>>
>> Jim
>>
>> On Wed, May 14, 2008 at 02:21:02PM -0400, jrs wrote:
>>> Greetings all,
>>>
>>> I just spoke with someone at a large computing company who
>>> has a close relationship with lustre/sun (a reseller, I guess).
>>> This person described lustre as being something that Sun
>>> "would not recommend for mission critical use."
>>>
>>> Can this be true?
>>>
>>> I work for a small/medium company that does image processing.
>>> We have about 700TB of data presently and might be at 2PB within
>>> the next couple of years.  Owing to the amount of data we don't
>>> make backups for most of it and trust raid 6 on our hardware raid
>>> boxes (nexsan Satabeast) to fail more slowly than we can replace
>>> disks.  Over the last couple of years we've had great luck and,
>>> I believe, have never lost data owing to a failure with this
>>> hardware (software or human error is another matter ;-).
>>> However, the unbacked up data is "mission critical."  Though
>>> it can, probably, all be reconstructed or reacquired, as a practical
>>> matter losing a significant quantity of this data could be
>>> catastrophic for our business.
>>>
>>> So, what do you think, can lustre be trusted to keep our
>>> data safe at our company?  Assume in answering that we have
>>> failover working properly.  We can also withstand some blocking
>>> of the filesystem while a failover event completes, i.e., not
>>> having the filesystem available for some amount of time is
>>> not a problem, but having directory important-data/ disappear
>>> is a HUGE problem.
>>>
>>> Thanks for any help or guidance,
>>>
>>> John
>>> _______________________________________________
>>> Lustre-discuss mailing list
>>> Lustre-discuss at lists.lustre.org
>>> http://lists.lustre.org/mailman/listinfo/lustre-discuss
> _______________________________________________
> Lustre-discuss mailing list
> Lustre-discuss at lists.lustre.org
> http://lists.lustre.org/mailman/listinfo/lustre-discuss




More information about the lustre-discuss mailing list