[Lustre-discuss] HLRN lustre breakdown

Heiko Schroeter schroete at iup.physik.uni-bremen.de
Thu Aug 21 01:39:31 PDT 2008


On Wednesday, 20 August 2008 19:08:34 you wrote:

Hello,

Thank you very much for this info.

Good to know that Lustre is not the cause.

Less reassuring is that a silent disk failure can corrupt the whole file
system, which worries us because we use plenty of RAID arrays in our setup ....

Regards
Heiko

> Hi there
>
> I got the following background information from Juergen Kreuels at SGI
>
> "It turned out that a bad disk ( which did NOT report itself as being
> bad ) killed the lustre leading to data corruption due to inode areas on
> that disk.
> It was finally decided to remake the whole FS and only during that
> action we finally ( after nearly 48 h ) found that bad drive.
>
> It had nothing to do with the lustre FS itself. Lustre had been the
> victim of a HW failure on a Raid6 lun."
>
> I hope that this helps
>
> PJones
>
> Heiko Schroeter wrote:
> > Hello list,
> >
> > does anyone have more background info on what happened there?
> >
> > Regards
> > Heiko
> >
> >
> >
> >
> > HLRN News
> > ---------
> >
> >
> > Since Mon Aug 18, 2008 12:00 HLRN-II complex Berlin is open for users,
> > again.
> >
> > During the maintenance it turned out that the Lustre file system holding
> > the users $WORK and $TMPDIR was damaged completely.
> > The file system had to be reconstructed from scratch. All user data in
> > $WORK are lost.
> >
> > We hope that this event remains an exception. SGI apologizes for this
> > event.
> >
> > /Bka
> >
> > ========================================================================
> > This is an announcement for all HLRN Users




