[Lustre-discuss] MDS overload, why?

Arne Brutschy arne.brutschy at ulb.ac.be
Fri Oct 9 07:15:08 PDT 2009


Hi,

thanks for replying!

I understand that without further information we can't do much about the
oopses. I was more hoping for some information regarding possible
sources of such an overload. Is it normal that a MDS gets overloaded
like this, while the OSTs have nothing to do, and what can I do about
it? How can I find the source of the problem?

More specifically, what are the operations that lead to a lot of MDS
load and none for the OSTs? Although our MDS (8GB ram, 2x4core, SATA) is
not a top-notch server, it's fairly recent and I feel the load we're
experiencing is not handable by a single MDS.

My problem is that I can't make out major problems in the user's jobs
running on the cluster, and I can't quantify nor track down the problem
because I don't know what behavior might have caused it. 

As I said, ooppses appeared only twice, and all other problems where
just apparent by a non-responsive MDS.

Thanks,
Arne


On Fr, 2009-10-09 at 07:44 -0400, Brian J. Murrell wrote:
> On Fri, 2009-10-09 at 10:26 +0200, Arne Brutschy wrote:
> > 
> > The clients showed the following error:
> > > Oct  8 09:58:55 majorana kernel: LustreError: 3787:0:(events.c:66:request_out_callback()) @@@ type 4, status -5  req at f6222800 x8702488/t0 o250->MGS at 10.255.255.206@tcp:26/25 lens 304/456 e 0 to 1 dl 1254988740 ref 2 fl Rpc:N/0/0 rc 0/0
> > > Oct  8 09:58:55 majorana kernel: LustreError: 3787:0:(events.c:66:request_out_callback()) Skipped 33 previous similar messages
> > 
> > So, my question is: what could cause such a load? The cluster was not
> > exessively used... Is this a bug or a user's job that creates the load?
> > How can I protect lustre against this kind of failure?
> 
> Without any more information we could not possibly know.  If you really
> are getting oopses then you will need console logs (i.e. serial console)
> so that we can see the stack trace.
> 
> b.
> 
> _______________________________________________
> Lustre-discuss mailing list
> Lustre-discuss at lists.lustre.org
> http://lists.lustre.org/mailman/listinfo/lustre-discuss
-- 
Arne Brutschy
Ph.D. Student                    Email    arne.brutschy(AT)ulb.ac.be
IRIDIA CP 194/6                  Web      iridia.ulb.ac.be/~abrutschy
Universite' Libre de Bruxelles   Tel      +32 2 650 3168
Avenue Franklin Roosevelt 50     Fax      +32 2 650 2715
1050 Bruxelles, Belgium          (Fax at IRIDIA secretary)




More information about the lustre-discuss mailing list