[Lustre-discuss] Lustre client kernel panic
Nick Jennings
nick at creativemotiondesign.com
Sat Sep 26 05:53:53 PDT 2009
On Sat, 2009-09-26 at 04:11 -0400, Oleg Drokin wrote:
> Hello!
>
> On Sep 26, 2009, at 1:57 AM, Nick Jennings wrote:
>
> > About an hour ago the client completely hung. Hosting co. says it was
> > a kernel panic. I got not useful feedback in /var/log/messages from
> > the
> > client or the MDS. However from the OST I got several complaints.
> > (below).
> > Does anyone have any insight into the problem? All help as to how I
> > can
> > fix this, or avoid the problem, greatly appreciated.
>
> The traces you see is a known bug (19557), it happens when client is
> evicted
> that had too many locks cached.
> Unfortunately that provides us with zero insight into what happened to
> the client
> and MDS.
Hi Oleg! How ya doing? :)
Unfortunately that was the only info I could get. The client had no
information in the logs about what happened. The MDS only had the
following entry near the time:
Sep 25 22:28:43 dbn1 kernel: Lustre: MGS: haven't heard from client
ab5e5f08-e39d-385d-f7e3-fbd1addb0fac (at 10.0.0.21 at tcp1) in 248 seconds.
I think it's dead, and I am evicting it.
Is there any other info I should be gathering when something like this
happens? (Sorry, it's been a while since I've done any lustre bug
reporting) :)
Cheers,
-Nick
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 197 bytes
Desc: This is a digitally signed message part
URL: <http://lists.lustre.org/pipermail/lustre-discuss-lustre.org/attachments/20090926/d52caa22/attachment.pgp>
More information about the lustre-discuss
mailing list