[Lustre-discuss] Frequent appearence of LustreError: no handle for file close ino

Christos Theodosiou ctheodos at grid.auth.gr
Mon Apr 19 01:09:40 PDT 2010


On Fri, 2010-04-16 at 10:49 -0700, Andreas Dilger wrote: 
> On 2010-04-16, at 01:27, Christos Theodosiou wrote:
> > our lustre installation uses two failover MDSes, which serve 10
> > file-systems. We recently upgraded from 1.8.1.1 to 1.8.2 version.
> >
> > By monitoring the MDSes I noticed that we get frequent error messages
> > (1-4 times/hour) which looklike this:
> >
> > Apr 16 10:36:10 lustre01 kernel: LustreError:
> > 31386:0:(mds_open.c:1666:mds_close()) @@@ no handle for file close ino
> > 4011004: cookie 0x699184f2b7f715d6  req at ffff8100621d2c00
> > x1332632126967249/t0 o35->60822f67-b798-f355-d5b1-54d7d7fa9f15@:0/0  
> > lens
> > 408/472 e 0 to 0 dl 1271403376 ref 1 fl Interpret:/0/0 rc 0/0
> 
> This likely means some client was evicted in the past, but later on it  
> is closing a file that was previously opened.  This is not an error to  
> be concerned about, and we should probably just turn it off.
> 
> > o35->60822f67-b798-f355-d5b1-54d7d7fa9f15@:0/0 lens 408/400 e 0 to 0  
> > dl
> 
> You can find which client this is via (something like, I'm not logged on
> my system):
> 
> lfs get_param mds.*.exports.*.uuid | grep 60822f67-b798-f355- 
> d5b1-54d7d7fa9f15

lctl get_param mds.*.exports.*.uuid | grep 60822f67-b798-f355-
d5b1-54d7d7fa9f15

worked fine. This command provides info about the server and the client.
Is there any way to get information about the file of the error message?

> 
> > 1271403376 ref 1 fl Interpret:/0/0 rc -116/0
> > Apr 16 10:36:10 lustre01 kernel: LustreError:
> > 31386:0:(ldlm_lib.c:1848:target_send_reply_msg()) Skipped 1 previous
> > similar message
> >
> > and
> > Apr 16 11:07:51 lustre02 kernel: LustreError:
> > 6305:0:(mds_open.c:1666:mds_close()) @@@ no handle for file close ino
> > 2264949: cookie 0xc4e85523602d2686  req at ffff81006787c000
> > x1332654888280600/t0 o35->52c78d24-f0da-a56e-9bc5-2e16e6e78790@:0/0  
> > lens
> > 408/976 e 0 to 0 dl 1271405277 ref 1 fl Interpret:/0/0 rc 0/0
> > Apr 16 11:07:51 lustre02 kernel: LustreError:
> > 6305:0:(ldlm_lib.c:1848:target_send_reply_msg()) @@@ processing error
> > (-116)  req at ffff81006787c000 x1332654888280600/t0
> > o35->52c78d24-f0da-a56e-9bc5-2e16e6e78790@:0/0 lens 408/688 e 0 to 0  
> > dl
> > 1271405277 ref 1 fl Interpret:/0/0 rc -116/0
> >
> > I would like to ask:
> > a) is this a critical error message?
> > b) is there any way to find out more info about it. e.g. filesystem,
> > filename and lustre client that are related to this error?
> > c) is there any way to resolve this errors?
> >
> 
> 
> Cheers, Andreas
> --
> Andreas Dilger
> Principal Engineer, Lustre Group
> Oracle Corporation Canada Inc.
> 

Thanks, Christos




More information about the lustre-discuss mailing list