[Lustre-discuss] inode weirdness

Stuart Midgley sdm900 at gmail.com
Fri Sep 4 08:31:12 PDT 2009


Evening

The file was created on the same node it was access from.

The error isn't permanent.  When the job crashed, I went and started  
investigating and the file was fine.

No, the file is never unlinked.

How do I go about getting a lustre log?


-- 
Dr Stuart Midgley
sdm900 at gmail.com



On 04/09/2009, at 11:28 PM, Oleg Drokin wrote:

> Hello!
>
> On Sep 4, 2009, at 5:35 AM, Stu Midgley wrote:
>
>> I am having jobs on a cluster client crash.  The job creates a small
>> text file (using cp) and then immediately tries to use it with  
>> another
>> application.  The application fails saying the file doesn't exist.
>
> That's quite strange for such a sequence of actions.
> Is the file created on one node and accessed on another?
> How permanent is the error ? (i.e. does it still happen when you  
> later access the file again?)
> Is the file unlinked at any time, could there be a race with unlink  
> by any chance?
>
>> In the client /var/log/messages, I'm seeing
>> Sep  4 15:58:17 clus039 kernel: LustreError:
>> 15249:0:(file.c:2930:ll_inode_revalidate_fini()) failure -2 inode
>> 75792903
>
> There is bug 16377 about this same message, though it is not clear  
> what happened there.
> Perhaps you can gather -1 lustre logs from mds and a client that  
> creates
> and client that accesses this file and gets an error and attach  
> those to the bug 16377?
>
> Bye,
>    Oleg




More information about the lustre-discuss mailing list