[lustre-discuss] strange problem mounting lustre

Riccardo Veraldi Riccardo.Veraldi at cnaf.infn.it
Fri May 13 17:36:09 PDT 2016


Hello,
Actually I have a problem which could be similar to this:

https://groups.google.com/forum/#!topic/lustre-discuss-list/ED6rxVGuKM8

I am running oss and mds on Centos 6.4 with lustre 2.4

Suddently my JBOD array failed. It has something like 12 OST on it.
When I established it back,  one of the OST was not mounting anymore 
from new clients:

When I try to mount from new clients I get this error: failed: Function 
not implemented

But the old clients which has the OST already mounted are working fine. 
Anyway if I dismount the lustre partiion, they cannot mount it anymore 
with same error:

failed: Function not implemented

the mds is running not on the failed JBOD array but on another storage.

So I have no clue why this is happening. On the MDS and OSS side I have 
no error and no hint in the lustre logs when clients try to mount.
I have the error on client side and that particular OST cannot be 
mounted anymore.

I already did a fsck on the /mgsgdt device partition which is a LVM.
but this did not help fixing the problem.

How could I debug what is going wrong ?
It never happened before.
Looks like something is corrupted somewhere but clients which are 
already mounting the the OST (thru mds) are working like a charm.
Remounting after dismounting won't work.
I did not reboot the oss yet because I am afraid of the consequences.
So I first wrote here to have some hint.

Cheers

Rick

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.lustre.org/pipermail/lustre-discuss-lustre.org/attachments/20160513/fbf92ff6/attachment.htm>


More information about the lustre-discuss mailing list