[Lustre-discuss] making a client reconnect to OST

Jim Harm harm1 at llnl.gov
Mon Feb 4 08:31:40 PST 2008


Is there a tool that will really attempt a reconnect from a client to 
a single OST?
it would be helpful for those rare cases
	when this happens and there is nothing really wrong with either.
i imagine original cause could be something as simple as repeated delays
	on a very busy network?
Other OSTs from the same OSS remained connected to the same client
	during this problem.
If umount and mount could be avoided,
	it would be less disruptive to other processes on the client.

At 2:10 PM -0800 1/25/08, Jim Harm wrote:
>On the client i tried the lctl --device $number deactivate
>which worked
>followed by
>llctl --device $number activate
>which i believe should have done the same thing
>this failed without error notice to me.
>
>i ended up having to umount and mount, which finally reconnected the ost.
>
>At 12:55 PM -0700 1/25/08, Andreas Dilger wrote:
>>On Jan 24, 2008  10:23 -0500, Brock Palen wrote:
>>>   I have a client (one of our login nodes) that was evicted by one of
>>>   the OST's but not both of them.  So some files are accessible others
>>>   are not.  Strange thing is that both the OST's live on the same OSS.
>>>
>>>   Is there a way to ask lustre to restore this?  Up
>>>   till this point, the client would recover quickly, but this time its
>>>   just waiting.
>>
>>You could try "lctl --device {OSC device in question} recover".
>>
>>Cheers, Andreas
>>--
>>Andreas Dilger
>>Sr. Staff Engineer, Lustre Group
>>Sun Microsystems of Canada, Inc.
>>
>>_______________________________________________
>>Lustre-discuss mailing list
>>Lustre-discuss at lists.lustre.org
>>http://lists.lustre.org/mailman/listinfo/lustre-discuss
>
>
>--
>}}}===============>>  LLNL
>James E. Harm (Jim); jharm at llnl.gov
>System Administrator, ICCD Clusters
>(925) 422-4018 Page: 423-7705x57152
>_______________________________________________
>Lustre-discuss mailing list
>Lustre-discuss at lists.lustre.org
>http://lists.lustre.org/mailman/listinfo/lustre-discuss


-- 
}}}===============>>  LLNL
James E. Harm (Jim); jharm at llnl.gov
System Administrator, ICCD Clusters
(925) 422-4018 Page: 423-7705x57152



More information about the lustre-discuss mailing list