[Lustre-discuss] making a client reconnect to OST

Brock Palen brockp at umich.edu
Thu Jan 24 07:23:54 PST 2008


I have a client (one of our login nodes) that was evicted by one of
the two OSTs but not the other, so some files are accessible and
others are not.  The strange thing is that both OSTs live on the same
OSS.

The errors in dmesg are:

LustreError: 11-0: an error occurred while communicating with 141.212.30.181@tcp. The obd_ping operation failed with -107
Lustre: nobackup-OST0001-osc-000001007d548400: Connection to service nobackup-OST0001 via nid 141.212.30.181@tcp was lost; in progress operations using this service will wait for recovery to complete.
LustreError: 167-0: This client was evicted by nobackup-OST0001; in progress operations using this service will fail.
LustreError: 29595:0:(file.c:1052:ll_glimpse_size()) obd_enqueue returned rc -5, returning -EIO
LustreError: 29629:0:(file.c:1052:ll_glimpse_size()) obd_enqueue returned rc -5, returning -EIO
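
To see at a glance which targets have evicted a client, messages like
the ones above can be filtered with a short script.  This is only a
sketch: it assumes the eviction message is worded exactly as in this
dmesg output, and the function name evicted_targets is made up for
illustration, not part of any Lustre tool.

```python
import re

# Matches the eviction message as it appears in the dmesg output above.
# Assumption: other Lustre versions may word this message differently.
EVICTED_RE = re.compile(r"This client was evicted by (\S+?);")


def evicted_targets(log_text):
    """Return the sorted, de-duplicated target names that report an eviction."""
    # Long messages may be wrapped across lines, so collapse all
    # whitespace into single spaces before matching.
    flat = " ".join(log_text.split())
    return sorted(set(EVICTED_RE.findall(flat)))


if __name__ == "__main__":
    sample = (
        "LustreError: 167-0: This client was evicted by nobackup-OST0001; in\n"
        "progress operations using this service will fail.\n"
    )
    for target in evicted_targets(sample):
        print(target)
```

Feeding it the saved output of dmesg should list only nobackup-OST0001
here, which matches the observation that OST0000 is still reachable.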


OST0000 also lives at 141.212.30.181, so it's strange that only one
of them would evict the client.  Is there a way to ask Lustre to
restore the connection?  Up to this point the client has always
recovered quickly, but this time it is just waiting.

Brock Palen
Center for Advanced Computing
brockp at umich.edu
(734)936-1985