[Lustre-discuss] making a client reconnect to OST
Brock Palen
brockp at umich.edu
Thu Jan 24 07:23:54 PST 2008
I have a client (one of our login nodes) that was evicted by one of
the OST's but not both of them. So some files are accessible others
are not. Strange thing is that both the OST's live on the same OSS.
The errors in dmesg are:
LustreError: 11-0: an error occurred while communicating with
141.212.30.181 at tcp. The obd_ping operation failed with -107
Lustre: nobackup-OST0001-osc-000001007d548400: Connection to service
nobackup-OST0001 via nid 141.212.30.181 at tcp was lost; in progress
operations using this service will wait for recovery to complete.
LustreError: 167-0: This client was evicted by nobackup-OST0001; in
progress operations using this service will fail.
LustreError: 29595:0:(file.c:1052:ll_glimpse_size()) obd_enqueue
returned rc -5, returning -EIO
LustreError: 29629:0:(file.c:1052:ll_glimpse_size()) obd_enqueue
returned rc -5, returning -EIO
OST0000 also lives at 141.212.30.181, so its strange that only one
will kill it off. Is there a way to ask lustre to restore this? Up
till this point, the client would recover quickly, but this time its
just waiting.
Brock Palen
Center for Advanced Computing
brockp at umich.edu
(734)936-1985
More information about the lustre-discuss
mailing list