[Lustre-discuss] [SPAM] Re: [wc-discuss] The ost_connect operation failed with -16

HUANG Qiulan huangql at ihep.ac.cn
Wed May 30 03:47:15 PDT 2012


Hi Adrian,


> -----原始邮件-----
> 发件人: "Adrian Ulrich" <adrian at blinkenlights.ch>
> 发送时间: 2012年5月30日 星期三
> 收件人: huangql <huangql at ihep.ac.cn>
> 抄送: lustre-discuss <lustre-discuss at lists.lustre.org>, wc-discuss <wc-discuss at whamcloud.com>
> 主题: [SPAM] Re: [wc-discuss] The ost_connect operation failed with -16
> 
> Hello,
> 
> 
> > May 30 09:58:36 ccopt kernel: LustreError: 11-0: an error occurred while communicating with 192.168.50.123 at tcp. The ost_connect operation failed with -16
> 
> Error -16 stands for -EBUSY
> 
> 
> > When you got this error message, you failed to run "ls", "df" ,"vi", "touch" and so on, which affect us to do anything in the file system.
> 
> That's to be expected in such a situation. I suppose that 'lfs check servers' returned 'temporarily unavailable' for some OSTs ?
> 

Yes, we can get 'temporarily unavailable' message on the client and it can reconnect to the OST after minutes or even hours.

However, the users cannot do any interactive actions in the file system which is not accepted by them. Do you have some other measures to solve this problem?


> 
> > I think the ost_connect failure could report some error messages to users instead of  causing any interactive actions stuck.
> 
> No: Users shouldn't get an error in such a situation: The filesystem will just hang until the situation recovered (= the client was able to re-connect to the OST).
> 
> 
> 
> Regards,
>  Adrian




More information about the lustre-discuss mailing list