[Lustre-discuss] ldlm_cli_cancel_req()) Got rc -108 from cancel RPC: canceling anyway

Heiko Schröter schroete at iup.physik.uni-bremen.de
Wed Dec 23 03:57:27 PST 2009


Am Mittwoch 23 Dezember 2009 12:22:17 schrieb Christopher J. Walker:
> > 
> > Dec 22 17:18:49 proof kernelLustreError: 10917:0:
> > (ldlm_request.c:1030:ldlm_cli_cancel_req()) Got rc -108 from cancel RPC: 
> > canceling anyway
> > Dec 22 17:18:49 proof kernelLustreError: 10917:0:
> > (ldlm_request.c:1030:ldlm_cli_cancel_req()) Skipped 169 previous similar 
> > messages
> > Dec 22 17:18:49 proof kernelLustreError: 10917:0:
> > (ldlm_request.c:1533:ldlm_cli_cancel_list()) ldlm_cli_cancel_list: -108
> > Dec 22 17:18:49 proof kernelLustreError: 10917:0:
> > (ldlm_request.c:1533:ldlm_cli_cancel_list()) Skipped 169 previous similar 
> > messages
> > Dec 22 17:18:49 proof kernelLustre: client ffff81042fccf400 umount complete
> > Dec 22 17:19:02 proof kernelLustre: Client userdata-client has started
> > 
> > Is anybody else seeing these messages in this situation? Does anyboyd know for 
> > a workaround??
> 
> Like Ewan, our Lustre filesystem is automounted. Whilst I haven't done a 
> detailed study, it does look as though these messages occur immediately 
> before unmounting the filesystem.

Yes. These messages do occur before 'auto'-un-mounting. So nothing to worry about.
The above is the mount process.

Unmounting should look like this:
Jun 17 04:00:16 cluster1 LustreError: 6460:0:(ldlm_request.c:1043:ldlm_cli_cancel_req()) Got rc -108 from cancel RPC: canceling anyway
Jun 17 04:00:16 cluster1 LustreError: 6460:0:(ldlm_request.c:1632:ldlm_cli_cancel_list()) ldlm_cli_cancel_list: -108
Jun 17 04:00:16 cluster1 Lustre: client ffff8100c44d1000 umount complete

If you don't see the last line 'umount complete' automount + lustre will hang and there should be no further access to the lustre system.
Happend to us in our scenario.

> 
> Is automounting a bad idea?

It depends. We had some bad experiences with lustre-1.6.6 and automount. See the mail archive about it. Subject: 'Stalled autofs + lustre'
Our problem should be resolved with upgrading to 1.8.x.
We will test again in Jan/Feb 2010 when the upgrade is sheduled.

Regards
Heiko



More information about the lustre-discuss mailing list