[Lustre-discuss] MDT connection refusal: still busy with 2 active RPCs

Brian J. Murrell Brian.Murrell at Sun.COM
Thu Apr 9 10:44:46 PDT 2009


On Thu, 2009-04-09 at 19:17 +0200, Thomas Roth wrote:
> Hi all,

Hi.

> ldlm_lib.ctarget_handle_connect lustre-MDT0000: refuse reconnection from
> 77cbd453-ee72-fe75-cb06-c49179e0a011 at Lustre-Client@tcp to
> 0xffff810111341000; still busy with 2 active RPCs
> 
> These messages are surrounded by an increasing amount of "triggered
> watchdogs" and Log-dumps, which contain pretty much what can also be
> seen in /var/log/kern.log.

If the server never makes progress with those outstanding RPCs, that's
usually a deadlock issue.  If you are not already on 1.6.7 I would
suggest upgrading to 1.6.7.1 (or patching 1.6.7 yourself with the MDS
corruption bug fix) when it becomes available and see if the problem
persists.

b.

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 197 bytes
Desc: This is a digitally signed message part
URL: <http://lists.lustre.org/pipermail/lustre-discuss-lustre.org/attachments/20090409/81b31603/attachment.pgp>


More information about the lustre-discuss mailing list