[lustre-discuss] mdt: unhealthy - healthy

Andreas Dilger adilger at whamcloud.com
Mon Jul 29 20:29:51 PDT 2019

On Jul 26, 2019, at 04:28, Thomas Roth <t.roth at gsi.de> wrote:

Hi all,

this morning one of our MDTs went 'unhealthy':

Jul 26 10:15:13 lxmds20 kernel: LustreError: 9510:0:(service.c:3285:ptlrpc_svcpt_health_check())
mdt: unhealthy - request has been waiting 1017s

However, somewhat later,

lxmds20:~# cat /sys/fs/lustre/health_check

and all Lustre operations seem to be good, too.

This means that some RPC was stuck for a long time (1017s in the message above), but if that RPC eventually completes then there is no longer any reason for the MDS to be reported as "unhealthy".
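
For anyone who wants to track this kind of transient state, here is a minimal Python sketch (not part of Lustre). It assumes that a healthy node reports exactly the string "healthy" in /sys/fs/lustre/health_check and treats anything else as unhealthy; the helper names is_healthy, parse_wait and read_health are made up for illustration, and the regular expression is based only on the single kernel message quoted above.

```python
import re
from pathlib import Path

# Path taken from the command quoted in this thread.
HEALTH_FILE = Path("/sys/fs/lustre/health_check")

# Pattern modelled on the one kernel message quoted above; other
# Lustre versions may word it differently (assumption).
WAIT_RE = re.compile(r"(\w+): unhealthy - request has been waiting (\d+)s")


def is_healthy(text):
    """Return True if the health_check contents say 'healthy'.

    Assumption: a healthy node reports exactly 'healthy'; any other
    content is treated as unhealthy.
    """
    return text.strip() == "healthy"


def parse_wait(line):
    """Extract (service, seconds_waiting) from a log line, or None."""
    match = WAIT_RE.search(line)
    if match is None:
        return None
    return match.group(1), int(match.group(2))


def read_health(path=HEALTH_FILE):
    """Read the health file once and report whether it looks healthy."""
    return is_healthy(path.read_text())
```

Calling read_health() periodically on the MDS and logging each change of state would show how long the "unhealthy" window lasted, and parse_wait() can pull the waiting time out of the syslog line to correlate the two.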

Cheers, Andreas
Andreas Dilger
Principal Lustre Architect
