[lustre-discuss] mdt: unhealthy - healthy
adilger at whamcloud.com
Mon Jul 29 20:29:51 PDT 2019
On Jul 26, 2019, at 04:28, Thomas Roth <t.roth at gsi.de<mailto:t.roth at gsi.de>> wrote:
this morning one of our MDT went 'unhealthy',
Jul 26 10:15:13 lxmds20 kernel: LustreError: 9510:0:(service.c:3285:ptlrpc_svcpt_health_check())
mdt: unhealthy - request has been waiting 1017s
However, somewhat later,
lxmds20:~# cat /sys/fs/lustre/health_check
and all Lustre operations seem to be good, too.
This means that some RPC has been stuck, but if the RPC eventually completes then there is no reason for the MDS to be "unhealthy" anymore.
Principal Lustre Architect
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the lustre-discuss