[Lustre-discuss] Hung Lustre filesystem until a remount

Jeremy Mann jeremy at biochem.uthscsa.edu
Fri Jan 23 12:51:21 PST 2009

Andreas Dilger wrote:

> With so many skipped messages, it appears this node is in a tight loop for
> some reason.  Is this client mounted on the same node as the MDS perhaps?
> That isn't an excuse for hitting such a problem, but might explain why
> it was in such a tight loop that it was DOS-ing your filesystem.

We moved the MGS/MDT onto a separate node quite a while ago. This is
just a client connecting to our OSTs.

> If it is, it might be the statahead bug.  Please check the archives for
> many discussions of workarounds.  There was also a recent patch (not in
> any release yet) to fix the lock dynamic LRU sizing code to use less CPU,
> which may have contributed to this problem.

Thank you Andreas, I will do that.

