[Lustre-discuss] meta freezing
Andreas Dilger
adilger at sun.com
Tue Oct 14 13:22:06 PDT 2008
On Oct 14, 2008 13:07 +0200, Papp Tamas wrote:
> Since we switched from 1.6.4.3 to 1.6.5.1 on one of our clusters, we
> have had a weird problem.
>
> One of the nodes of the cluster locks up and only a reset helps; it's
> usually the meta node. That is bad enough, but there is more.
> When the node comes up again and recovery starts, it locks up over
> and over again. It's counting down and sometimes there are only a few
> clients left, sometimes there are no more clients, but it always
> locks up.
>
> So I mount the MDT, umount -f, and mount again; the recovery status is
> then INACTIVE and the cluster is working.
>
> Now I'm out of ideas.
>
> The cluster was made with 1.6.5.1. Is it safe to move back to 1.6.4.3? I
> mean just changing the utilities and the kernel and that's all, or do I
> need to take further steps?
Yes, it should always be possible to downgrade to the older version.
In some cases in the future (e.g. a 2.0 -> 1.8.x downgrade) it will be
necessary to remount the clients, but the general consensus is that if
you are downgrading you already have major problems, so a remount will
not contribute significantly to the problem.
Cheers, Andreas
--
Andreas Dilger
Sr. Staff Engineer, Lustre Group
Sun Microsystems of Canada, Inc.