[Lustre-discuss] mds server crashing

Bernd Schubert bernd.schubert at fastmail.fm
Sat Mar 14 04:35:08 PDT 2009


On Saturday 14 March 2009, Mag Gam wrote:
> We are having a problem with an MDS server (the box also has 1 OST).
>
> When the server boots up, we notice there is an ll_mdt process running
> at 100% and we keep on waiting close to 10-15 mins. We only have 8
> clients. (I assume this is the normal recovery process.) However, if I
> manually mount the mdt without any recovery, everything is fine.

Hmm, I have seen that with 1.6.4.3 and NFS exports, but that should be fixed
in 1.6.5. I'm not sure, though, since we switched our NFS exports to unfs3
when the problem came up and have used it ever since.

>
> Mar 12 10:11:02 protected_host_01 kernel: Pid: 10375, comm: ll_mdt_10
> Tainted: G      2.6.18-92.1.17.el5_lustre.1.6.7smp #1
> Mar 12 10:11:02 protected_host_01 kernel: RIP:
> 0010:[<ffffffff888ed8df>]  [<ffffffff888ed8df>]
>
> :ldiskfs:do_split+0x3ef/0x560
>
> Mar 12 10:11:02 protected_host_01 kernel: RSP: 0018:ffff8103d2a5f460
> EFLAGS: 00000216
> Mar 12 10:11:02 protected_host_01 kernel: RAX: 0000000000000000 RBX:
> 0000000000000080 RCX: 0000000000000000
> Mar 12 10:11:02 protected_host_01 kernel: RDX: 0000000000000080 RSI:
> ffff8103cd52177c RDI: ffff8103cd52176c

Any chance you can send the traces with line wrapping disabled? With the
lines wrapped, they are quite hard to read.
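
If resending is not convenient, a wrapped trace can usually be stitched back
together after the fact. The following is only a rough sketch: the function
name unwrap_syslog is made up for illustration, and it assumes every real log
record starts with a syslog timestamp like "Mar 12 10:11:02" and that the
mail client wrapped lines at whitespace. Anything that does not start with a
timestamp is treated as a continuation of the previous record.

```python
import re

# Assumption: a syslog record starts with a timestamp such as "Mar 12 10:11:02".
RECORD_START = re.compile(r"^[A-Z][a-z]{2}\s+\d{1,2}\s\d{2}:\d{2}:\d{2}\s")


def unwrap_syslog(text):
    """Join mail-wrapped continuation lines back onto their syslog record."""
    records = []
    for raw in text.splitlines():
        # Drop leading "> " quote markers added by the mail client.
        line = re.sub(r"^(>\s?)+", "", raw).strip()
        if not line:
            continue  # skip empty lines left behind by the wrapping
        if RECORD_START.match(line) or not records:
            records.append(line)          # a new log record begins here
        else:
            records[-1] += " " + line     # continuation of the previous record
    return records


if __name__ == "__main__":
    import sys
    for record in unwrap_syslog(sys.stdin.read()):
        print(record)
```

Saved to a file, it can be fed the quoted trace on stdin and prints one
record per line.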


Cheers,
Bernd



