[Lustre-discuss] two problems

Stefano Elmopi stefano.elmopi at sociale.it
Thu May 27 03:15:32 PDT 2010



Hi,

A clarification on what I wrote: the command that sends the MGS/MDS
server into a kernel panic is:

lfsck -d -v --mdsdb /root/mds_home_db --ostdb /root/home_ost00db /root/home_ost02db /LUSTRE

and not, as I wrote previously:

lfsck -c -v --mdsdb /root/mds_home_db --ostdb /root/home_ost00db /root/home_ost02db /LUSTRE

When I run the following on the Lustre client:
lfsck -c -v --mdsdb /root/mds_home_db --ostdb /root/home_ost00db /root/home_ost02db /LUSTRE

the following appears on the screen:
	.
	.
	.
[1] zero-length orphan objid 59
[1] zero-length orphan objid 28
[1] zero-length orphan objid 60
[1] zero-length orphan objid 29
[1] zero-length orphan objid 61
[1] zero-length orphan objid 30
[1] zero-length orphan objid 62
[1] zero-length orphan objid 31
[1] zero-length orphan objid 63
lfsck: ost_idx 1: pass3 OK (55 files total)
lfsck: can't find file for ost_idx 2
Files affected by missing ost info are : -
lfsck: pass4: check for duplicate object references
lfsck: pass4 OK (no duplicates)
lfsck: exit with 11 unfixed errors

and the log:
May 27 11:55:56 mdt02prdpom kernel: LustreError: 8030:0:(lov_ea.c: 
248:lsm_unpackmd_v1()) OST index 1 missing
May 27 11:55:56 mdt02prdpom kernel: LustreError: 8030:0:(lov_ea.c: 
248:lsm_unpackmd_v1()) Skipped 20 previous similar messages
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b20003, magic 0x0bd10bd0, pattern 0x1
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x2
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b20005, magic 0x0bd10bd0, pattern 0x1
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x3
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b20006, magic 0x0bd10bd0, pattern 0x1
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x4
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b20008, magic 0x0bd10bd0, pattern 0x1
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x5
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b2000a, magic 0x0bd10bd0, pattern 0x1
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x6
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b2000c, magic 0x0bd10bd0, pattern 0x1
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x7
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b2000e, magic 0x0bd10bd0, pattern 0x1
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x8
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b20014, magic 0x0bd10bd0, pattern 0x1
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x23
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b20015, magic 0x0bd10bd0, pattern 0x1
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x42
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b20017, magic 0x0bd10bd0, pattern 0x1
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x62
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b20018, magic 0x0bd10bd0, pattern 0x1
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:55:56 mdt02prdpom kernel: Lustre: 8030:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x82


Instead, when I run the following on the Lustre client:
lfsck -d --mdsdb /root/mds_home_db --ostdb /root/home_ost00db /root/home_ost02db /LUSTRE

the command hangs:
	.
	.
	.
	.
lfsck -d --mdsdb /root/mds_home_db --ostdb /root/home_ost00db /root/home_ost02db /LUSTRE
lfsck 1.41.10.sun2 (24-Feb-2010)
lfsck: ost_idx 0: pass1: check for duplicate objects
lfsck: ost_idx 0: pass1 OK (0 files total)
lfsck: ost_idx 0: pass2: check for missing inode objects
lfsck: ost_idx 0: pass2 OK (0 objects)
lfsck: ost_idx 0: pass3: check for orphan objects
lfsck: ost_idx 0: pass3 OK (218 files total)
lfsck: ost_idx 1: pass1: check for duplicate objects
lfsck: ost_idx 1: pass1 OK (11 files total)
lfsck: ost_idx 1: pass2: check for missing inode objects
[1]: /LUSTRE/BAK_CentOS-5.4-x86_64-bin-DVD.iso object 2 not created
[1]: /LUSTRE/UBUNTU_CentOS-5.4-x86_64-bin-DVD.iso object 3 not created
[1]: /LUSTRE/Windows_XP-Capodarco.iso object 4 not created
[1]: /LUSTRE/CentOS-5.3-i386-bin-DVD.iso object 5 not created
[1]: /LUSTRE/ubuntu-9.10-dvd-i386.iso object 6 not created
[1]: /LUSTRE/2.iso object 7 not created
[1]: /LUSTRE/BBBBB_CentOS-5.4-x86_64-bin-DVD.iso object 8 not created
[1]: /LUSTRE/XXXXXXXXX_CentOS-5.4-x86_64-bin-DVD.iso object 35 not created
[1]: /LUSTRE/FFFFF_CentOS-5.4-x86_64-bin-DVD.iso object 66 not created
[1]: /LUSTRE/zero.dat object 98 not created
[1]: /LUSTRE/KK_CentOS-5.4-x86_64-bin-DVD.iso object 130 not created
lfsck: ost_idx 1: pass2 ERROR: 11 dangling inodes found (11 files total)
lfsck: ost_idx 1: pass3: check for orphan objects
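
The pass2 lines above follow a regular pattern, so the list of affected
files can be pulled out mechanically. A minimal Python sketch, assuming
the line format is exactly as printed above
(`[ost_idx]: path object N not created`). The name `parse_dangling` is
made up for illustration and is not part of lfsck.

```python
import re

# Assumed line format, taken from the lfsck output above:
#   [1]: /LUSTRE/zero.dat object 98 not created
DANGLING_RE = re.compile(r"^\[(\d+)\]: (.+) object (\d+) not created$")


def parse_dangling(output):
    """Return (ost_idx, path, object_id) for each 'not created' line."""
    entries = []
    for line in output.splitlines():
        m = DANGLING_RE.match(line.strip())
        if m:
            entries.append((int(m.group(1)), m.group(2), int(m.group(3))))
    return entries


# Two lines copied from the output above, plus surrounding status lines.
sample = """\
lfsck: ost_idx 1: pass2: check for missing inode objects
[1]: /LUSTRE/2.iso object 7 not created
[1]: /LUSTRE/zero.dat object 98 not created
lfsck: ost_idx 1: pass2 ERROR: 11 dangling inodes found (11 files total)
"""

for ost_idx, path, object_id in parse_dangling(sample):
    print(ost_idx, object_id, path)
```

Status lines that do not match the pattern are skipped, so the whole
lfsck transcript can be passed in unchanged.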

At this point the MGS/MDS server goes into a kernel panic, and the Lustre client log says:
May 27 11:57:44 mdt02prdpom kernel: LustreError: 8031:0:(lov_ea.c: 
248:lsm_unpackmd_v1()) OST index 1 missing
May 27 11:57:44 mdt02prdpom kernel: LustreError: 8031:0:(lov_ea.c: 
248:lsm_unpackmd_v1()) Skipped 10 previous similar messages
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b20003, magic 0x0bd10bd0, pattern 0x1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x2
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b20005, magic 0x0bd10bd0, pattern 0x1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x3
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b20005, magic 0x0bd10bd0, pattern 0x1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x3
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b20006, magic 0x0bd10bd0, pattern 0x1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x4
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b20008, magic 0x0bd10bd0, pattern 0x1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x5
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b2000a, magic 0x0bd10bd0, pattern 0x1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x6
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b2000c, magic 0x0bd10bd0, pattern 0x1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x7
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b20006, magic 0x0bd10bd0, pattern 0x1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x4
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b20008, magic 0x0bd10bd0, pattern 0x1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b2000e, magic 0x0bd10bd0, pattern 0x1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x8
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x5
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b2000a, magic 0x0bd10bd0, pattern 0x1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x6
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b2000c, magic 0x0bd10bd0, pattern 0x1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x7
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b2000e, magic 0x0bd10bd0, pattern 0x1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b20014, magic 0x0bd10bd0, pattern 0x1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x23
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b20015, magic 0x0bd10bd0, pattern 0x1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x42
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b20017, magic 0x0bd10bd0, pattern 0x1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x62
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b20018, magic 0x0bd10bd0, pattern 0x1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8032:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x82
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x8
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b20014, magic 0x0bd10bd0, pattern 0x1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x23
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b20015, magic 0x0bd10bd0, pattern 0x1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x42
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b20017, magic 0x0bd10bd0, pattern 0x1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:57:44 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x62
May 27 11:57:45 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
64:lov_dump_lmm_common()) objid 0x1b20018, magic 0x0bd10bd0, pattern 0x1
May 27 11:57:45 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
May 27 11:57:45 mdt02prdpom kernel: Lustre: 8031:0:(lov_pack.c: 
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x82
May 27 11:58:16 mdt02prdpom kernel: Lustre: 8034:0:(client.c: 
1463:ptlrpc_expire_one_request()) @@@ Request x1336351691757947 sent  
from lustre01-MDT0000-mdc-ffff810133bb5c00 to NID 172.16.100.111 at tcp  
7s ago has timed out (7s prior to deadline).
May 27 11:58:16 mdt02prdpom kernel:   req at ffff81011f916000  
x1336351691757947/t0 o101->lustre01-MDT0000_UUID at 172.16.100.111@tcp: 
12/10 lens 568/1112 e 0 to 1 dl 1274954296 ref 1 fl Rpc:P/0/0 rc 0/0
May 27 11:58:16 mdt02prdpom kernel: Lustre: 8034:0:(client.c: 
1463:ptlrpc_expire_one_request()) Skipped 14 previous similar messages
May 27 11:58:16 mdt02prdpom kernel: Lustre: lustre01-MDT0000-mdc- 
ffff810133bb5c00: Connection to service lustre01-MDT0000 via nid  
172.16.100.111 at tcp was lost; in progress operations using this service  
will wait for recovery to complete.
May 27 11:58:31 mdt02prdpom kernel: Lustre: 3797:0:(import.c: 
517:import_select_connection()) lustre01-MDT0000-mdc-ffff810133bb5c00:  
tried all connections, increasing latency to 2s
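
The `subobj` values in the log are hexadecimal, while lfsck prints object
numbers in decimal. A minimal Python sketch for cross-checking the two,
assuming that the second hex value in `subobj 0x0/0x23` is the object id
(this is an interpretation, but it is consistent with the lfsck output:
0x23 = 35, 0x42 = 66, 0x62 = 98, 0x82 = 130). The name `stripe_objects`
is hypothetical.

```python
import re

# Matches the tail of the lov_dump_lmm_objects() lines, e.g.
#   stripe 0 idx 1 subobj 0x0/0x23
# Assumption: the value after the slash is the object id on that OST.
STRIPE_RE = re.compile(
    r"stripe (\d+) idx (\d+) subobj 0x([0-9a-fA-F]+)/0x([0-9a-fA-F]+)"
)


def stripe_objects(log_text):
    """Return sorted unique (ost_idx, object_id) pairs seen in the log."""
    found = set()
    for m in STRIPE_RE.finditer(log_text):
        ost_idx = int(m.group(2))
        object_id = int(m.group(4), 16)  # hex in the log, decimal in lfsck
        found.add((ost_idx, object_id))
    return sorted(found)


# Three stripe lines copied from the log above (one of them repeated).
sample = """\
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x2
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x23
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x82
84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x23
"""

print(stripe_objects(sample))  # [(1, 2), (1, 35), (1, 130)]
```

Because the pattern only needs the part of each message that sits on one
physical line, it also works on the log as wrapped by the mail client.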


Can anyone help me?
Thanks!






Ing. Stefano Elmopi
Gruppo Darco - Resp. ICT Sistemi
Via Ostiense 131/L Corpo B, 00154 Roma

cell. 3466147165
tel.  0657060500
email:stefano.elmopi at sociale.it

"Pursuant to and for the purposes of the law on the protection of
personal privacy (Legislative Decree no. 196/2003), this e-mail is
intended solely for the persons indicated above, and the information it
contains is to be considered strictly confidential. It is forbidden to
read, copy, use or disseminate the contents of this e-mail without
authorization. If you have received this message in error, please
return it to the sender. Thank you."

On 26 May 2010, at 17:43, Stefano Elmopi wrote:

>
>
> Hi,
>
> My version of Lustre is 1.8.3
> My filesystem is composed of one MGS/MDS server and two OSS.
> As a test, I tried to delete an OST and replace it with another OST,
> and now the situation is this:
>
> cat /proc/fs/lustre/lov/lustre01-mdtlov/target_obd
> 0: lustre01-OST0000_UUID ACTIVE
> 2: lustre01-OST0002_UUID ACTIVE
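
The listing above has entries for indices 0 and 2 but none for index 1.
A minimal Python sketch that spots such gaps, assuming each line of
`target_obd` has the form `index: UUID STATE` as shown above;
`missing_ost_indices` is a hypothetical helper name, not a Lustre tool.

```python
def missing_ost_indices(target_obd_text):
    """Return the OST indices absent from 0..max(index) in a target_obd listing."""
    present = set()
    for line in target_obd_text.splitlines():
        index_part = line.split(":", 1)[0].strip()
        if index_part.isdigit():
            present.add(int(index_part))
    if not present:
        return []
    return [i for i in range(max(present) + 1) if i not in present]


# The two lines from the listing above.
sample = """\
0: lustre01-OST0000_UUID ACTIVE
2: lustre01-OST0002_UUID ACTIVE
"""

print(missing_ost_indices(sample))  # [1]
```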
>
> - first problem
> lustre01-OST0001_UUID is the OST that was deleted. It held files,
> which of course are no longer there:
>
> ls -lrt
> total 12475312
> ?--------- ? ?    ?             ?            ? zero.dat
> ?--------- ? ?    ?             ?            ? ubuntu-9.10-dvd-i386.iso
> ?--------- ? ?    ?             ?            ? XXXXXXXXX_CentOS-5.4-x86_64-bin-DVD.iso
> ?--------- ? ?    ?             ?            ? Windows_XP-Capodarco.iso
> ?--------- ? ?    ?             ?            ? UBUNTU_CentOS-5.4-x86_64-bin-DVD.iso
> ?--------- ? ?    ?             ?            ? KK_CentOS-5.4-x86_64-bin-DVD.iso
> ?--------- ? ?    ?             ?            ? FFFFF_CentOS-5.4-x86_64-bin-DVD.iso
> ?--------- ? ?    ?             ?            ? CentOS-5.3-i386-bin-DVD.iso
> ?--------- ? ?    ?             ?            ? BBBBB_CentOS-5.4-x86_64-bin-DVD.iso
> ?--------- ? ?    ?             ?            ? BAK_CentOS-5.4-x86_64-bin-DVD.iso
> ?--------- ? ?    ?             ?            ? 2.iso
>
>
> To delete them, I followed these steps:
>
> On the MGS/MDS server:
>
> e2fsck -n -v --mdsdb /root/mds_home_db /dev/mpath/mpath2
>
> Copy the file mds_home_db to OSS_1 and, on OSS_1, run the
> following command:
>
> e2fsck -n -v --mdsdb /root/mds_home_db --ostdb /root/home_ost00db /dev/mpath/mpath1
>
> and do the same thing on OSS_2:
>
> e2fsck -n -v --mdsdb /root/mds_home_db --ostdb /root/home_ost01db /dev/mpath/mpath2
>
> Then copy the files mds_home_db, home_ost00db and home_ost01db to
> the Lustre client,
> mount the Lustre filesystem and run the command:
>
> lfsck -c -v --mdsdb /root/mds_home_db --ostdb /root/home_ost00db /root/home_ost02db /LUSTRE
>
> but the command hangs:
> 	
> 	.
> 	.
> 	.
> 	.
> [0] zero-length orphan objid 1182
> [0] zero-length orphan objid 1214
> [0] zero-length orphan objid 1246
> [0] zero-length orphan objid 1183
> [0] zero-length orphan objid 1215
> [0] zero-length orphan objid 1247
> lfsck: ost_idx 0: pass3 OK (218 files total)
> MDS: max_id 161 OST: max_id 65
> lfsck: ost_idx 1: pass1: check for duplicate objects
> lfsck: ost_idx 1: pass1 OK (11 files total)
> lfsck: ost_idx 1: pass2: check for missing inode objects
>
>
> and the MGS/MDS server goes into a kernel panic,
> and the Lustre client log says:
> May 26 17:39:35 mdt02prdpom kernel: LustreError: 7105:0:(lov_ea.c: 
> 248:lsm_unpackmd_v1()) OST index 1 missing
> May 26 17:39:35 mdt02prdpom kernel: LustreError: 7105:0:(lov_ea.c: 
> 248:lsm_unpackmd_v1()) Skipped 21 previous similar messages
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 64:lov_dump_lmm_common()) objid 0x1b20003, magic 0x0bd10bd0, pattern  
> 0x1
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x2
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 64:lov_dump_lmm_common()) objid 0x1b20005, magic 0x0bd10bd0, pattern  
> 0x1
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x3
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 64:lov_dump_lmm_common()) objid 0x1b20006, magic 0x0bd10bd0, pattern  
> 0x1
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x4
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 64:lov_dump_lmm_common()) objid 0x1b20008, magic 0x0bd10bd0, pattern  
> 0x1
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x5
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 64:lov_dump_lmm_common()) objid 0x1b2000a, magic 0x0bd10bd0, pattern  
> 0x1
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x6
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 64:lov_dump_lmm_common()) objid 0x1b2000c, magic 0x0bd10bd0, pattern  
> 0x1
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x7
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 64:lov_dump_lmm_common()) objid 0x1b2000e, magic 0x0bd10bd0, pattern  
> 0x1
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x8
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 64:lov_dump_lmm_common()) objid 0x1b20014, magic 0x0bd10bd0, pattern  
> 0x1
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x23
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 64:lov_dump_lmm_common()) objid 0x1b20015, magic 0x0bd10bd0, pattern  
> 0x1
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x42
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 64:lov_dump_lmm_common()) objid 0x1b20017, magic 0x0bd10bd0, pattern  
> 0x1
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x62
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 64:lov_dump_lmm_common()) objid 0x1b20018, magic 0x0bd10bd0, pattern  
> 0x1
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 67:lov_dump_lmm_common()) stripe_size 1048576, stripe_count 1
> May 26 17:39:35 mdt02prdpom kernel: Lustre: 7105:0:(lov_pack.c: 
> 84:lov_dump_lmm_objects()) stripe 0 idx 1 subobj 0x0/0x82
>
>
> - second problem
> While testing quotas, when I run the command:
>
> lfs quotacheck -ug /LUSTRE/
> quotacheck failed: Input/output error
>
>
> and the log says:
>
> kernel: LustreError: 7103:0:(quota_check.c:251:lov_quota_check())  
> lov idx 1 inactive
>
>
>
> Thanks !!
>
>
>
>
> Ing. Stefano Elmopi
> Gruppo Darco - Resp. ICT Sistemi
> Via Ostiense 131/L Corpo B, 00154 Roma
>
> cell. 3466147165
> tel.  0657060500
> email:stefano.elmopi at sociale.it
>
>
