[Lustre-discuss] Using drbd: reformat disk or only sync ?

Dam Thanh Tung tungdt at isds.vn
Fri Nov 20 18:36:59 PST 2009


Hi list

We had a problem with our OST few days ago ( i also posted my question here
), after rebuilding our RAID partition, we use drbd to re-synchronize our
data from active OST to this backup one. We just started drbd from OST (
which has been rebuild RAID partition ) and connect with drbd on an working
OST. Everything was fine and the synchronization completed without any error
report. But, when we mount this backup OST in to our system, some of web
client can't connect to it ( MDS and some others can ) and after a short
time, in that OST message log, we see many error report like this :

*Nov 19 19:59:36 OST6 kernel: LDISKFS-fs error (device drbd6):
ldiskfs_lookup: unlinked inode 159588368 in dir #261333022

Nov 19 19:59:36 OST6 kernel: LustreError:
3893:0:(filter_lvb.c:90:filter_lvbo_init()) lustre-OST0006: bad object
996598/0: rc -2

Nov 19 19:59:36 OST6 kernel: LustreError:
3893:0:(filter_lvb.c:90:filter_lvbo_init()) Skipped 7 previous similar
messages

Nov 19 19:59:36 OST6 kernel: LustreError:
3893:0:(ldlm_resource.c:858:ldlm_resource_add()) lvbo_init failed for
resource 996598: rc -2

Nov 19 19:59:36 OST6 kernel: LustreError:
3893:0:(ldlm_resource.c:858:ldlm_resource_add()) Skipped 7 previous similar
messages

*

*Nov 19 19:59:40 OST6 kernel: LDISKFS-fs error (device drbd6):
ldiskfs_lookup: unlinked inode 261038084 in dir #261333008*

*Nov 19 19:59:45 OST6 kernel: LDISKFS-fs error (device drbd6):
ldiskfs_lookup: unlinked inode 229924884 in dir #261333024*

*Nov 19 19:59:47 OST6 kernel: LDISKFS-fs error (device drbd6):
ldiskfs_lookup: unlinked inode 228163899 in dir #261333024*

*Nov 19 19:59:54 OST6 kernel: LDISKFS-fs error (device drbd6):
ldiskfs_lookup: unlinked inode 165830658 in dir #261333015*

I tried umount this drbd disk and use e2fsck to check it, it showed me that
this disk is clean, but after re-mount, it went wrong once again

>From now, i umounted it, and reformat this disk like this:

 *mkfs.lustre --reformat --verbose --writeconf --ost
--mgsnode=192.168.1.78 at tcp:192.168.1.80 at tcp
--failover=192.168.1.83 at tcp--index=6 /dev/sdd
*

it completed without any error. And i'm trying resynchronize this drbd disk
with an working OST node. It will take pretty long time.

And, after all, the question i want to rise here is:

In order to using drbd as back up solution as i described above, do we need
to reformat disk before synchronize data or just sync it directly ?

Could you please give me an advice or suggestion in my situation ?

Thanks in advance

Best regards
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.lustre.org/pipermail/lustre-discuss-lustre.org/attachments/20091121/a576787e/attachment.htm>


More information about the lustre-discuss mailing list