[Lustre-discuss] Fwd: can't mount our lustre filesystem after tunefs.lustre --writeconf

Stu Midgley sdm900 at gmail.com
Sat Mar 17 02:10:33 PDT 2012


---------- Forwarded message ----------
From: Stu Midgley <sdm900 at gmail.com>
Date: Sat, Mar 17, 2012 at 5:10 PM
Subject: can't mount our lustre filesystem after tunefs.lustre --writeconf
To: wc-discuss at whamcloud.com


Afternoon

We have a rather severe problem with our lustre file system.  We had a
full config log and the advice was to rewrite it with a new one.  So,
we unmounted our lustre file system off all clients, unmount all the
ost's and then unmounted the mds.  I then did

mds:
  tunefs.lustre --writeconf --erase-params /dev/md2

oss:
  tunefs.lustre --writeconf --erase-params --mgsnode=mds001 /dev/md2



After the tunefs.lustre on the mds I saw

Mar 17 14:33:02 mds001 kernel: Lustre: MGS MGS started
Mar 17 14:33:02 mds001 kernel: Lustre: MGC172.16.0.251 at tcp: Reactivating import
Mar 17 14:33:02 mds001 kernel: Lustre: MGS: Logs for fs p1 were
removed by user request.  All servers must be restarted in order to
regenerate the logs.
Mar 17 14:33:02 mds001 kernel: Lustre: Enabling user_xattr
Mar 17 14:33:02 mds001 kernel: Lustre: p1-MDT0000: new disk, initializing
Mar 17 14:33:02 mds001 kernel: Lustre: p1-MDT0000: Now serving
p1-MDT0000 on /dev/md2 with recovery enabled

which scared me a little...



the mds and the oss's mount happily BUT I can't mount the file system
on my clients... on the mds I see


Mar 17 16:42:11 mds001 kernel: LustreError: 137-5: UUID
'prod_mds_001_UUID' is not available  for connect (no target)


On the client I see


Mar 17 16:00:06 host kernel: LustreError: 11-0: an error occurred
while communicating with 172.16.0.251 at tcp. The mds_connect operation
failed with -19


now, it appears the writeconf renamed the UUID of the mds from
prod_mds_001_UUID to p1-MDT0000_UUID but I can't work out how to get
it back...


for example I tried


# tunefs.lustre --mgs --mdt --fsname=p1 /dev/md2
checking for existing Lustre data: found CONFIGS/mountdata
Reading CONFIGS/mountdata

 Read previous values:
Target:     p1-MDT0000
Index:      0
UUID:       prod_mds_001_UUID
Lustre FS:  p1
Mount type: ldiskfs
Flags:      0x405
            (MDT MGS )
Persistent mount opts: errors=remount-ro,iopen_nopriv,user_xattr
Parameters:

tunefs.lustre: cannot change the name of a registered target
tunefs.lustre: exiting with 1 (Operation not permitted)



I'm now stuck not being able to mount a 1PB file system... which isn't good :(

--
Dr Stuart Midgley
sdm900 at gmail.com


-- 
Dr Stuart Midgley
sdm900 at gmail.com



More information about the lustre-discuss mailing list