[Lustre-discuss] unable to mount ost after mdt migration and mds/mgs failover configuration

Michael Barnes Michael.Barnes at jlab.org
Thu Feb 24 07:38:59 PST 2011


Hello list,

I've migrated our mdt data to a new machine via tar and getfattr, and then did the following:

mds/mgs: rm OBJECTS/* CATALOGS

mds/mgs: tunefs.lustre --writeconf --mgs --mdt --fsname=lustre --erase-param --param mdt.quota_type=ug2 --param mdt.group_upcall=/usr/sbin/l_getgroups --param failover.node=172.17.4.124 at o2ib /dev/sdb

mds/mgs: mount -t lustre /dev/sdb /mdt


Everything seems OK up to this point.


I then modified each ost with:

oss: tunefs.lustre --writeconf --erase-param --param mgsnode=172.17.4.123 at o2ib:172.17.4.124 at o2ib --param ost.quota_type=ug2 /dev/sdb


Then, when I try to mount the ost I get in dmesg:

Lustre: 3269:0:(client.c:1476:ptlrpc_expire_one_request()) @@@ Request x1361639653769217 sent from MGC172.17.4.123 at o2ib to NID 172.17.4.124 at o2ib 0s ago has failed due to network error (5s prior to deadline).
  req at ffff810133f24800 x1361639653769217/t0 o250->MGS at MGC172.17.4.123@o2ib_0:26/25 lens 368/584 e 0 to 1 dl 1298560772 ref 1 fl Rpc:N/0/0 rc 0/0
LustreError: 3164:0:(client.c:858:ptlrpc_import_delay_req()) @@@ IMP_INVALID  req at ffff810106b39c00 x1361639653769218/t0 o253->MGS at MGC172.17.4.123@o2ib_0:26/25 lens 4736/4928 e 0 to 1 dl 0 ref 1 fl Rpc:/0/0 rc 0/0
LustreError: 3164:0:(obd_mount.c:1097:server_start_targets()) Required registration failed for lustre-OST0037: -108
LustreError: 3164:0:(obd_mount.c:1655:server_fill_super()) Unable to start targets: -108
LustreError: 3164:0:(obd_mount.c:1438:server_put_super()) no obd lustre-OST0037
LustreError: 3164:0:(obd_mount.c:147:server_deregister_mount()) lustre-OST0037 not registered


The first mds/mgsnode has the mdt mounted.  lctl ping and regular TCP/ip works over the ib interface.  Did I miss something that the ost needs to join the new mds/mgs configuration?

Thanks,

-mb 

--
+-----------------------------------------------
| Michael Barnes
|
| Thomas Jefferson National Accelerator Facility
| Scientific Computing Group
| 12000 Jefferson Ave.
| Newport News, VA 23606
| (757) 269-7634
+-----------------------------------------------







More information about the lustre-discuss mailing list