[lustre-discuss] trouble mounting after a tunefs

Cowe, Malcolm J malcolm.j.cowe at intel.com
Sun Jun 14 13:47:03 PDT 2015


I believe this message is benign and appears when the MDS is first started. IIRC, it has something to do with the OSTs not being online yet. I see a similar warning on every system I run, for example:

May 31 20:53:56 ie2-mds1.lfs.intl kernel: LustreError: 11-0: demo-MDT0000-lwp-MDT0000: Communicating with 0 at lo, operation mds_connect failed with -11.
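
As a side note, the trailing -11 in messages like this is a negated Linux errno value; 11 is EAGAIN ("try again"), which is consistent with an operation that is simply being retried. A minimal Python sketch for pulling the fields out of such a line follows. The regular expression and the name parse_lustre_error are illustrative assumptions inferred from the one sample above, not part of any Lustre tool, and other LustreError formats may not match.

```python
import errno
import re

# Pattern inferred from the single sample line in this message (assumption);
# other LustreError message formats may differ.
_PATTERN = re.compile(
    r"LustreError: \S+: (?P<device>\S+): Communicating with (?P<peer>.+?), "
    r"operation (?P<op>\w+) failed with (?P<rc>-?\d+)"
)


def parse_lustre_error(line):
    """Return the fields of a 'Communicating with ...' error, or None."""
    match = _PATTERN.search(line)
    if match is None:
        return None
    rc = int(match.group("rc"))
    return {
        "device": match.group("device"),
        "peer": match.group("peer"),
        "operation": match.group("op"),
        "rc": rc,
        # Map the negated return code to a symbolic errno name, if known.
        "errno_name": errno.errorcode.get(-rc, "unknown"),
    }


# Sample copied from this message; the list archive shows "@" as " at ".
SAMPLE = (
    "May 31 20:53:56 ie2-mds1.lfs.intl kernel: LustreError: 11-0: "
    "demo-MDT0000-lwp-MDT0000: Communicating with 0 at lo, "
    "operation mds_connect failed with -11."
)

if __name__ == "__main__":
    print(parse_lustre_error(SAMPLE))
```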

This is from one of our lab systems. If the MDT shows up as mounted, there may not be a case to answer, although you will still need to verify that your connectivity works as expected :).

Check that the storage target is mounted, that the service is started (kernel threads are running), and that the content of /proc/fs/lustre/health_check says "healthy", etc. "lctl dl" on the MDS should list the services that are up, including the MDT, and "lfs check servers" on the client should come back positive (all targets active).
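
The health_check part of this can be scripted. Below is a minimal Python sketch, assuming only the file path quoted above; the function names are hypothetical, and anything other than the literal text "healthy" is treated as not healthy. "lctl dl" and "lfs check servers" still need to be run and read by hand.

```python
from pathlib import Path

# Path as quoted in this message; it may differ on other Lustre versions.
HEALTH_FILE = "/proc/fs/lustre/health_check"


def is_healthy(text):
    """True only if the content is exactly "healthy", ignoring whitespace."""
    return text.strip() == "healthy"


def read_health(path=HEALTH_FILE):
    """Return the raw content of the health file, or None if it is unreadable."""
    try:
        return Path(path).read_text()
    except OSError:
        return None


if __name__ == "__main__":
    content = read_health()
    if content is None:
        print("health_check is not readable (are the Lustre modules loaded?)")
    elif is_healthy(content):
        print("healthy")
    else:
        print("not healthy: " + content.strip())
```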


Malcolm Cowe
Intel High Performance Data Division


-----Original Message-----
From: lustre-discuss [mailto:lustre-discuss-bounces at lists.lustre.org] On Behalf Of John White
Sent: Saturday, June 13, 2015 1:07 AM
To: lustre-discuss at lists.lustre.org
Subject: [lustre-discuss] trouble mounting after a tunefs

Good Morning Folks,
	We recently had to add TCP NIDs to an existing o2ib FS.  We added the NID to the modprobe.d configuration and added the new NIDs to the failnode and mgsnode params on all OSTs and on the MGS + MDT.  When either an o2ib or a tcp client tries to mount, the mount command hangs and dmesg repeats:
LustreError: 11-0: brc-MDT0000-mdc-ffff881036879c00: Communicating with 10.4.250.10 at o2ib, operation mds_connect failed with -11.

I fear we may have over-done the parameters, could anyone take a look here and let me know if we need to fix things up (remove params, etc)?

MGS:
Read previous values:
Target:     MGS
Index:      unassigned
Lustre FS:  
Mount type: ldiskfs
Flags:      0x4
              (MGS )
Persistent mount opts: user_xattr,errors=remount-ro
Parameters:

MDT:
 Read previous values:
Target:     brc-MDT0000
Index:      0
Lustre FS:  brc
Mount type: ldiskfs
Flags:      0x1001
              (MDT no_primnode )
Persistent mount opts: user_xattr,errors=remount-ro
Parameters:  mgsnode=10.4.250.11 at o2ib,10.0.250.11 at tcp:10.4.250.10 at o2ib,10.0.250.10 at tcp  failover.node=10.4.250.10 at o2ib,10.0.250.10 at tcp:10.4.250.11 at o2ib,10.0.250.11 at tcp mdt.quota_type=ug

OST(sample):
Read previous values:
Target:     brc-OST0002
Index:      2
Lustre FS:  brc
Mount type: ldiskfs
Flags:      0x1002
              (OST no_primnode )
Persistent mount opts: errors=remount-ro
Parameters:  mgsnode=10.4.250.10 at o2ib,10.0.250.10 at tcp:10.4.250.11 at o2ib,10.0.250.11 at tcp  failover.node=10.4.250.12 at o2ib,10.0.250.12 at tcp:10.4.250.13 at o2ib,10.0.250.13 at tcp ost.quota_type=ug
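
To make the Parameters lines above easier to compare, here is a minimal Python sketch that splits them into nodes and NIDs. It assumes the usual convention that ":" separates nodes and "," separates the NIDs of one node, and it undoes the " at " that the list archive substitutes for "@". The function name parse_params is hypothetical and this is not a Lustre utility.

```python
def parse_params(params):
    """Split a tunefs 'Parameters:' value into {key: [[nid, ...], ...]}.

    The outer list holds one entry per node (":" separated, by assumption);
    the inner list holds that node's NIDs ("," separated).
    """
    text = params.replace(" at ", "@")  # undo the archive's "@" obfuscation
    parsed = {}
    for token in text.split():
        key, sep, value = token.partition("=")
        if not sep:
            continue  # skip anything that is not key=value
        parsed[key] = [node.split(",") for node in value.split(":")]
    return parsed


# Values copied from the MDT and OST output in this message.
MDT_PARAMS = (
    "mgsnode=10.4.250.11 at o2ib,10.0.250.11 at tcp:"
    "10.4.250.10 at o2ib,10.0.250.10 at tcp  "
    "failover.node=10.4.250.10 at o2ib,10.0.250.10 at tcp:"
    "10.4.250.11 at o2ib,10.0.250.11 at tcp mdt.quota_type=ug"
)
OST_PARAMS = (
    "mgsnode=10.4.250.10 at o2ib,10.0.250.10 at tcp:"
    "10.4.250.11 at o2ib,10.0.250.11 at tcp  "
    "failover.node=10.4.250.12 at o2ib,10.0.250.12 at tcp:"
    "10.4.250.13 at o2ib,10.0.250.13 at tcp ost.quota_type=ug"
)

if __name__ == "__main__":
    for name, params in (("MDT", MDT_PARAMS), ("OST", OST_PARAMS)):
        for key, nodes in parse_params(params).items():
            print(name, key, nodes)
```

Printed side by side, the MDT lists 10.4.250.11 first under mgsnode while the OST lists 10.4.250.10 first; whether that ordering matters for this problem is not established here.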