[Lustre-discuss] Added Dual-homed OSS; ethernet clients confused
Chris Worley
worleys at gmail.com
Wed Apr 23 07:10:22 PDT 2008
On Wed, Apr 23, 2008 at 12:42 AM, Andreas Dilger <adilger at sun.com> wrote:
> On Apr 22, 2008 18:08 -0600, Chris Worley wrote:
> > The error specifically complains about the first OST/disk on the new
> > OSS, OST0026. It's tunefs.lustre output was:
> >
>
> > > On the OSS in question, for each OST, I did:
> > >
> > > # tunefs.lustre --writeconf --ost
> > >
> > > --mgsnode="36.102.29.1 at o2ib0,36.101.29.1 at tcp0" --fsname=lfs --param
> > > sys.timeout=40 --param lov.stripesize=2M /dev/sdl
>
> > > Lustre: cmd=cf003 0:lfs-OST0026-osc 1:lfs-OST0026_UUID 2:36.102.29.4 at o2ib
> > > LustreError: 15c-8: MGC36.101.29.1 at tcp: The configuration from log
> > > 'lfs-client' failed (-2). This may be the result of communication
> > > errors between this node and the MGS, a bad configuration, or other
> > > errors. See the syslog for more information.
>
> The problem is that the NID for the new OST is the IPoIB address, and this
> is what the TCP client is trying to connect to. If you specify the TCP
> NID first this may help. Also note that the client does not get the
> config from the OSTs, but rather the MGS, so you need to do a --write-conf
> on there.
This is confusing as the man page for "tunefs.lustre" wants a device
name at the end of the command... and the device is on another OSS...
"/dev/sda" on the MGS is a totally different drive. Can I use the
label?
Thanks,
Chris
>
> Cheers, Andreas
> --
> Andreas Dilger
> Sr. Staff Engineer, Lustre Group
> Sun Microsystems of Canada, Inc.
>
>
More information about the lustre-discuss
mailing list