[Lustre-discuss] mount.lustre: mount /dev/sdd1 at /hpcdata/ost1 failed: Protocol error
Christian Gajan
Christian.Gajan at Sun.COM
Thu Apr 24 09:30:12 PDT 2008
Hi,
My Lustre configuration is RHEL 5.1 (kernel 2.6.18-53.1.13 + Lustre patches) +
OFED 1.2.5.5 + Lustre 1.6.4.3.
LNET seems to work correctly, but when I try to configure my OSS I get
a Protocol error at the mount step.
Here are the details:
MDT/MDS server
# modprobe lnet
# lctl network up
# lctl list_nids
192.168.1.17@o2ib
# mkfs.lustre --fsname hpcdata --mdt --mgs /dev/sdb2
# mkdir -p /hpcdata/mdt
# mount -t lustre /dev/sdb2 /hpcdata/mdt
Everything is OK here.
OST server
# modprobe lnet
# lctl network up
# lctl list_nids
192.168.1.16@o2ib
# lctl ping 192.168.1.17@o2ib
12345 - 0@lo
12345 - 192.168.1.17@o2ib
# mkfs.lustre --fsname hpcdata --ost --mgsnode=192.168.1.17@o2ib /dev/sdd1
# mkdir -p /hpcdata/ost1
# mount -t lustre /dev/sdd1 /hpcdata/ost1
mount.lustre: mount /dev/sdd1 at /hpcdata/ost1 failed: Protocol error
!!!!!!!!!
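A minimal sketch of how the `lctl ping` check above could be scripted before attempting the mount. The helper name `ping_saw_nid` is hypothetical, and the sample reply is copied from the session above so the snippet runs without a live LNET. It assumes the peer NID is the third whitespace-separated field of each reply line, as in the output shown.

```shell
# Hypothetical helper: succeeds if the expected NID appears in `lctl ping` output.
# stdin: output of `lctl ping <nid>`; $1: NID expected in the reply.
ping_saw_nid() {
    # Assumption: each reply line looks like "<pid> - <nid>", so the NID is field 3.
    awk -v nid="$1" '$3 == nid { found = 1 } END { exit !found }'
}

# Sample reply taken from the OST-side session (no live LNET needed).
sample='12345 - 0@lo
12345 - 192.168.1.17@o2ib'

if printf '%s\n' "$sample" | ping_saw_nid "192.168.1.17@o2ib"; then
    echo "MGS NID answered"
else
    echo "MGS NID missing from reply"
fi
```

On a real node the sample would be replaced by piping `lctl ping 192.168.1.17@o2ib` into the helper. A successful ping only shows that LNET can reach the peer; it says nothing about whether the MGS accepts the registration.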
This is what I get in /var/log/messages (OST side):
Apr 24 07:17:55 s1206 kernel: kjournald starting. Commit interval 5 seconds
Apr 24 07:17:55 s1206 kernel: LDISKFS FS on sdd1, internal journal
Apr 24 07:17:55 s1206 kernel: LDISKFS-fs: mounted filesystem with ordered data mode.
Apr 24 07:17:55 s1206 kernel: kjournald starting. Commit interval 5 seconds
Apr 24 07:17:55 s1206 kernel: LDISKFS FS on sdd1, internal journal
Apr 24 07:17:55 s1206 kernel: LDISKFS-fs: mounted filesystem with ordered data mode.
Apr 24 07:17:55 s1206 kernel: LDISKFS-fs: file extents enabled
Apr 24 07:17:55 s1206 kernel: LDISKFS-fs: mballoc enabled
Apr 24 07:17:55 s1206 kernel: LustreError: 13483:0:(pack_generic.c:782:lustre_unpack_msg()) bad lustre msg magic: 0X26F54000
Apr 24 07:17:55 s1206 kernel: LustreError: 13483:0:(client.c:613:after_reply()) @@@ unpack_rep failed: -22 req@ffff81022eb06800 x11/t0 o253->MGS@MGC192.168.1.17@o2ib_0:26 lens 4672/4672 ref 1 fl Rpc:R/0/0 rc 0/-22
Apr 24 07:17:56 s1206 kernel: LustreError: 13483:0:(obd_mount.c:954:server_register_target()) registration with the MGS failed (-71)
Apr 24 07:17:56 s1206 kernel: LustreError: 13483:0:(obd_mount.c:1054:server_start_targets()) Required registration failed for hpcdata-OSTffff: -71
Apr 24 07:17:56 s1206 kernel: LustreError: 13483:0:(obd_mount.c:1570:server_fill_super()) Unable to start targets: -71
Apr 24 07:17:56 s1206 kernel: LustreError: 13483:0:(obd_mount.c:1368:server_put_super()) no obd hpcdata-OSTffff
Apr 24 07:17:56 s1206 kernel: LustreError: 13483:0:(obd_mount.c:119:server_deregister_mount()) hpcdata-OSTffff not registered
Apr 24 07:17:56 s1206 kernel: LDISKFS-fs: mballoc: 0 blocks 0 reqs (0 success)
Apr 24 07:17:56 s1206 kernel: LDISKFS-fs: mballoc: 0 extents scanned, 0 goal hits, 0 2^N hits, 0 breaks, 0 lost
Apr 24 07:17:56 s1206 kernel: LDISKFS-fs: mballoc: 0 generated and it took 0
Apr 24 07:17:56 s1206 kernel: LDISKFS-fs: mballoc: 0 preallocated, 0 discarded
Apr 24 07:17:56 s1206 kernel: Lustre: server umount hpcdata-OSTffff complete
Apr 24 07:17:56 s1206 kernel: LustreError: 13483:0:(obd_mount.c:1924:lustre_fill_super()) Unable to mount (-71)
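The negative numbers in the log above (-22 and -71) are kernel return codes, which on Linux are negated errno values. A minimal sketch for decoding them, assuming Linux errno numbering; `errno_name` is a hypothetical helper that covers only the two codes that appear in this log:

```shell
# Hypothetical helper: map a (possibly negative) return code to its Linux errno name.
# Only the two codes seen in the OST-side log are covered.
errno_name() {
    case "${1#-}" in                      # strip a leading minus sign
        22) echo "EINVAL (Invalid argument)" ;;
        71) echo "EPROTO (Protocol error)" ;;
        *)  echo "unknown" ;;
    esac
}

errno_name -22   # code reported by after_reply() for unpack_rep
errno_name -71   # code reported by server_register_target() and the mount
```

Read this way, the "Protocol error" printed by mount.lustre is the same -71 that the registration with the MGS returned, and it follows the -22 raised when the reply could not be unpacked.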
And this is what I get in /var/log/messages (MDS side):
Apr 24 08:04:53 s1207 kernel: ko2iblnd: no version for "ib_fmr_pool_unmap" found: kernel tainted.
Apr 24 08:04:53 s1207 kernel: Lustre: Added LNI 192.168.1.17@o2ib [8/64]
Apr 24 08:05:30 s1207 kernel: kjournald starting. Commit interval 5 seconds
Apr 24 08:05:30 s1207 kernel: LDISKFS FS on sdb2, internal journal
Apr 24 08:05:30 s1207 kernel: LDISKFS-fs: mounted filesystem with ordered data mode.
Apr 24 08:05:30 s1207 kernel: kjournald starting. Commit interval 5 seconds
Apr 24 08:05:30 s1207 kernel: LDISKFS FS on sdb2, internal journal
Apr 24 08:05:30 s1207 kernel: LDISKFS-fs: mounted filesystem with ordered data mode.
Apr 24 08:05:30 s1207 kernel: Lustre: MGS MGS started
Apr 24 08:05:30 s1207 kernel: Lustre: Enabling user_xattr
Apr 24 08:05:30 s1207 kernel: Lustre: MDT hpcdata-MDT0000 now serving dev (hpcdata-MDT0000/f8e01d68-5859-94c6-4f92-cde70c2d1781) with recovery enabled
Apr 24 08:05:30 s1207 kernel: Lustre: 4974:0:(lproc_mds.c:260:lprocfs_wr_group_upcall()) hpcdata-MDT0000: group upcall set to /usr/sbin/l_getgroups
Apr 24 08:05:30 s1207 kernel: Lustre: hpcdata-MDT0000.mdt: set parameter group_upcall=/usr/sbin/l_getgroups
Apr 24 08:05:31 s1207 kernel: Lustre: Server hpcdata-MDT0000 on device /dev/sdb2 has started
Apr 24 08:05:36 s1207 kernel: LustreError: 4988:0:(client.c:975:ptlrpc_expire_one_request()) @@@ timeout (sent at 1209049530, 5s ago) req@ffff8102321de000 x7/t0 o8->hpcdata-OST0000_UUID@192.168.1.16@o2ib:6 lens 240/272 ref 1 fl Rpc:/0/0 rc 0/-22
Apr 24 08:05:36 s1207 kernel: LustreError: 4988:0:(client.c:975:ptlrpc_expire_one_request()) @@@ timeout (sent at 1209049530, 6s ago) req@ffff81022861ae00 x8/t0 o8->hpcdata-OST0001_UUID@192.168.1.16@o2ib:6 lens 240/272 ref 1 fl Rpc:/0/0 rc 0/-22
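The "sent at 1209049530" field in the timeout messages above is a Unix epoch timestamp. A minimal sketch for converting it to a readable UTC time, assuming GNU date (the `-d @SECONDS` form is a GNU coreutils extension); `epoch_to_utc` is a hypothetical helper name:

```shell
# Hypothetical helper: print an epoch timestamp as a UTC date and time.
# Assumes GNU date; BSD date uses a different option (-r) for this.
epoch_to_utc() {
    date -u -d "@$1" '+%Y-%m-%d %H:%M:%S'
}

epoch_to_utc 1209049530   # prints 2008-04-24 15:05:30
```

15:05:30 UTC would line up with the 08:05:30 entries in the MDS log if s1207 keeps Pacific Daylight Time, which is an assumption about the host's clock and not something the log states.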
Have I made a mistake?
Or is this a bug (a quick search on bugzilla.lustre.org turned up nothing)?
Thanks in advance for your help.
Regards,
Christian