[Lustre-discuss] Problem with LNET configuration
Stefano Elmopi
stefano.elmopi at sociale.it
Mon Jul 12 07:36:13 PDT 2010
Hi,
I have a Lustre file system, consisting of a MGS/MDS an two OSS, interconnected with Infiniband.
The version of Lustre is 1.8.3 and the SO of the servers is CentOS 5.4 and I used
the following commands to their formatting:
MGS/MDS:
mkfs.lustre --mgs /dev/mpath/mpath1
mount -t lustre /dev/mpath/mpath1 /MGS
mkfs.lustre --mdt --fsname=lustre01 --mgsnode=172.16.100.111 at tcp0,192.168.150.1 at o2ib0 --mgsnode=172.16.100.121 at tcp0,192.168.150.11 at o2ib0 --failnode=172.16.100.121 at tcp0,192.168.150.11 at o2ib0 /dev/mpath/mpath2
mount -t lustre /dev/mpath/mpath2 /MDS_1/
OSS_1
mkfs.lustre --ost --fsname=lustre01 --failnode=172.16.100.122 at tcp0,192.168.150.12 at o2ib0 --mgsnode=172.16.100.111 at tcp0,192.168.150.1 at o2ib0 --mgsnode=172.16.100.121 at tcp0,192.168.150.11 at o2ib0 /dev/mpath/mpath1
mount -t lustre /dev/mpath/mpath1 /LUSTRE_1
OSS_2
mkfs.lustre --ost --fsname=lustre01 --failnode=172.16.100.121 at tcp0,192.168.150.11 at o2ib0 --mgsnode=172.16.100.111 at tcp0,192.168.150.1 at o2ib0 --mgsnode=172.16.100.121 at tcp0,192.168.150.11 at o2ib0 /dev/mpath/mpath2
mount -t lustre /dev/mpath/mpath2 /LUSTRE_1
and then there are two clients mounted, one on Ethernet and one on IB.
I disconnected the IB cable to simulate the breaking of the IB card on OSS_2.
I modified the file modprobe.conf to start LNET with only Ethernet card and then mount Lustre
filesystem and the operation seems to be successful, the ethernet client can see the entire filesystem.
The problem comes when I try to force a write on OSS_2 because writing crashes ,and the operation goes wrong.
Log on MGS/MDS:
Jul 12 15:04:59 mdt01prdpom kernel: LustreError: 4238:0:(events.c:66:request_out_callback()) @@@ type 4, status -113 req at ffff81013ea52000 x1340531260082684/t0 o8->lustre01-OST0001_UUID at 172.16.100.121@tcp:28/4 lens 368/584 e 0 to 1 dl 1278939908 ref 2 fl Rpc:N/0/0 rc 0/0
Jul 12 15:04:59 mdt01prdpom kernel: LustreError: 4238:0:(events.c:66:request_out_callback()) Skipped 16 previous similar messages
Jul 12 15:06:07 mdt01prdpom kernel: LustreError: 4237:0:(lov_request.c:690:lov_update_create_set()) error creating fid 0x10f8004 sub-object on OST idx 1/1: rc = -11
Jul 12 15:06:07 mdt01prdpom kernel: LustreError: 4237:0:(lov_request.c:690:lov_update_create_set()) Skipped 1 previous similar message
Jul 12 15:06:07 mdt01prdpom kernel: LustreError: 4408:0:(mds_open.c:441:mds_create_objects()) error creating objects for inode 17793028: rc = -5
Jul 12 15:06:07 mdt01prdpom kernel: LustreError: 4408:0:(mds_open.c:826:mds_finish_open()) mds_create_objects: rc = -5
My question is:
you can mount the server OSS_2 so that it can provide service with the ethernet card ?
If yes, What should I do?
Thanks
Ing. Stefano Elmopi
Gruppo Darco - Resp. ICT Sistemi
Via Ostiense 131/L Corpo B, 00154 Roma
cell. 3466147165
tel. 0657060500
email:stefano.elmopi at sociale.it
"Ai sensi e per effetti della legge sulla tutela della riservatezza personale
(D.lgs n. 196/2003), questa @mail e' destinata unicamente alle persone sopra
indicate e le informazioni in essa contenute sono da considerarsi strettamente
riservate. E' proibito leggere, copiare, usare o diffondere il contenuto della
presente @mail senza autorizzazione. Se avete ricevuto questo messaggio per
errore, siete pregati di rispedire la stessa al mittente. Grazie"
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.lustre.org/pipermail/lustre-discuss-lustre.org/attachments/20100712/657b70f2/attachment.htm>
More information about the lustre-discuss
mailing list