[Lustre-discuss] error while upgrading 1.6.5 to 1.6.7.2

Michal Bialoskorski m.bialoskorski at task.gda.pl
Fri Jun 26 14:27:24 PDT 2009


Thank you very much, Nirmal. "home" is working now. Next I will
upgrade/recover the second fs.

To sum up, this is what I did:

on MDS:
1) /opt/lustre/sbin/tunefs.lustre --mdt --writeconf --erase-param \
     --param="mdt.group_upcall=/usr/sbin/l_getgroups" /dev/mapper/home.mdt
2) mount -t lustre -o abort_recov /dev/mapper/work.mdt /lustre/home.mdt
3) umount /lustre/home.mdt
4) mount -t lustre /dev/mapper/work.mdt /lustre/home.mdt
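The MDS steps can be sketched as a small dry-run script. This is only an illustration, not a tested procedure: MDT_DEV, MDT_MNT, DRY_RUN and the run/mdt_commands helpers are names made up for the example, and the device is left as a parameter because the steps above name both home.mdt and work.mdt. With DRY_RUN=1 (the default here) the commands are only printed.

```shell
#!/bin/sh
# Dry-run sketch of the four MDS steps listed above.
# MDT_DEV, MDT_MNT and DRY_RUN are hypothetical variables for this example;
# check the device path against your own setup before setting DRY_RUN=0.
MDT_DEV="${MDT_DEV:-/dev/mapper/home.mdt}"
MDT_MNT="${MDT_MNT:-/lustre/home.mdt}"
DRY_RUN="${DRY_RUN:-1}"              # 1 = print commands only, 0 = execute
TUNEFS=/opt/lustre/sbin/tunefs.lustre

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

mdt_commands() {
    # 1) rewrite the configuration logs and reset the parameters
    run "$TUNEFS" --mdt --writeconf --erase-param \
        --param="mdt.group_upcall=/usr/sbin/l_getgroups" "$MDT_DEV" || return 1
    # 2) first mount, skipping recovery
    run mount -t lustre -o abort_recov "$MDT_DEV" "$MDT_MNT" || return 1
    # 3) unmount again
    run umount "$MDT_MNT" || return 1
    # 4) normal mount
    run mount -t lustre "$MDT_DEV" "$MDT_MNT"
}

mdt_commands
```

Running it unchanged prints the four commands in order, so they can be reviewed before anything touches the MDT.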

on OSSs for all OSTs:
1) tunefs.lustre --ost --writeconf --erase-param \
     --mgsnode=192.168.27.252@o2ib /dev/mapper/home.ost.XX
2) mount -t lustre -o abort_recov /dev/mapper/home.ost.XX \
     /lustre/home/ost.XX
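
Since the same two commands are repeated for every OST, the per-OST steps can be sketched as a loop. This is a rough sketch under assumptions: OST_IDS (the list of XX suffixes on one OSS), DRY_RUN and the run/ost_commands helpers are hypothetical names introduced for the example, and only the MGS NID and the device/mount-point patterns come from the steps above. With DRY_RUN=1 (the default here) nothing is executed.

```shell
#!/bin/sh
# Dry-run sketch of the per-OST steps listed above.
# OST_IDS and DRY_RUN are hypothetical variables for this example.
MGSNODE="192.168.27.252@o2ib"
OST_IDS="${OST_IDS:-01 02}"          # assumed OST suffixes on this OSS
DRY_RUN="${DRY_RUN:-1}"              # 1 = print commands only, 0 = execute

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

ost_commands() {
    for id in $OST_IDS; do
        dev="/dev/mapper/home.ost.$id"
        mnt="/lustre/home/ost.$id"
        # 1) rewrite the configuration and point the OST at the MGS
        run tunefs.lustre --ost --writeconf --erase-param \
            --mgsnode="$MGSNODE" "$dev" || return 1
        # 2) mount the OST, skipping recovery
        run mount -t lustre -o abort_recov "$dev" "$mnt" || return 1
    done
}

ost_commands
```

The loop stops at the first failing command, so a failed tunefs.lustre run is not followed by a mount attempt on the same device.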


m.

Nirmal Seenu wrote:
> Hi Michal,
>
> You have to include the --writeconf option for those tunefs
> modifications to actually be written to the OST.
>
> Nirmal
>
> Michal Bialoskorski wrote:
>> Thanks Nirmal,
>>
>> tunefs now works. I ran this command:
>>
>> /opt/lustre/sbin/tunefs.lustre --ost --erase-param \
>>   --mgsnode=192.168.27.252@o2ib /dev/mapper/home.ost.01
>> but I still cannot mount the OST. When I try to mount it, I get:
>>
>> Jun 26 17:49:12 ossh2 kernel: Lustre: MGC192.168.27.252@o2ib:
>> Reactivating import
>> Jun 26 17:49:12 ossh2 kernel: LustreError:
>> 5848:0:(obd_mount.c:1129:server_start_targets()) no server named
>> home-OST0000 was started
>> Jun 26 17:49:12 ossh2 kernel: LustreError:
>> 5848:0:(obd_mount.c:1628:server_fill_super()) Unable to start 
>> targets: -6
>> Jun 26 17:49:12 ossh2 kernel: LustreError:
>> 5848:0:(obd_mount.c:1411:server_put_super()) no obd home-OST0000
>> Jun 26 17:49:12 ossh2 kernel: LustreError:
>> 5848:0:(ldlm_request.c:1033:ldlm_cli_cancel_req()) Got rc -108 from
>> cancel RPC: canceling anyway
>> Jun 26 17:49:12 ossh2 kernel: LustreError:
>> 5848:0:(ldlm_request.c:1622:ldlm_cli_cancel_list())
>> ldlm_cli_cancel_list: -108
>> Jun 26 17:49:12 ossh2 kernel: LDISKFS-fs: mballoc: 0 blocks 0 reqs (0
>> success)
>> Jun 26 17:49:12 ossh2 kernel: LDISKFS-fs: mballoc: 0 extents scanned, 0
>> goal hits, 0 2^N hits, 0 breaks, 0 lost
>> Jun 26 17:49:12 ossh2 kernel: LDISKFS-fs: mballoc: 0 generated and it 
>> took 0
>> Jun 26 17:49:12 ossh2 kernel: LDISKFS-fs: mballoc: 0 preallocated, 0
>> discarded
>> Jun 26 17:49:12 ossh2 kernel: Lustre: server umount home-OST0000 
>> complete
>> Jun 26 17:49:12 ossh2 kernel: Lustre: Skipped 1 previous similar message
>> Jun 26 17:49:12 ossh2 kernel: LustreError:
>> 5848:0:(obd_mount.c:1991:lustre_fill_super()) Unable to mount  (-6)
>> Jun 26 17:49:12 ossh2 kernel: LustreError:
>> 5848:0:(obd_mount.c:1991:lustre_fill_super()) Skipped 1 previous similar
>> message
>>
>> And on MDS:
>>
>> Jun 26 18:16:41 mdsh kernel: LustreError: 13b-9: home-OST0000 claims to
>> have registered, but this MGS does not know about it, preventing
>> registration.
>> Jun 26 18:16:41 mdsh kernel: LustreError:
>> 5251:0:(mgs_handler.c:654:mgs_handle()) MGS handle cmd=253 rc=-2
>> Jun 26 18:16:41 mdsh kernel: LustreError:
>> 5251:0:(ldlm_lib.c:1643:target_send_reply_msg()) @@@ processing error
>> (-2)  req@ffff810805c00050 x16/t0
>> o253->6a7fc3be-6a98-7219-6302-a17def55c327@NET_0x50000c0a81bf6_UUID:0/0
>> lens 4672/4672 e 0 to 0 dl 1246033101 ref 1 fl Interpret:/0/0 rc 0/0
>>
>> Have you got any idea what is wrong? How can I clean the MGS?
>>
>> Michal.
>>
>>
>> Nirmal Seenu wrote:
>>> The manual has the incorrect command; just remove the "--fsname"
>>> option and everything should work fine.
>>>
>>> Nirmal
>>



