[Lustre-discuss] Fwd: Reg /// OSS rebooted automatically

Jeff Johnson jeff.johnson at aeoncomputing.com
Mon Dec 20 23:00:26 PST 2010


Daniel,

Check the health and stability of your RAID-6 volume. Make sure the array is healthy and online. Use whatever monitoring utility came with your RAID card, or check /proc/mdstat if it's Linux md RAID. Also check /var/log/messages for error messages from the RAID controller or other hardware.

--Jeff

---mobile signature---
Jeff Johnson - Aeon Computing
jeff.johnson at aeoncomputing.com

On Dec 20, 2010, at 22:27, Daniel Raj <danielraj2006 at gmail.com> wrote:

> Hi Jeff,
> 
> 
> Thanks for your reply.
> 
> Storage information:
> 
> 
> DL380G5     == OSS + 16 GB RAM
> OS          == SFS G3.2-2 + CentOS 5.3 + Lustre 1.8.3
> MSA60 box   == OST
> RAID level  == RAID 6
> 
> 
> Regards,
> 
> Daniel A 
> 
> On Tue, Dec 21, 2010 at 11:45 AM, Jeff Johnson <jeff.johnson at aeoncomputing.com> wrote:
> Daniel,
> 
> It looks like your OST backend storage device may be having an issue. I would check the health and stability of the backend storage device or RAID volume you are using for the OST. A storage problem alone would not likely cause your OSS to reboot, though. There may be additional hardware or OS-related problems causing the reboots, on top of Lustre complaining that it can't find the OST storage device.
> 
> Others here on the list will likely give you a more detailed answer. The storage device is the place I would look first.
> 
> --Jeff
> 
> -- 
> ------------------------------
> Jeff Johnson
> Manager
> Aeon Computing
> 
> jeff.johnson at aeoncomputing.com
> www.aeoncomputing.com
> t: 858-412-3810 x101   f: 858-412-3845
> m: 619-204-9061
> 
> 4905 Morena Boulevard, Suite 1313 - San Diego, CA 92117
> 
> 
> On Mon, Dec 20, 2010 at 9:43 PM, Daniel Raj <danielraj2006 at gmail.com> wrote:
> 
> 
> 
> Hi Genius,
> 
> 
> Good day!
> 
> 
> I am Daniel. My OSS keeps rebooting automatically, again and again. Kindly help me.
> 
> It is showing the error messages below:
> 
> 
> kernel: LustreError: 23351:0:(ldlm_lib.c:1892:target_send_reply_msg()) @@@ processing error (-19)  req@ffff810400e24400 x1353488904620274/t0 o8-><?>@<?>:0/0 lens 368/0 e 0 to 0 dl 1292738958 ref 1 fl Interpret:/0/0 rc -19/0
> kernel: LustreError: 137-5: UUID 'south-ost7_UUID' is not available for connect (no target)
> kernel: LustreError: 23284:0:(ldlm_lib.c:1892:target_send_reply_msg()) @@@ processing error (-19)  req@ffff8101124c7c00 x1353488904620359/t0 o8-><?>@<?>:0/0 lens 368/0 e 0 to 0 dl 1292739025 ref 1 fl Interpret:/0/0 rc -19/0
> 
> 
> Regards,
> 
> Daniel A 
> 
> 
> 