[Lustre-discuss] OFED 1.5.1 on Clients

Roger Spellman Roger.Spellman at terascala.com
Fri Jun 18 11:48:52 PDT 2010


Jason (or anyone else),

Patch 23498 ( https://bugzilla.lustre.org/attachment.cgi?id=23498 ) says:

Index: ./lnet/klnds/o2iblnd/o2iblnd_cb.c
===================================================================
RCS file: /cvsroot/cfs/lnet/klnds/o2iblnd/o2iblnd_cb.c,v
retrieving revision 1.12.6.1.2.5
diff -u -p -u -p -r1.12.6.1.2.5 o2iblnd_cb.c
--- ./lnet/klnds/o2iblnd/o2iblnd_cb.c	20 Nov 2008 09:29:34 -0000	1.12.6.1.2.5
+++ ./lnet/klnds/o2iblnd/o2iblnd_cb.c	15 May 2009 12:26:07 -0000
@@ -2654,6 +2654,8 @@ kiblnd_cm_callback(struct rdma_cm_id *cm
 
 	switch (event->event) {
 	default:
+                CERROR("Unexpected event: %d, status: %d\n",
+                       event->event, event->status);
                 LBUG();

Why should we LBUG just for an unexpected event?  Couldn't it just be
ignored?
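
To make the question concrete, here is a minimal standalone sketch of the "log and ignore" alternative. It is not the o2iblnd code: the enum, the event names, struct demo_event and demo_cm_callback() are all hypothetical stand-ins, and the return convention is an assumption made only so the behavior can be checked.

```c
#include <assert.h>
#include <stdio.h>

/* Hypothetical event codes for illustration; not the real RDMA CM values. */
enum demo_cm_event {
	DEMO_EVENT_CONNECT_REQUEST = 0,
	DEMO_EVENT_DISCONNECTED    = 1,
	DEMO_EVENT_NEWER_STACK     = 99  /* stands in for an event added by a later stack */
};

/* Hypothetical event record carrying the two fields the patch logs. */
struct demo_event {
	int event;
	int status;
};

/* Returns 0 if the event was handled, 1 if it was unknown and ignored. */
static int demo_cm_callback(const struct demo_event *ev)
{
	assert(ev != NULL);

	switch (ev->event) {
	case DEMO_EVENT_CONNECT_REQUEST:
		/* a real handler would set up the connection here */
		return 0;
	case DEMO_EVENT_DISCONNECTED:
		/* a real handler would tear down the connection here */
		return 0;
	default:
		/* Log the same details as the patch, then return instead of asserting. */
		fprintf(stderr, "Unexpected event: %d, status: %d\n",
			ev->event, ev->status);
		return 1;
	}
}
```

Calling demo_cm_callback() with DEMO_EVENT_NEWER_STACK logs the event and returns 1 rather than halting. Whether ignoring is actually safe in o2iblnd depends on what the unknown event implies for connection state, which this sketch does not model.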

-Roger

> -----Original Message-----
> From: Jason Rappleye [mailto:jason.rappleye at nasa.gov]
> Sent: Friday, June 18, 2010 2:16 PM
> To: Roger Spellman
> Cc: lustre-discuss at lists.lustre.org
> Subject: Re: [Lustre-discuss] OFED 1.5.1 on Clients
> 
> 
> On Jun 18, 2010, at 7:49 AM, Roger Spellman wrote:
> 
> > Jason,
> > Thanks for this response.  This brings up another question:
> 
> np
> 
> > The bug number you referred to mentions an LBUG in OFED 1.4.1.  Are you
> > saying that the same LBUG would occur with OFED 1.5.1 too without the
> > patch?
> 
> Yes. The patch handles new RDMA CM events that appear in OFED 1.4(.1?).
> They are also in 1.5.1. Without the patch, receipt of one of those
> events will result in an LBUG.
> 
> Jason
> 
> >
> > -Roger
> >
> >> -----Original Message-----
> >> From: Jason Rappleye [mailto:jason.rappleye at nasa.gov]
> >> Sent: Thursday, June 17, 2010 5:02 PM
> >> To: Roger Spellman
> >> Cc: lustre-discuss at lists.lustre.org
> >> Subject: Re: [Lustre-discuss] OFED 1.5.1 on Clients
> >>
> >>
> >> On Jun 17, 2010, at 1:23 PM, Roger Spellman wrote:
> >>
> >>> Hi,
> >>> Can anyone share their experiences using OFED 1.5.1 on Lustre
> >>> Clients?  This is needed because RHAT 5.5 does not support OFED
> >>> 1.4.2.
> >>
> >> We're using it with ~9300 clients running Lustre 1.6.6 and haven't
> >> identified any OFED 1.5.1-specific issues. If you're using 1.6.x and
> >> haven't done so already, you'll want to apply attachment 23498 from
> >> bug 19520.
> >>
> >> We just deployed 1.8.2 on a separate cluster with ~130 clients and
> >> haven't seen any OFED-specific issues there, either. While we did see
> >> some failures when running acc-sm with the stack of software we use
> >> here, none of those had anything to do with the version of OFED we
> >> were running.
> >>
> >
> 
> --
> Jason Rappleye
> System Administrator
> NASA Advanced Supercomputing Division
> NASA Ames Research Center
> Moffett Field, CA 94035