[Lustre-discuss] 1.6.5.1 OSS crashes

Brian J. Murrell Brian.Murrell at Sun.COM
Mon Jul 21 05:51:28 PDT 2008


On Sun, 2008-07-20 at 04:54 -0400, Robin Humble wrote:
> 
> done. I rebuilt using the stock kernel's InfiniBand stack and
>  # CONFIG_SD_IOSTATS is not set
> 
>  % cexec -p oss: uptime
> oss x17:  18:45:07 up 1 day, 30 min,  1 user,  load average: 4.97, 7.00, 6.27
> oss x18:  18:45:07 up 1 day, 23 min,  1 user,  load average: 4.18, 5.78, 5.71
> oss x19:  18:45:07 up 1 day, 23 min,  1 user,  load average: 5.18, 5.66, 4.60
> 
> which is >> the 10hrs it was crashing at before.

Good.

> good guess about the cause of the problem! :-)

I cheated.  It's an already open bug: 16404.  There is even a patch in
that bug for the reporter to test.  Please feel free to test it yourself
and report here (or even better, in the bug) on your results.

> maybe that rhel4 1.6.5.1 kernel rpm needs a respin then? seems like a
> fairly critical issue... :-/

You can follow the above bug to see how we progress with it.

b.

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
URL: <http://lists.lustre.org/pipermail/lustre-discuss-lustre.org/attachments/20080721/3f5022d7/attachment.pgp>


More information about the lustre-discuss mailing list