[Lustre-discuss] 1.6.5.1 OSS crashes
Brian J. Murrell
Brian.Murrell at Sun.COM
Mon Jul 21 05:51:28 PDT 2008
On Sun, 2008-07-20 at 04:54 -0400, Robin Humble wrote:
>
> done. I rebuilt using the stock kernel's InfiniBand stack and
> # CONFIG_SD_IOSTATS is not set
>
> % cexec -p oss: uptime
> oss x17: 18:45:07 up 1 day, 30 min, 1 user, load average: 4.97, 7.00, 6.27
> oss x18: 18:45:07 up 1 day, 23 min, 1 user, load average: 4.18, 5.78, 5.71
> oss x19: 18:45:07 up 1 day, 23 min, 1 user, load average: 5.18, 5.66, 4.60
>
> which is >> the 10hrs it was crashing at before.
Good.
> good guess about the cause of the problem! :-)
I cheated. It's an already open bug: 16404. There is even a patch in
that bug for the reporter to test. Please feel free to test it yourself
and report here (or even better, in the bug) on your results.
> maybe that rhel4 1.6.5.1 kernel rpm needs a respin then? seems like a
> fairly critical issue... :-/
You can follow the above bug to see how we progress with it.
b.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
URL: <http://lists.lustre.org/pipermail/lustre-discuss-lustre.org/attachments/20080721/3f5022d7/attachment.pgp>
More information about the lustre-discuss
mailing list