[Lustre-discuss] kernel panic with 1.6.5rc2 on mds
Oleg Drokin
Oleg.Drokin at Sun.COM
Sun May 18 16:58:20 PDT 2008
Hello!
On May 16, 2008, at 6:45 AM, Patrick Winnertz wrote:
> As I wrote in #11742 [1] I experienced a kernel panic after doing
> heavy I/O
> on the 1.6.5rc2 cluster on the mds. Since nobody answered to this bug
> until now (and I think in other cases the lustre team is _really_ fast
> (thanks for that :))) I fear that it was not recognised by anybody.
I just looked into the logs, you have out of memory issues at the very
least
during that i/o. Also checksum errors. The log you uploaded does not
contain
actual crash info, but rather only these messages (that do not cause
crash in itself),
followed by oom and checksum error messages.
I do not see any panic messages in your logs. Any chance you have a
serial console
or other way to see what was the actual panic complete with stacktrace
and other
useful info? (ideally a crashdump).
Bug 11742 that was referenced is just related to checksum errors
problems.
How do you do your heavy i/o? just regular writes or mmap writes or
what?
Bye,
Oleg
More information about the lustre-discuss
mailing list