[Lustre-discuss] kernel panic with 1.6.5rc2 on mds

Oleg Drokin Oleg.Drokin at Sun.COM
Sun May 18 16:58:20 PDT 2008


Hello!

On May 16, 2008, at 6:45 AM, Patrick Winnertz wrote:

> As I wrote in #11742 [1] I experienced a kernel panic after doing  
> heavy I/O
> on the 1.6.5rc2 cluster on the mds.  Since nobody answered to this bug
> until now (and I think in other cases the lustre team is _really_ fast
> (thanks for that :))) I fear that it was not recognised by anybody.

I just looked into the logs, you have out of memory issues at the very  
least
during that i/o. Also checksum errors. The log you uploaded does not  
contain
actual crash info, but rather only these messages (that do not cause  
crash in itself),
followed by oom and checksum error messages.
I do not see any panic messages in your logs. Any chance you have a  
serial console
or other way to see what was the actual panic complete with stacktrace  
and other
useful info? (ideally a crashdump).
Bug 11742 that was referenced is just related to checksum errors  
problems.
How do you do your heavy i/o? just regular writes or mmap writes or  
what?

Bye,
     Oleg



More information about the lustre-discuss mailing list