[Lustre-discuss] Solved: panic on jbd:journal_dirty_metadata
Michael Sternberg
sternberg at anl.gov
Tue Aug 3 16:27:46 PDT 2010
Hello Wojciech,
Confirmed - I built and installed the patch as well, and the problem hasn't occurred again here either - Thank you!
For reference, I'm using the released kernel and e2fsprogs rpm plus three rebuilt rpms. The patch only affects obdfilter.ko in lustre-modules. "nm /lib/modules/2.6.18-164.11.1.el5_lustre.1.8.3/kernel/fs/lustre/obdfilter.ko" produced identical output before and after the patch, which I found reassuring.
# rpm -qa | grep -e e2fs -e lustre | sort
e2fsprogs-1.41.10.sun2-0redhat
kernel-2.6.18-164.11.1.el5_lustre.1.8.3
lustre-1.8.3-2.6.18_164.11.1.el5_lustre.1.8.3_<date>
lustre-ldiskfs-3.0.9-2.6.18_164.11.1.el5_lustre.1.8.3_<date>
lustre-modules-1.8.3-2.6.18_164.11.1.el5_lustre.1.8.3_<date>
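The nm comparison mentioned above can be scripted roughly as follows. This is only a sketch: the function name compare_nm_listings and the file names nm.before and nm.after are made up for illustration, and it assumes the nm output was saved to a file once before and once after installing the rebuilt lustre-modules rpm. Matching listings indicate that the module's symbol table is unchanged; they do not by themselves show which code is inside the functions.

```shell
#!/bin/sh
# Sketch: compare two saved `nm` listings of a kernel module.
# The function name and the file names used below are hypothetical.

# compare_nm_listings BEFORE AFTER
# Prints "identical" and returns 0 when the sorted listings match,
# prints "different" and returns 1 when they do not,
# and returns 2 when either file cannot be read.
compare_nm_listings() {
    before_sorted=$(sort "$1") || return 2
    after_sorted=$(sort "$2") || return 2
    if [ "$before_sorted" = "$after_sorted" ]; then
        echo "identical"
        return 0
    fi
    echo "different"
    return 1
}

# Intended use (not run here):
#   ko=/lib/modules/2.6.18-164.11.1.el5_lustre.1.8.3/kernel/fs/lustre/obdfilter.ko
#   nm "$ko" > nm.before     # with the released lustre-modules rpm installed
#   nm "$ko" > nm.after      # after installing the rebuilt rpm
#   compare_nm_listings nm.before nm.after
```

Sorting both listings first keeps the comparison independent of the order in which nm prints the symbols.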
With best regards,
Michael
On Jul 25, 2010, at 18:08 , Wojciech Turek wrote:
> Hi Michael,
>
> Our OSTs were also nearly full when the problem occurred. Since installing the patch, we haven't had a single occurrence of it.
>
> Cheers
>
> Wojciech
>
> On 24 July 2010 17:06, Michael Sternberg <sternberg at anl.gov> wrote:
> Wojciech,
>
> Thank you very much for your pointer. Perhaps the fact that the OSTs are nearly full contributes(?). I also see higher usage.
>
> In any case, I'll attempt compilation with the patch applied.
>
>
> With best regards,
> Michael
>
>
> On Jul 22, 2010, at 9:16 , Wojciech Turek wrote:
>
> > Hi Michael,
> >
> > This looks like the problem we had some time ago after upgrading to 1.8.3:
> >
> > https://bugzilla.lustre.org/show_bug.cgi?id=22889
> >
> > Best regards
> > Wojciech
> >
> > On 20 July 2010 00:00, Michael Sternberg <sternberg at anl.gov> wrote:
> > Hello,
> >
> > I use OSSs with external journal partitions, and since lustre-1.8.1 I have been getting frustrating panics on the OSSs about once or twice a week, as follows:
> >
> > :libcfs:cfs_alloc ...
> > :lvfs:lprocfs_counter_add ...
> > ...
> >
> > RIP [<ffffffff88031e64>] :jbd:journal_dirty_metadata+0x7f/0x1e3
> > RSP <ffff8101f99c3670>
> > <0>Kernel panic - not syncing: Fatal exception
> >