[Lustre-discuss] 1.8.1.1
Papp Tamás
tompos at martos.bme.hu
Thu Nov 19 12:00:38 PST 2009
Craig Prescott wrote, On 2009. 11. 19. 20:42:
> Papp Tamás wrote:
>> The logs are full with this:
>>
>> Nov 19 20:03:32 node1 kernel: BUG: soft lockup - CPU#3 stuck for 10s!
>> [ll_ost_80:4894]
>> Nov 19 20:03:32 node1 kernel: CPU 3:
> <snip>
>> Nov 19 20:03:34 node1 kernel: Lustre: Skipped 40339060 previous
>> similar messages 0; still busy with 3 active RPCs
>
> We had the same problem with 1.8.x.x.
>
> We set lnet.printk=0 on our OSS nodes and it has helped us
> dramatically - we have not seen the 'soft lockup' problem since.
>
> sysctl -w lnet.printk=0
>
> This will turn off all but 'emerg' messages from lnet.
>
> It would be interesting to know if this avoided the lockups for you, too.
I set it up.
We'll see.
Thank you very much!
tamas
More information about the lustre-discuss
mailing list