[Lustre-discuss] ll_ost_creat_* goes berserk (100% CPU used - OST disabled)

Adrian Ulrich adrian at blinkenlights.ch
Fri Aug 13 12:01:09 PDT 2010


Hi Alexey,

> In general a soft-lockup isn't an error; it is just a notice that some operation is taking too long (more than 10s, I think).
> The attached soft-lockup says the OST is busy creating objects after the MDS<->OST reconnect,

Yes, I know that a soft-lockup doesn't mean that I hit a bug, but having ll_ost_creat_* wasting 100% CPU
doesn't seem to be normal.

> I think your disks are too busy or the node is overloaded.

Disk %busy is < 5% for all attached disks.
The OST is doing almost nothing (there are a few read()s, that's all).


> If you have slow disks, a client can be disconnected before its request is processed, and that pending request then blocks the reconnect from that client.

The recovery of the clients seems to be OK: all clients can read and write data on the OST, but
there is something wrong between the MDS and OST0005.

But this might just be a side-effect of the ll_ost_creat_* issue :-/

Regards,
 Adrian


-- 
 RFC 1925:
   (11) Every old idea will be proposed again with a different name and
        a different presentation, regardless of whether it works.



