[Lustre-discuss] Soft CPU Lockup

Hendelman, Rob Rob.Hendelman at magnetar.com
Mon Oct 5 13:40:22 PDT 2009


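It looks like the threads finally died. The two CPU cores that were
pegged at 100% are idle again.

That seems like one heck of a timeout: going by the log below, the
watchdogs were only disabled after about 5595 seconds (roughly 93
minutes).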
==============

Oct  5 14:10:59 maglustre04 kernel: Lustre: 13366:0:(service.c:1317:ptlrpc_server_handle_request()) @@@ Request x6413848 took longer than estimated (100+5495s); client may timeout.  req@ffff81009308c400 x6413848/t0 o101->1b9e4991-1d5e-814d-2607-8c52f432e68d@:0/0 lens 232/288 e 0 to 0 dl 1254764364 ref 1 fl Complete:/0/0 rc 301/301
Oct  5 14:10:59 maglustre04 kernel: Lustre: 13421:0:(watchdog.c:330:lcw_update_time()) Expired watchdog for pid 13421 disabled after 5595.8041s
Oct  5 14:10:59 maglustre04 kernel: Lustre: 13366:0:(service.c:1317:ptlrpc_server_handle_request()) Skipped 1 previous similar message
Oct  5 14:10:59 maglustre04 kernel: Lustre: 13366:0:(watchdog.c:330:lcw_update_time()) Expired watchdog for pid 13366 disabled after 5595.8059s
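
For anyone who wants to pull the durations out of messages like these, here is a minimal Python sketch. It is not a Lustre tool: the regular expressions and the summarize() helper are hypothetical names written against the exact wording of the lines above, with the mail client's line wraps removed. Reading "(100+5495s)" as an estimate plus an overrun, and adding the two, is an assumption, though the sum does match the watchdog figures.

```python
import re

# Sample lines copied from the log above, with the mail wraps removed
# and the first message trimmed after "client may timeout.".
LOG = """\
Oct  5 14:10:59 maglustre04 kernel: Lustre: 13366:0:(service.c:1317:ptlrpc_server_handle_request()) @@@ Request x6413848 took longer than estimated (100+5495s); client may timeout.
Oct  5 14:10:59 maglustre04 kernel: Lustre: 13421:0:(watchdog.c:330:lcw_update_time()) Expired watchdog for pid 13421 disabled after 5595.8041s
Oct  5 14:10:59 maglustre04 kernel: Lustre: 13366:0:(watchdog.c:330:lcw_update_time()) Expired watchdog for pid 13366 disabled after 5595.8059s
"""

# Patterns are assumptions based only on the message text shown above.
SLOW_REQ = re.compile(r"Request (x\d+) took longer than estimated \((\d+)\+(\d+)s\)")
WATCHDOG = re.compile(r"Expired watchdog for pid (\d+) disabled after ([\d.]+)s")


def summarize(log):
    """Return (slow_requests, watchdogs) found in the log text.

    slow_requests: list of (request id, sum of the two bracketed numbers in seconds)
    watchdogs:     list of (pid, seconds until the watchdog was disabled)
    """
    slow = [(m.group(1), int(m.group(2)) + int(m.group(3)))
            for m in SLOW_REQ.finditer(log)]
    dogs = [(int(m.group(1)), float(m.group(2)))
            for m in WATCHDOG.finditer(log)]
    return slow, dogs


if __name__ == "__main__":
    slow, dogs = summarize(LOG)
    for xid, total in slow:
        print(f"request {xid}: {total}s total ({total / 60:.1f} min)")
    for pid, secs in dogs:
        print(f"pid {pid}: watchdog disabled after {secs:.1f}s ({secs / 60:.1f} min)")
```

Run against a saved syslog, the same two patterns should pick up every matching line, provided each message sits on a single line as it does in the original kernel log.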

Robert Hendelman Jr
Magnetar Capital LLC
Rob.Hendelman at magnetar.com
1-847-905-4557





