[Lustre-devel] Testing LNET
    Nicholas Henke 
    nic at cray.com
       
    Thu Jul 16 14:20:25 PDT 2009
    
    
  
Scott Atchley wrote:
> Hi all,
>
> I have implemented handling of hosts with different PAGE_SIZE in  
> MXLND. I am running tests to make sure that I did not accidentally  
> break something else. So far, I have been using lctl and pinging back  
> and forth as well as with obdecho (using loadgen).
>
> When running loadgen tests or lctl test_brw with loadgen's echosrv  
> running, if I kill a host (either client or server) and bring it back  
> up, LNET seems happy (MXLND reconnects normally), but loadgen does not  
> resume. When using lctl test_brw and I restart a server, the client  
> reconnects but then fails an assertion when it connects to a new  
> server with:
>
> LustreError: 6295:0:(echo_client.c:1341:echo_client_cleanup())  
> ASSERTION(eco->eco_refcount == 0) failed
>
> If this the proper way to test a LND? What other methods can you  
> suggest?
>   
I've been using LNet SelfTest for my LND development and testing.  It is 
ok once you get things working - but it is painful to script around and 
to trace LST errors back to LND transactions.
Nic
    
    
More information about the lustre-devel
mailing list