[Lustre-devel] Testing LNET
Nicholas Henke
nic at cray.com
Thu Jul 16 14:20:25 PDT 2009
Scott Atchley wrote:
> Hi all,
>
> I have implemented handling of hosts with different PAGE_SIZE in
> MXLND. I am running tests to make sure that I did not accidentally
> break something else. So far, I have been using lctl and pinging back
> and forth as well as with obdecho (using loadgen).
>
> When running loadgen tests or lctl test_brw with loadgen's echosrv
> running, if I kill a host (either client or server) and bring it back
> up, LNET seems happy (MXLND reconnects normally), but loadgen does not
> resume. When using lctl test_brw and I restart a server, the client
> reconnects but then fails an assertion when it connects to a new
> server with:
>
> LustreError: 6295:0:(echo_client.c:1341:echo_client_cleanup())
> ASSERTION(eco->eco_refcount == 0) failed
>
> If this the proper way to test a LND? What other methods can you
> suggest?
>
I've been using LNet SelfTest for my LND development and testing. It is
ok once you get things working - but it is painful to script around and
to trace LST errors back to LND transactions.
Nic
More information about the lustre-devel
mailing list