[lustre-devel] Cannot find documentation or comment blocks specifically defining lnet metrics

Jeff Johnson jeff.johnson at aeoncomputing.com
Mon Jan 22 11:35:23 PST 2024


Greetings all...

I'm modifying a Lustre metrics reporting tool to incorporate lnet metrics
into the data it gathers. I dug through all the documentation I could find
and searched the source for comment blocks, but didn't find anything
definitive. I'm hoping someone here can clarify. While some metrics are
obvious (drop_*), there are others I would rather not develop against
based on assumptions.

msgs_alloc
msgs_max
rst_alloc
send_count
recv_count
route_count
drop_count
send_length
recv_length
route_length
drop_length

If someone can point me to a document that clearly defines the above, I'd
appreciate it; so far I can't find one. As I said, I don't want to develop
based on assumptions.

For example, when sampled over a short period of time does...

((X_length / X_count) / # samples) = avg rpc size of X type?

Send, Recv, Route and Drop are fairly clear. I am assuming Length and Count
are just that: n bytes and n IOs. msgs_alloc, msgs_max and rst_alloc are
less obvious.
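
To make the question concrete, here is a minimal Python sketch of the
calculation I have in mind. It rests entirely on the unconfirmed assumption
that X_count and X_length are cumulative counters that only increase (units
and bytes respectively). If that holds, the ratio of the deltas between two
samples already gives the average size over the interval, with no further
division by the number of samples. The function name and the sample values
are hypothetical, not taken from any Lustre tool.

```python
def avg_size_between_samples(prev, curr, kind):
    """Average bytes per counted unit of `kind` between two samples.

    ASSUMPTION (not confirmed by any documentation I could find):
    `<kind>_count` and `<kind>_length` are cumulative, monotonically
    increasing counters.
    """
    d_count = curr[f"{kind}_count"] - prev[f"{kind}_count"]
    d_length = curr[f"{kind}_length"] - prev[f"{kind}_length"]
    # No traffic in the interval, or a counter went backwards (for
    # example after a reset): no meaningful average can be computed.
    if d_count <= 0 or d_length < 0:
        return None
    return d_length / d_count


# Made-up sample values, for illustration only.
prev_sample = {"send_count": 1000, "send_length": 4_096_000}
curr_sample = {"send_count": 1500, "send_length": 6_144_000}

print(avg_size_between_samples(prev_sample, curr_sample, "send"))
```

If the counters turn out to be gauges, or are reset on read, this
calculation would be wrong, which is exactly why I'm asking.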

Thanks!

--Jeff