[lustre-discuss] 'queue depth too large', but connection works
t.roth at gsi.de
Sat Jan 29 09:46:45 PST 2022
test system: servers 2.12.7, and a client 2.12.6., all mlx4.
The client has some non-default ko2iblnd parameters, including "peer_credits=16".
I mounted my test system there and happily copied around some directories. Only afterwards I found
> LNetError: 5278:0:(o2iblnd_cb.c:2551:kiblnd_passive_connect()) Can't accept conn from 10.20.3.64 at o2ib6, queue depth too large: 16 (<=8 wanted)
in the MDS log.
I did read LU-3322, but obviously did not the point. "Can't accept conn" used to deny client access, but the MDS that didn't like my client just
created some ~25k new objects on behalf of that client.
Does this mean client and server negotiate a suitable value, but behind the scenes?
More information about the lustre-discuss