[lustre-discuss] Problem mounting over infiniband

Jon Tegner tegner at foi.se
Thu Apr 28 05:12:34 PDT 2016


Hi,

I have brought up a test system using

2.8.0-3.10.0_327.3.1.el7.x86_64_g96792ba

I can mount the system over TCP, but when I try to do so over InfiniBand 
I get errors of the type:

Can't accept conn from 10.0.51.1@o2ib, queue depth too large: 128 (<=8 wanted)

Can't accept conn from 10.0.51.1@o2ib (version 12): max_frags 32 incompatible without FMR pool (256 wanted)

After searching, I suspected it had something to do with the fact that we 
have Mellanox (mlx4_ib) on the server and QLogic (ib_qib) on the client.

I also found a possible solution: adding the following module options line:

options ko2iblnd peer_credits=124 concurrent_sends=62 map_on_demand=256

However, ko2iblnd has many options, and it is not obvious to me which 
values to choose. Is there a specific strategy one should follow?
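
Each of the errors above reports an offered value next to a wanted one, 
which suggests (though I have not confirmed it) that the two nodes load 
ko2iblnd with different settings. As a minimal sketch of how one might 
compare them, the Python below parses an "options ko2iblnd ..." line into a 
dictionary and lists the parameters that differ between two nodes. The 
function names parse_options and diff_params are hypothetical helpers, the 
second options line is invented purely for illustration, and the code 
assumes every value is an integer.

```python
def parse_options(line, module="ko2iblnd"):
    """Parse a modprobe-style 'options <module> key=value ...' line into a dict.

    Assumption: every value is an integer. String-valued parameters
    would need separate handling.
    """
    fields = line.split()
    if len(fields) < 2 or fields[0] != "options" or fields[1] != module:
        raise ValueError(f"not an options line for {module}: {line!r}")
    params = {}
    for field in fields[2:]:
        key, sep, value = field.partition("=")
        if not sep:
            raise ValueError(f"expected key=value, got {field!r}")
        params[key] = int(value)
    return params


def diff_params(server, client):
    """Return {name: (server_value, client_value)} for parameters that differ.

    A parameter that is set on only one side is reported with None
    for the other side.
    """
    names = sorted(set(server) | set(client))
    return {
        name: (server.get(name), client.get(name))
        for name in names
        if server.get(name) != client.get(name)
    }


if __name__ == "__main__":
    # The line quoted in this message.
    proposed = "options ko2iblnd peer_credits=124 concurrent_sends=62 map_on_demand=256"
    # Hypothetical settings for the other node, for illustration only.
    other = "options ko2iblnd peer_credits=8"
    for name, (a, b) in diff_params(parse_options(proposed), parse_options(other)).items():
        print(f"{name}: {a} vs {b}")
```

Any parameter this prints is one that is not set identically on both sides 
and may be worth checking.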

Regards,

/jon


