[Lustre-discuss] Lustre with 10GbE or Infiniband?

Kevin Van Maren Kevin.Vanmaren at Sun.COM
Wed Feb 11 19:00:14 PST 2009


One more clarification: those IB numbers were for MPI, not Lustre.

Kevin


On Feb 11, 2009, at 6:46 PM, Kevin Van Maren <Kevin.Vanmaren at Sun.COM>  
wrote:

> Charles Taylor wrote:
>> On Feb 11, 2009, at 4:35 PM, Scott Atchley wrote:
>>
>>
>>> To add to Brian's comments, IB 4X SDR is limited to about 700-750  
>>> MB/s
>>> by the fabric. O2IBLND cannot go faster than minimum of either the
>>> fabric or PCI-E connection allow.
>>>
>>
>> Hmmm.   I can agree with the second part of that statement but I
>> question the first.   We've measured much closer to the 1GByte/sec
>> wire rate of IB using several different tools.  750 GBytes/sec
>> corresponds to roughly 6 GBits/sec.   You lose 2 of the 10 Gbits to
>> encoding (8B10) so line rate is really 8GBits/sec or 1 GByte/sec.
>> Yes, you'll lose some more to protocol and swtiching overhead but it
>> is not anywhere near an additional 2 GBits/sec - in our experience.
>>
>
> Correct.  Infinipath SDR was getting ~980 MB/s, and DDR HCAs in SDR  
> mode
> can also do quite well in an x8 PCIe slot.
>
> The PCI-X HCAs were limited to around 850MB/s by the bus, and PCIe  
> HCAs
> _are_ likewise limited to around 700-750MB/s -- but only in a PCIe  
> x4 slot.
>
> DDR IB (unless using a PCIe gen2 connectX card, or a x16 Infinipath
> card) are also
> limited to around 1450-1600 MB/s by the PCIe x8 bus, with a wire speed
> of 2000 MB/s.
>
> QDR IB, in a Gen2 x8 PCIe slot, are also going to be limited to <<
> 4000MB/s line rate
> (should expect around twice the BW of the gen1 PCIe slots).
>
> The IB headers are very small, compared to a 2KB or 4KB packet size,  
> but
> the PCIe
> headers (and eg flow-control overhead) are quite large compared to a
> typical 256B packet size.
>
> To clarify one point: IB advertises the "signaling" rate, so the 10Gb
> includes the overhead
> bits, as 8 bits are encoded in a 10 bit representation for
> transmission.  So 10Gb/s = 1GB/s,
> with 10-bit bytes.  Ethernet, on the other hand, always advertises the
> "data" rate, so 10Gb
> Ethernet is 1.25GB/s (12.5Gb/s signaling rate), as there are 8 bits  
> in a
> byte.  Ethernet packet
> headers are also effectively a bit larger than for IB (with IFG,
> preamble, etc).
>
> Kevin
>
>> Just ran a quick IMB (formerly Pallas) between a couple of our SDR
>> nodes and got 860 MBytes/sec (ping-pong, 4MB).   So I don't think
>> there is anything inherent in SDR IB that limits you to 750 MBytes/
>> sec.   However, running IPoIB will  probably limit you to something
>> even less than that which is why you should use the O2IBLND if you
>> want the real benefit of IB.
>>
>> Just our experience,
>>
>> Charlie Taylor
>> UF HPC Center
>>
>> _______________________________________________
>> Lustre-discuss mailing list
>> Lustre-discuss at lists.lustre.org
>> http://lists.lustre.org/mailman/listinfo/lustre-discuss
>>
>
> _______________________________________________
> Lustre-discuss mailing list
> Lustre-discuss at lists.lustre.org
> http://lists.lustre.org/mailman/listinfo/lustre-discuss



More information about the lustre-discuss mailing list