[Lustre-discuss] Slow read performance across OSSes
James Robnett
jrobnett at aoc.nrao.edu
Wed Oct 14 13:15:42 PDT 2009
After reading through my first post I felt some clarification was
probably warranted.
In this test setup there are two OSS, call them OSS-1 and OSS-2,
each has an OST, call them OSS-1-A, OSS-1-B and OSS-2-A, OSS-2-B.
The MDS, OSSes and client all have 1Gbit ethernet connections.
The following table illustrates the data rates I see in MB/s.
OST(s) Read Write
OSS-1-A 113 95
OSS-1-B 112 93
OSS-1-A OSS-1-B 112 98
OSS-2-A 105 93
OSS-2-B 115 94
OSS-2-A OSS-2-B 115 98
OSS-1-B OSS-2-A ---> 42 113
OSS-1-A OSS-2-B ---> 42 114
OSS-1-A OSS-1-B OSS-2-A OSS-2-B ---> 46 114
The write numbers are almost exactly what I'd expect across
1Gbit. 96MB/s or so between the client and a single OSS and
nearly full rate (112MB/s) with two OSSes.
The 113MB/s read numbers for a single OSS (one or more OST's) are
also pretty much exactly what I'd expect. It's the 40MB/s reads
when utilizing 2 OSSes that are throwing me.
I can envision that there would be more re-assembly overhead on
the client in the case of 2 OSSes(1) but I'm surprised it's that high.
Is this an expected result ?
If it's unexpected is there a common misconfiguration or client
short coming that causes it to be slower when reading from multiple
OSSes ?
Is there some command I could run or data I could provide that would
help identify the issue ? I'm fairly new to Lustre so I'm just as
likely to flood noise as signal if I just randomly appended data
beyond raw rates.
I just upgraded to 1.8.1.1 which had no effect.
James Robnett
NRAO/NM
1) I'm assuming in the case of a single OSS with 2 OSTs the OSS
presents the client with a single stream. If assembly of two data
streams is required on the client in both the single and dual OSS
(both with 2 OSTs) cases then I'm even more confused about those
results.
James Robnett wrote:
> The nodes are a bit cobbled together from what I had handy.
>
> One MDS: Dual quad-core 2.5GHz nehalem 8GB RAM E1000 gigabit NIC
> MDT is just a partition on a 1TB SAS Seagate
> Two OSS: Single dual core 2.8GHz Xeon, 4GB RAM single gigabit NIC
> Dual 3ware 9550SX cards with 7+1 RAID 5 across 400GB WD SATA
> drives.
> Two OST/OSS: 2TB. Configured as LVM. 1 and 4MB stripe size tried.
> Client: Dual quad-core 2.5 GHz Xeon, 8GB RAM single gigabit NIC
> Network: Dedicated Cisco 2960g Gigabit switch
More information about the lustre-discuss
mailing list