DELL POWEREDGE C5220: HADOOP MAPREDUCE PERFORMANCE
Every year, the amount of data that businesses must process grows
enormously. The ability to sort, filter, and analyze this data is increasingly
vital to businesses that need to understand their customers and their market
segment. These businesses also need an infrastructure that is powerful and
flexible, yet compact and easy to scale. The Dell PowerEdge C5220 server is
an ideal solution to pair with Apache Hadoop, an open-source framework for
distributed, multi-node data analysis. With the PowerEdge C5220, organizations
can scale out to meet their data processing requirements and handle these
ever-increasing data volumes, finding new value in their big data.
To test the Hadoop performance capabilities of the Dell PowerEdge
C5220, we configured eight Dell PowerEdge C5220 servers into a Hadoop cluster
and ran the MapReduce benchmark (mrbench) on the platform. We found that
eight Dell PowerEdge C5220 servers, all contained within the single shared
infrastructure design of the Dell PowerEdge C5000 chassis, could run our
mrbench tests of varying sizes, and with varying numbers of map and reduce
processes, in times averaging just 15.9 to 25.6 seconds, making this platform
ideal for scale-out data-analysis application workloads.
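As a rough illustration of what a sweep over mrbench sizes, map counts, and
reduce counts can look like, the sketch below builds one command line per
parameter combination. It is not the harness used for this report. The
-numRuns, -maps, -reduces, and -inputLines options are standard mrbench
options, but the helper name build_mrbench_commands, the jar file name, and
every parameter value shown are hypothetical placeholders rather than the
tested configuration.

```python
from itertools import product


def build_mrbench_commands(test_jar, maps_list, reduces_list,
                           input_lines_list, num_runs=1):
    """Return one `hadoop jar ... mrbench` argument list per combination.

    The commands are only constructed here, not executed, so the sweep can
    be reviewed before it is launched on a cluster.
    """
    commands = []
    for maps, reduces, lines in product(maps_list, reduces_list,
                                        input_lines_list):
        commands.append([
            "hadoop", "jar", test_jar, "mrbench",
            "-numRuns", str(num_runs),    # repetitions averaged by mrbench
            "-maps", str(maps),           # number of map tasks per job
            "-reduces", str(reduces),     # number of reduce tasks per job
            "-inputLines", str(lines),    # size of the generated input
        ])
    return commands


if __name__ == "__main__":
    # Placeholder values for illustration only; the jar name depends on the
    # Hadoop distribution and version installed on the cluster.
    sweep = build_mrbench_commands("hadoop-test.jar", [2, 4], [1, 2], [100],
                                   num_runs=5)
    for cmd in sweep:
        print(" ".join(cmd))
```

Each generated list can be passed to subprocess.run on a node with the Hadoop
client installed. Keeping command construction separate from execution makes
the parameter grid easy to inspect and record alongside the results.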
A PRINCIPLED TECHNOLOGIES TEST REPORT
Commissioned by Dell Inc.; April 2012