Huge amounts of information are flooding companies every second, which has led to an increased focus on big data and the ability to capture and analyze this sea of information. Enterprises are turning to big data and Apache Hadoop in order to improve business performance and provide a competitive advantage. But to unlock business value from data quickly, easily and cost-effectively, organizations need to find and deploy a truly reliable Hadoop infrastructure that can perform, scale, and be used safely for mission-critical applications.
As more and more Hadoop projects are being deployed to provide actionable results in real-time or near real-time, low latency has become a key factor that influences a company’s Hadoop distribution choice. Thus, performance and scalability should be evaluated closely before choosing a particular Hadoop solution.
The raw performance of a Hadoop platform is critical; it refers to how quickly the platform can ingest, process and analyze information. The MapR Distribution for Hadoop in particular provides world-record performance for MapReduce operations on Hadoop. Its advanced architecture harnesses distributed metadata with an optimized shuffle process, delivering consistent high performance.
The graph below compares the MapR M7 Edition with another Hadoop distribution, and it vividly illustrates the vast difference in latency and performance between these Hadoop distributions.
One particular solution that is optimized for performance is Cisco UCS with MapR. MapR on the Cisco Unified Computing System™ (Cisco UCS®) is a powerful, production-ready Hadoop solution that increases business and IT agility, supports mission-critical workloads, reduces total cost of ownership (TCO), and delivers exceptional return on investment (ROI) at scale.
Tags: Big Data, blade server, Blade Servers, Cisco UCS, Cisco UCS C240 M3 Rack Server, Cisco Unified Computing System, Cisco Unified Data Center, Cisco Unified Fabric, Hadoop, MapR, rack server, UCS Central, UCS service profiles
While there is not yet an industry standard benchmark for measuring the performance of Hadoop systems (yes, there is work in progress -- WBDB, BigDataTop100, etc.), workloads like TeraSort have become a popular choice to benchmark and stress test Hadoop clusters.
TeraSort is very simple and consists of three map/reduce programs: (i) TeraGen, which generates the dataset; (ii) TeraSort, which samples and sorts the dataset; and (iii) TeraValidate, which validates the output. With multiple vendors now publishing TeraSort results, organizations can make reasonable performance comparisons while evaluating Hadoop clusters.
We conducted a series of TeraSort tests on our popular Cisco UCS Common Platform Architecture (CPA) for Big Data rack: 16 Cisco UCS C240 M3 Rack Servers, each equipped with two Intel Xeon E5-2665 processors, running the Apache Hadoop distribution (see figure below). The results demonstrate industry-leading performance and scalability over a range of data set sizes from 100GB to 50TB. For example, out of the box, our 10TB result is 40 percent faster than HP’s published result on 18 HP ProLiant DL380 Servers, each equipped with two Intel Xeon E5-2667 processors.
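To make the three stages concrete, here is a minimal in-memory sketch in Python. It is a toy illustration, not the Hadoop implementation: the function names, partition count, and sample size are assumptions chosen for this example, and everything runs in a single process. It does keep the record layout TeraSort uses (100-byte records with a 10-byte key) and the sample-then-partition idea that lets each partition be sorted independently.

```python
import random
from bisect import bisect_right

KEY_BYTES = 10      # TeraSort records carry a 10-byte key...
RECORD_BYTES = 100  # ...inside a 100-byte record


def teragen(num_records, seed=0):
    """Stand-in for TeraGen: generate random fixed-size records."""
    rng = random.Random(seed)
    return [bytes(rng.getrandbits(8) for _ in range(RECORD_BYTES))
            for _ in range(num_records)]


def terasort(records, num_partitions=4, sample_size=100, seed=1):
    """Stand-in for TeraSort: sample keys, partition by range, sort each part."""
    partitions = [[] for _ in range(num_partitions)]
    if not records:
        return partitions
    rng = random.Random(seed)
    sample = sorted(r[:KEY_BYTES]
                    for r in rng.sample(records, min(sample_size, len(records))))
    # Pick num_partitions - 1 evenly spaced split keys from the sorted sample.
    splits = [sample[i * len(sample) // num_partitions]
              for i in range(1, num_partitions)]
    # Route each record to the key range it falls into.
    for rec in records:
        partitions[bisect_right(splits, rec[:KEY_BYTES])].append(rec)
    # Each partition is sorted on its own; read in order, they are globally sorted.
    return [sorted(p, key=lambda r: r[:KEY_BYTES]) for p in partitions]


def teravalidate(partitions, expected_count):
    """Stand-in for TeraValidate: check record count and global key order."""
    keys = [r[:KEY_BYTES] for p in partitions for r in p]
    in_order = all(a <= b for a, b in zip(keys, keys[1:]))
    return len(keys) == expected_count and in_order


if __name__ == "__main__":
    data = teragen(1000)
    print(teravalidate(terasort(data), len(data)))
```

In a real cluster the partitions would be handled by separate reduce tasks on different nodes, which is why the sampling step matters: well-chosen split keys keep the work evenly spread.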
While Hadoop offers many advantages for organizations, the Cisco story isn’t complete without including collaborations with our ecosystem partners that enable us to offer complete solution stacks. We support leading Hadoop distributions including Cloudera, HortonWorks, Intel, MapR, and Pivotal on our Cisco UCS Common Platform Architecture (CPA) for Big Data. We just announced our Big Data Design Zone that offers Cisco Validated Designs (CVD) -- pretested and validated architectures that accelerate the time to value for customers while reducing risks and deployment challenges.
Cisco Big Data Design Zone
Cisco UCS Demonstrates Leading TeraSort Benchmark Performance
Cisco UCS Common Platform Architecture (CPA) for Big Data
Tags: Big Data, Big Data Benchmarks, Cisco UCS C240 M3 Rack Server, Cisco UCS CPA, CPA, Hadoop, TeraSort, YCSB
Organizations use Cisco UCS servers to gain the power, flexibility, and management simplicity needed to meet their Microsoft SQL Server workload demands while increasing their IT agility.
Whether you start with standalone servers for performance and bandwidth or connect servers through Cisco UCS for automated configuration, simplified management, and massive I/O flexibility with SAN and network-attached storage (NAS) access, the pairing of Microsoft SQL Server with Cisco UCS gives business intelligence and OLTP applications exceptional connectivity to your data.
Let’s not forget about record-setting performance at lower cost, too! In its inaugural TPC-H™ result, Cisco asserted industry leadership in partnership with Microsoft, establishing Cisco UCS as the fastest 4-socket Intel Xeon processor-powered platform for running Microsoft SQL Server at the 1,000 GB scale factor.
Table 1 below outlines the flexibility of SQL Server on UCS, describing various sized configurations to support your data management needs. Here you can see how our B series or C series UCS servers support small to medium organizations up to the largest of enterprises.
Table 1 -- UCS SQL Server Sample Configurations
Want to learn more about Microsoft applications on Cisco UCS? Then please feel free to download this new Application Solutions Brochure and see how UCS provides an optimal platform for Microsoft SQL Server, SharePoint and other leading applications.
Tags: applications, Cisco, Cisco UCS C240 M3 Rack Server, Hyper-V, Microsoft, Microsoft SQL Server, UCS, UCS B250 M2
There is no better time to announce a world-record TPC-C benchmark result with Oracle than right before Oracle Open World 2012. The Cisco UCS C240 M3 Rack Server with two Intel® Xeon® E5-2690 2.9 GHz Processors and 768 GB memory running Oracle Database 11g Standard Edition achieved the best 2-processor performance record of 1.6 million transactions per minute, beating the nearest competitor, the IBM Flex System x240 Server with identical Intel processors and memory footprint running DB2, with 7% better performance and 11% lower price-performance. This benchmark result is yet another demonstration of the performance and price-performance advantages of Cisco UCS for running enterprise applications.
This result also demonstrates a 53% improvement in performance over the previous generation of servers (see chart 1) and is in line with Moore’s Law (see chart 2). (Yes, the TPC-C trend has been in line with Moore’s Law.)
Chart 1: TPC-C: 53% improvement from Cisco UCS M2 to M3 Generation
Chart 2: Transaction Rate vs. Moore’s Law
As of 9/27/2012. Source: Transaction Processing Performance Council (TPC), www.tpc.org.
 Cisco UCS C240 M3 Rack Server, 1,609,186.39 tpmC, $0.47/tpmC, available 9/27/2012
 IBM Flex System x240, 1,503,544 tpmC, $0.53/tpmC, available 8/16/12
 Cisco UCS C250 M2 Extended-Memory Server, 1,053,100 tpmC, $0.58/tpmC, available 12/07/2011
 HP ProLiant DL380 G7, 1,024,380 tpmC, $0.65/tpmC, available 06/20/2011
 R. Nambiar, M. Poess, Transaction Performance vs. Moore’s Law: A Trend Analysis: http://www.springerlink.com/content/fq6n225425151344/
About TPC and TPC-C
The Transaction Processing Performance Council (TPC) is a non-profit corporation founded to define transaction processing and database benchmarks and to disseminate objective and verifiable performance data to the industry.
Often referred to as the flagship server benchmark that measures online transaction processing performance, TPC-C simulates a complete compute environment where a population of users runs transactions against a database. TPC-C is not limited to the activity of any particular business segment, but rather represents any industry that must manage, sell, or distribute a product or service. The primary metrics are the transactions per minute (expressed as tpmC) and the associated price per transaction (expressed as $/tpmC).
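As a quick check on how these two metrics translate into the percentages quoted in this post, the short Python sketch below recomputes them from the published tpmC and $/tpmC figures listed above. The helper names are illustrative only, and the $/tpmC values are the rounded figures shown here, so results are approximate.

```python
# Published results quoted above: (tpmC, price per tpmC in US dollars).
RESULTS = {
    "Cisco UCS C240 M3": (1_609_186.39, 0.47),
    "IBM Flex System x240": (1_503_544.0, 0.53),
    "Cisco UCS C250 M2": (1_053_100.0, 0.58),
    "HP ProLiant DL380 G7": (1_024_380.0, 0.65),
}


def pct_higher(new, old):
    """Percentage by which `new` exceeds `old` (used for throughput)."""
    return (new / old - 1.0) * 100.0


def pct_lower(new, old):
    """Percentage by which `new` is below `old` (used for price per tpmC)."""
    return (1.0 - new / old) * 100.0


if __name__ == "__main__":
    c240_tpmc, c240_price = RESULTS["Cisco UCS C240 M3"]
    ibm_tpmc, ibm_price = RESULTS["IBM Flex System x240"]
    c250_tpmc, _ = RESULTS["Cisco UCS C250 M2"]

    print(f"Throughput vs. IBM x240:   {pct_higher(c240_tpmc, ibm_tpmc):.1f}% higher")
    print(f"Price/perf vs. IBM x240:   {pct_lower(c240_price, ibm_price):.1f}% lower")
    print(f"Throughput vs. UCS C250 M2: {pct_higher(c240_tpmc, c250_tpmc):.1f}% higher")
```

Rounded to whole numbers, these work out to the 7%, 11%, and 53% figures cited in the text.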
Tags: Cisco UCS, Cisco UCS C240 M3 Rack Server, Oracle Database 11g, TPC, TPC Benchmark, TPC-C