Crossing the Performance Chasm with OpenPOWER

Crossing the Performance Chasm with
OpenPOWER
Dr. Srini Chari
Cabot Partners/IBM
[email protected]
#OpenPOWERSummit
Join the conversation at #OpenPOWERSummit 1
Disclosure
Copyright® 2015. Cabot Partners Group. Inc. All rights reserved. Other companies’ product names,
trademarks, or service marks are used herein for identification only and belong to their respective owner.
All images and supporting data were obtained from IBM , NVIDIA, Mellanox or from public sources. The
information and product recommendations made by the Cabot Partners Group are based upon public
information and sources and may also include personal opinions both of the Cabot Partners Group and
others, all of which we believe to be accurate and reliable. However, as market conditions change and
not within our control, the information and recommendations are made without warranty of any kind.
The Cabot Partners Group, Inc. assumes no responsibility or liability for any damages whatsoever
(including incidental, consequential or otherwise), caused by your or your client’s use of, or reliance
upon, the information and recommendations presented herein, nor for any inadvertent errors which may
appear in this document. This paper was developed with IBM funding. Although the paper may utilize
publicly available material from various vendors, including IBM, it does not necessarily reflect the
positions of such vendors on the issues addressed in this document.
Please read accompanying Cabot Partners Whitepaper for Additional Detail,
References and Notes on information presented in this document.
Join the conversation at #OpenPOWERSummit
2
Agenda
 Key Technology Trends
•
•
Cloud, HPC, Analytics, Social, Mobile and Internet of Things
Centered Around Data
 Open Innovation Vital for Value Creation
•
Data-Centric HPC Growing Rapidly
 Client Considerations in HPC Systems Evaluations
 System Attributes Impacting Real Life Performance
•




Why the LINPACK Benchmark is Inadequate
IBM Data Centric Approach and Solutions
Why OpenPOWER
Examples of Performance Gains
Key Takeaways
Join the conversation at #OpenPOWERSummit
3
Key Intertwined Technology Trends
Join the conversation at #OpenPOWERSummit
4
Key Intertwined Technology Trends
 Cloud, High Performance Computing, Analytics,
Social, Mobile and IoT
•
•
•
•
Enterprise cloud growth - $70B (2014) to $250B (2017)
Annual growth: Smart phones 20% . Mobile data 81%
IoT at 12B today reaching over 1 Trillion in a decade
Social media users - 1.79B (2014) to 2.44B (2018)
 DATA
•
•
2.5 exabytes (1018 bytes) created daily. Individuals create
70% and enterprises manage 80%
Annual spending 30% to reach $114B in 2018
Join the conversation at #OpenPOWERSummit
5
What IT Must Consider to Deal with Data







Volume
Variety
Velocity
Veracity
Vulnerability
Visualize
Virtualize
8
V
 Value
Join the conversation at #OpenPOWERSummit
6
Extracting Value From Data with HPC
Requires Open Innovation across the stack
Join the conversation at #OpenPOWERSummit
7
HPC Drives Value Across Many Industries
 Overall HPC servers growing ~ 6.4% annually
 Traditional HPC
Risk Analytics
Life Sciences
Oil and Gas
 Data-Centric HPC growing ~ 23.5%
Emergency Response
Fraud and Threat Detection
Enhance Customer Experience
Join the conversation at #OpenPOWERSummit
8
Considerations to Evaluate HPC Systems
 Not Just Point Benchmarks
 But Workflows Across the HPC Data Life Cycle
 Example in Seismic Processing
Join the conversation at #OpenPOWERSummit
9
Total Value of Ownership Framework
Holistic Cost Benefit Analysis for Entire HPC Workflow

Value Delivered
•
•
•

Business Value: e.g. customer revenues, new business models, compliance
regulations, better products, increased business insight, faster time to market, and
new breakthrough capability
Operational Value: e.g. faster time to results, more accurate analyses, more users
supported, improved user productivity, better capacity planning
IT Value: e.g. improved system utilization, manageability, administration, and
provisioning, scalability, reduced downtime, access to robust proven technology and
expertise.
Costs Incurred
•
•
•
•
IT /Data Center Capital e.g. new servers, storage, networks, power distribution units,
chillers, etc.
Data Center Facilities e.g. land, buildings, containers, etc.
Operational Costs: e.g. labor, energy, maintenance, software license, applications, etc.
Other Costs: e.g. system management, deployment and training, downtime,
migration, etc.
Join the conversation at #OpenPOWERSummit
10
What Impacts Traditional HPC Performance
Life Sciences
Computer Aided Engineering
Structures
Flops/Core
Flops/Core
Crash
Fluids
Network Bandwidth
Cores
Network Bandwidth
0
0
Network Latency
Memory Capacity
I/O Performance
Memory Bandwidth
Financial Services
Flops/Core
Quantum Chemistry
Molecular Modeling
Bioinformatics
Cores
Network Latency
Memory Capacity
I/O Performance
Memory Bandwidth
Energy and Environmental Sciences
Low Latency Trading
Reservoir
Seismic
Weather
Flops/Core
Monte Carlo
Risk Analytics
Network Bandwidth
Cores
Network Bandwidth
0
0
Network Latency
I/O Performance
Cores
Memory Capacity
Memory Bandwidth
Network Latency
I/O Performance
Join the conversation at #OpenPOWERSummit
Memory Capacity
Memory Bandwidth
11
Why LINPACK is Inadequate
Most HPC Analytics Involve Sparse Matrices
but LINPACK Solves Dense Matrix Problems
LINPACK vs. HPC Analytics
Flops/Core
LINPACK
HPC Analytics
Network Bandwidth
Cores
0
Network Latency
I/O Performance
Join the conversation at #OpenPOWERSummit
Memory Capacity
Memory Bandwidth
12
Data Centric System
Traditional System Design
IBM’s Data Centric Approach and Solutions
Join the conversation at #OpenPOWERSummit
13
Major HPC Win for OpenPOWER
Join the conversation at #OpenPOWERSummit
14
Key Benchmarks*: POWER8 2 - 2.5X Better
SPECint_rate2006
(greater is better)
1.8 x Performance
60
70
Performance / Core
Performance / Core
80
60
50
40
30
20
10
0
Dell PowerEdge
T620
2s/36c/72t
Intel Xeon Haswell
POWER S824
2s/24c/192t
IBM POWER8
SPECfp_rate2006
(greater is better)
2.1 x Performance
50
40
30
20
10
0
Dell PowerEdge
T620
2s/36c/72t
Intel Xeon Haswell
POWER S824
2s/24c/192t
IBM POWER8
Terasort Big Data Hadoop
(greater is better)
Stream Triad
(greater is better)
2.9 x Performance
Relative System Performance
350
3.0
300
2.5
250
2.0
200
GB/s
2.5x
1.5
150
100
1.0
50
0.5
0
Intel Xeon Haswell
2s/24c/48t
IBM POWER8
2s/24c/192t
0.0
POWER8
Join the conversation at #OpenPOWERSummit
Cisco
15
Application Performance with POWER8* 1.4 – 2.6 X better
Nanoseconds / day
Molecular Dynamics - NAMD apoa1
(greater is better)
1.4 x Performance
3
3
2.5
2.5
2
2
1.5
1.5
1
1
0.5
0.5
0
Intel E5-2690 V3
2s/24c/2.6GHz
Intel Xeon Haswell
POWER S824L
2s/24c/3.6GHz
IBM POWER8
STAC A2 – Options Pricing
(greater is better)
2.07 x Performance
0
Seismic – RTM
(greater is better)
2.6 x Performance
2.6x
Intel E5-2690 V3
2s/24c/2.6GHz
Intel Xeon Haswell
POWER S822L
2s/24c/3.358GHz
IBM POWER8
PostGreSQL
(higher is better)
Max Paths (10 min)
3.00E+07
2.50E+07
2.00E+07
1.50E+07
1.00E+07
5.00E+06
0.00E+00
4 x Intel E7-4890 v2
2.80 GHz /1TB
Intel Ivy Bridge EX
Power S824
2s/24c/3.52GHz/1TB
Join the conversation at #OpenPOWERSummit
16
Key Takeaways
 HPC is becoming more data-centric
 Traditional system evaluations based on point
benchmarks such as LINPACK are inadequate
 Focus evaluation on cost-benefit analysis of workflow
across HPC data lifecycle
 Many system features impact HPC performance
 Benefits of OpenPOWER HPC Offerings:
•
•
•
•
Deliver Choice and Flexibility
Minimize Costly Data Motion for Entire Workflow
Accelerate Compute and Data Intensive Tasks with Lower TCO
Provide Investment Protection
Join the conversation at #OpenPOWERSummit
17
*Appendix - Additional Benchmark Detail
SPECcpu (int_rate & fp_rate)
SPECcpu2006 results are based on best published results on E5-2699 v3 from the top 5 Intel system vendors (HP, Oracle, Lenovo, Dell, Fujitsu)
submitted as of 9/8/2014. For more information go to http://www.specbench.org/cpu2006/results/ . The IBM POWER8 published data is
based on Power S824 2s/24c/3.5GHz POWER8. The x86 Xeon published data is based on Dell PowerEdge T620 2s/36c/2.3GHz E5-2699 v3.
Hadoop Tersort
IBM Analytics Stack: IBM Power System S822L; 8 nodes each with 24 cores / 192 threads, POWER8; 3.0GHz, 512 GB memory, RHEL 6.5,
InfoSphere BigInsights 3.0
Cisco Stack: 16 high-density Cisco UCS C240 M3 Rack Servers each with 16 cores / 32 threads, Intel Xeon E5-2665; 2.4 GHz, 256 GB of
memory, Cisco UCS VIC 1225, and LSI 9266 8i with 24 1-TB SATA 7200-rpm disk running Apache Hadoop open source distribution.
Stream Triad
The Stream Triad results are based on results reported in published papers.
IBM POWER8:
http://www.dcs.warwick.ac.uk/~sdh/pmbs14/PMBS14/Workshop_Schedule_files/2-PerformancePower8.pdf
Intel Xeon E5-2600 v3
http://www.google.com/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&ved=0CB8QFjAA&url=http%3A%2F%2Fdownload.boston.co.uk%2Fd
ownloads%2F9%2F3%2Fc%2F93c022fd-0d6d-46a4-9124-28c9e32f2533%2FIntelWhitepaper.pdf&ei=mLgBVbysL8KrggT774CICw&usg=AFQjCNFal5q5Vz2ly6ZbsaKZ2QPPad1fg&sig2=3LzktTXeKPvS2QW9ndXgfQ&bvm=bv.87920726,d.eXY
STAC and all STAC names are trademarks or registered trademarks of Securities Technology Analysis Center LLC.
https://stacresearch.com/system/files/asset/files/STAC-A2%20Intel%20Composer%20on%204%20x%20IVT%20EX%20-%20INTC140509.pdf
Join the conversation at #OpenPOWERSummit
18