Outline概述
High-Performance Computing – Brief Intro 简介
R16.0 Update on HPC Features & Capabilities 新功能
• Optimized for latest HPC architectures
• Parallel performance & scaling
• Support “HPC Resources Anywhere”
HPC Hardware Selection – Workstation硬件选择
HPC Case Studies 应用案例
© 2015 ANSYS, Inc.
High-Performance Computing - HPC Defined
High Performance Computing (HPC) at ANSYS: • An ongoing effort designed to remove computing limitations from
engineers who use computer aided engineering in all phases of design, analysis, and testing.
• It is a hardware and software initiative!
Recent advances • Multi-core processors • Large amounts of RAM • SSDs • Hardware accelerators
• Graphics Processing Units (GPUs) from NVIDIA® and AMD® • Intel® Xeon Phi™ coprocessors (previously called Intel MIC)
© 2015 ANSYS, Inc.
High-Performance Computing - Typical HPC Growth Path
© 2015 ANSYS, Inc.
Cluster Users Desktop User Workstation and/or
Server Users
6
High-Performance Computing - Key Enabler for…
Enhanced engineering productivity • Accelerate simulation throughput • Consider more design variants • Make more efficient decisions earlier
in the product development cycle • Ensure robust design
Image courtesy of Timo Krappel, IHS, Universität Stuttgart and Voith Hydro Holding GmbH
© 2015 ANSYS, Inc.
Higher-fidelity insight into product performance! • Assess larger, more detailed models • Consider more complex physics • From single component to system innovations
Be Highly Optimized to Run Fast on the Latest HPC Architectures
● ANSYS Mechanical 16.0 demonstrates 20-40% performance gains on Intel’s latest Haswell processor vs. Sandy Bridge processor.
● ANSYS Mechanical 16.0 supports solver computation on Intel Xeon Phi coprocessors. Adding 1 Phi to 1-8 CPU cores leads to an average speedup of 1.8 times, particularly for bulkier models.
● ANSYS Fluent, HFSS, Mechanical and Polyflow supports solver computation on NVIDIA GPUs. This can lead to a speedup of up to 2.5 times.
ANSYS Features & Capabilities
Customer Benefits
© 2015 ANSYS, Inc. 8
● Better utilization of hardware, software licenses, and people.
● Engineers spend more time analyzing results, leading to improved product understanding.
● Accelerate your simulation throughput ● Get more insight into product performance ● Allow you to evaluate more product design
ideas
Optimized for Latest HPC Architectures
ANSYS Fluent 16.0
Background: ● ANSYS Fluent 14.5 was the first
commercial CFD program to support NVIDIA GPUs
● It was limited to shared-memory parallelism, single GPU per node only
NVIDIA GPU support ● Since R15 full multi-GPU, multi-
node support in distributed-memory mode
● Since R16 user control to pick and choose which equation to run on GPU
ANSYS Features & Capabilities
9 Jobs/day
27 Jobs/day
Simulation productivity
48 CPU cores 48 CPU cores + 4 Tesla K80s
Truck Model
External aerodynamics 14 million cells Steady, k-ε turbulence Coupled PBNS, DP Intel Xeon E5-2697v2; 48 CPU cores on 2 nodes 4 Tesla K80 GPUs
Higher is
Better
10 © 2015 ANSYS, Inc. June 11, 2015
Optimized for Latest HPC Architectures ANSYS Mechanical 16.0
Optimized for Intel Xeon E5 v3 processors: • ANSYS Mechanical 16.0 performs well on the
latest Intel processor architecture • Haswell processor-based system is 20% to
40% faster than Sandy Bridge processor-based system for a variety of benchmarks
ANSYS Features & Capabilities
11 © 2015 ANSYS, Inc. June 11, 2015
Optimized for Latest HPC Architectures ANSYS Mechanical 16.0
Background: • ANSYS Mechanical 15.0 was the first
commercial FEA program to support Intel Xeon Phi coprocessor
• It was limited to shared memory parallelism (SMP) on Linux only
Intel Xeon Phi coprocessor support • R16 now supports distributed
memory parallelism (DMP) and Windows
ANSYS Features & Capabilities
3.6
1.8
5.1
3.0
7.0
4.7
9.8
6.0
14.4
0
4
8
12
16
No Xeon Phi Xeon Phi
Spee
dup
1 core2 cores4 cores8 cores16 cores
12 © 2015 ANSYS, Inc. June 11, 2015
Optimized for Latest HPC Architectures ANSYS Mechanical 16.0
ANSYS Application Examples
Courtesy of HP
13 © 2015 ANSYS, Inc. June 11, 2015
Optimized for Latest HPC Architectures ANSYS Mechanical 16.0
ANSYS Application Examples
6 CPU cores + K80 GPU
1.8x
8 CPU cores 6 CPU cores + K80 GPU
2.3x
8 CPU cores
Higher is
Better
159 Jobs/day 135
Jobs/day
247 Jobs/day
371 Jobs/day
V15sp-4 Model
Turbine geometry 3.2 million DOF SOLID187 elements Static, nonlinear analysis Sparse direct solver
V15sp-5 Model
Ball grid array geometry 6.0 million DOF Static, nonlinear analysis Sparse direct solver
Distributed ANSYS Mechanical 16.0 with Intel Xeon E5-2697v2 2.7 GHz 8-core CPU; Tesla K80 GPU with boost clocks.
Customer Benefits
Deliver Outstanding Parallel Performance & Scaling at an Ever Increasing Scale of Parallelism
● A continuous software development focus on HPC enabling parallel improvements made release by release - also at R16.0.
● ANSYS solvers are highly optimized to run fast and deliver outstanding parallel scaling at an increasing scale of parallelism!
ANSYS Features & Capabilities
© 2015 ANSYS, Inc. 14
● Reduced time to solution of your current models by leveraging more cores.
● Be less constrained by hardware limitations because ‘bigger’ models can be sped up at your existing compute capacity.
● As HPC evolves into the future, ANSYS is the right choice to sustain the software investment that is required to stay ahead.
15 © 2015 ANSYS, Inc. June 11, 2015
Improved Parallel Performance & Scaling ANSYS Fluent 16.0
ANSYS Features & Capabilities
Improved parallel robustness: • Improved handling of ‘poor mesh’ numerics
o New cell quality-based criteria considering both orthogonal quality and equiangle skewness
• Parallel check: o Inspect system usage, interconnect speeds and
load balance
• Enhanced automatic handling of AMG divergence
• Cache flush o Automatically checks for cache file buildup and
warns users of potential performance impact
• Improved start-up and run-time robustness of compressible flow numerics o SIMPLEC, PISO o Multiphase flows o User-defined compressible gas/liquid
Example problematic VOF case that previously diverged, now shows smooth convergence with improved poor mesh
numerics treatment in R16.0
Mixture multiphase case showing significantly improved convergence with new numerics for compressible multiphase flow
16 © 2015 ANSYS, Inc. June 11, 2015
Improved Parallel Performance & Scaling ANSYS CFX 16.0
Solver Scalability R&D – Industrial Benchmark: • 6-Stage Axial Compressor • 13m nodes, 14 domains, 12 mixing planes o 256 cores approx. 50,000 nodes/core
ANSYS Application Example
Reduction in solver wall clock with default settings over recent releases
17 © 2015 ANSYS, Inc. June 11, 2015
Improved Parallel Performance & Scaling ANSYS Mechanical 16.0
Distributed ANSYS Enhancements: • Improved the domain decomposition step o Better load balancing for domains o Faster performance at higher core
counts • Added support for inertia relief (IRLF, 1) o Allows for improved performance by
using Distributed ANSYS
• Added support for mode-superposition method in harmonic and transient analyses o Includes both the reduced order calculations
as well as the expansion pass o Includes support for modal restart analyses
ANSYS Features & Capabilities
18 © 2015 ANSYS, Inc. June 11, 2015
Improved Parallel Performance & Scaling ANSYS Mechanical 15.0 vs. 16.0
ANSYS Application Examples
At Release 15.0
NOW: At Release 16.0
19 © 2015 ANSYS, Inc. June 11, 2015
Supporting “HPC Resources Anywhere” Engineering Knowledge Manager (EKM)
Users (Remote/Mobile)
Enterprise datacenter or cloud
Work in Process Data Management
Graphics Server
HPC Cluster Batch Jobs
ANSYS EKM Web/Mobile UI
Browser Access
Remote Display
Interactive Jobs
• Web & Mobile UI • Interactive Remote Display with
3D Graphics • Detach/Re-attach
• Knowledge Management • Best Practices • Search & Retrieve • Light Weight Visualization • Security & Access Controls
• Job Submission Templates • Remote Monitoring • Access to Interactive Controls • Support for Leading Schedulers
and Resource Managers
IP Protection Remote Job Management Mobility & Remote Access
Open Platform: Supports ANSYS and 3rd Party Applications
20 © 2015 ANSYS, Inc. June 11, 2015
Supporting “HPC Resources Anywhere” Engineering Knowledge Manager (EKM)
ANSYS EKM 16.0 is better than ever: • Improved HPC support
o Simplified job submission o Launch, monitor, track and control jobs from anywhere
• Faster file transfers • Built-in Application Launcher templates
o Launch and use applications remotely o Support MAPDL, Fluent, Electronics
• New, easy-to-use web front end o Access from many clients: Workbench, web browsers, smartphones, tablets o Access control & permissions at any level can be easily set up o Easier organization of data as per user requirements o 3D CAE visualization
ANSYS Features & Capabilities
HPC for Avoiding Expensive Design Changes While Increasing Operational IT Efficiency
Business Initiative: Avoid Costly Design Changes • Eliminate the use of thermal sensors on
prototype engines by computationally expensive simulations
• Because HPC was at maximum capacity, outsource simulation workload to the cloud
Customer Success Factors • Optimized the entire cooling and sealing
system design in the early stages of the product development process
• By outsourcing HPC, computing resources were elastically provisioned and released.
“Because of the near-linear scalability of Fluent, running the coupled fluid-structural simulation on an HPC cluster in the cloud was five times faster wall-clock time than running the problem on a local workstation."
Marius Swoboda Head of Design Systems Engineering Rolls-Royce Germany
Key Results • Engineers were able to expand or shrink HPC capacity as
needed, thus increasing their operational IT efficiency and better utilizing HPC resources
Reduced simulation time by 80%
Earlier design changes at lower costs
Gro
wth
Prof
it
HPC for Bringing New, Better Offshore Equipment Products Faster to Market
Business Initiative: Reducing Time to Market • Quickly develop a steerable conductor for
offshore drilling that can provide real-time accurate positioning
Customer Success Factors • Using ANSYS Mechanical, custom cylinders
costing $160,000 each could be replaced by off-the-shelf parts that cost only $7,000.
• ANSYS HPC was critical in meeting the delivery-time requirements of this project.
“Parallel processing makes it possible to evaluate five to 10 design iterations per day, enabling us to rapidly improve the design."
Ray Younger Managing Director Cognity Limited
Key Results • Cognity was able to complete the design in only five
months, approximately 70 percent less time than would have been required using conventional methods
Reduced design time by 70%
Cost reduction by 95% per part
Gro
wth
Prof
it
© 2015 ANSYS, Inc.
HPC for Enhanced Insight into Critical Scenarios and Complex Operations
Business Initiative: Increase Design Fidelity • Overtake technological challenges on flow
assurance and subsea oil processing present on the new pre-salt oil fields
• Better understand the sand transportation inside kilometers long production lines
Customer Success Factors • Intensive investments on the HPC
infrastructure expansion at CENPES are enabling lots of new simulation possibilities
• Physical insights can guide the design of upstream processing systems at oil industry.
“Petrobras relies on ANSYS software for its superior parallel scalability, together with advanced multiphase models and dynamic meshing."
Carlos Alberto Capela Moraes Technical Consultant CENPES (Petrobras R&D Center)
Key Results • Engineers were able to virtually reproduce critical
scenarios and complex operations of upstream processing systems in the oil industry.
Higher-fidelity insight in upstream processes Pr
ofit
© 2015 ANSYS, Inc.
HPC for Minimizing Solution Time While Enabling Faster Analysis of Potential Failure
Business Initiative: Save Maintenance Costs • Accurately predict if and when each part in
a particular jet engine needs to be replaced or serviced
• Improve safety by treating each engine based on its own unique history
Customer Success Factors • Use finer models to accurately capture life-
limiting locations • Use HPC to reduce solution time from
weeks to days, days to hours.
“With Volvo Aero’s LTS relying on ANSYS software, external customers’ service departments can replace and maintain parts only when necessary — and take that benefit right to the bank."
Magnus Andersson System Owner for Life Engine Volvo Aero
Key Results • External customers’ service departments can faster
evaluate the need for repairs and replace and maintain parts only when necessary
Reduced simulation time by 75%
More parts changed less frequently
Gro
wth
Prof
it
© 2015 ANSYS, Inc. 25
Realize Your Product Promise®
HPC Hardware Selection – Workstation
References http://www.hp.com/go/wsansys
ANSYS Fluent 16.0 and CFX 16.0 • HP Z420 Intel® Xeon® E5-1680 v2 8-Core 3.0/4.0, 64GB memory, SSD,
NVIDIA Quadro K600 • HP Z620 Intel® Xeon® E5-2667 v2 16-Core 3.3/4.0, 64GB memory,
SSD, NVIDIA Quadro K2000 • HP Z820 Intel ® Xeon® E5-2697 v2 24-Core 2.7/3.5, 64GB memory,
SSD, NVIDIA Quadro K2000
HP Workstation recommendations
NVIDIA GPU Compute Options
Z420 (600W) Z620 (800W) Z820 (1125W)
Tesla K40 Max 1 Max 2
Tesla K20c Max 1 Max 1 Max 2
Quadro K6000 Max 1 Max 1 Max 2
ANSYS Mechanical 16.0 • HP Z420 Intel® Xeon® E5-1660 v2 6-Core 3.7/4.0, 64GB
memory, SSD RAID0, Quadro K600 • HP Z620 Intel® Xeon® E5-2667 v2 16-Core 3.3/4.0, 96GB
memory, SSD RAID0, Quadro K2000 • HP Z820 Intel ® Xeon® E5-2687W v2 16-Core 3.4/4.0, 128GB
memory, SSD RAID0, Quadro K2000
HP Workstation recommendations
HP Workstation family ANSYS 16.0 Pre-Processing and Post-Processing
HP Workstation
Intel® Processor(s)
# CPU Cores
Max Memory GB
Graphics
HP Z1 G2 Xeon® E3-1200v3
Core™ i3, i5
4
2, 4
32 NVIDIA Quadro
HP Z230 Xeon® E3-1200v3
Core™ i5, i7
4
4, 4
32 AMD FirePro
NVIDIA Quadro
HP Z420 Xeon® E5-1600v2
Xeon® E5-2600v2
4, 6, 8
8
64 AMD FirePro
NVIDIA Quadro
HP Z620 Xeon® E5-1600v2
Xeon® E5-2600v2
4, 6, 8
4, 6, 8, 10, 12
96 AMD FirePro
NVIDIA Quadro
HP Z820 Xeon® E5-2600v2 4, 6, 8, 10, 12
512 AMD FirePro
NVIDIA Quadro
HP ZBook 14 Core™ ULT i5, i7 2, 2 16 AMD FirePro
HP ZBook 15, 17 Core™ M i5, i7 2, 4 32 NVIDIA Quadro