High Productivity Computing with Windows slides

Download Report

Transcript High Productivity Computing with Windows slides

High Productivity Computing
With Windows
Windows HPC Server 2008
Neil Foster
HPC Partner Mgr
Microsoft
Agenda
High Productivity for HPC
Overview Windows HPC Server 2008
Partnerships
Discussion
The purpose if computing is?
‘The purpose of
computing is insight
not numbers.’
Richard Hamming
HPC Drivers
Competitive Advantages
Pressure to improve operational
performance (cost, quality and time
to market)
Quality driven
regulatory compliance
Rapid cycles of product
innovation
The Challenge: High Productivity Computing
High integration pain
•
•
Lack of seamless integration
between workstations, clusters, data
Lack of user workflow integration
across applications and departments
Isolated technology islands
•
•
•
High manual touch
Lack of end-to-end IT process
integration
Cannot leverage existing
investments in broad IT skills and
infrastructure
“Make high-end computing
easier and more productive to
use. Emphasis should be
placed on time to solution, the
major metric of value to highend computing users… A
common software environment
for scientific computation
encompassing desktop to highend systems will enhance
productivity gains by
promoting ease of use and
manageability of systems.”
Application availability
•
•
Limited eco-system of parallel
applications
Lack of developer-friendly tools,
difficult to program
High-End Computing Revitalization Task Force, 2004
(Office of Science and Technology Policy,
Executive Office of the President))
Changing face of HPC
Costs and pain points have moved:
• Manpower more expensive than hardware
• Software is more expensive than hardware
• Every system will be multi-core
• Power, Cooling, facilities much more expensive than
hardware!
• NSF Grants only cover hardware costs
Why Microsoft in HPC?
Current Issues
 HPC and IT data centers merging: isolated cluster
management
 Developers can’t easily program for parallelism
 Users don’t have broad access to the increase in processing
cores and data
How can Microsoft help?
 Well positioned to mainstream integration of application
parallelism
 Have already begun to enable parallelism broadly to the
developer community
 Can expand the value of HPC by integrating productivity and
management tools
Microsoft Investments in HPC
Comprehensive software portfolio: Client, Server,
Management, Development, and Collaboration
Dedicated teams focused on Cluster Computing
Unified Parallel development through the Parallel Computing
Initiative
Partnerships with the Technical Computing Institutes
Microsoft’s Vision for HPC
“Provide the platform, tools and broad ecosystem to reduce the complexity of HPC by
making parallelism more accessible to address future computational needs.”
Reduced Complexity
Mainstream HPC
Developer Ecosystem
Ease deployment for
larger scale clusters
Enable non-technical users to
harness the power of HPC
Increase number of parallel
applications and codes
Simplify management for
clusters of all scale
Address emerging
cross-industry
computation trends
Offer choice of parallel
development tools,
languages and libraries
Integrate with
existing infrastructure
Address needs of traditional
supercomputing
Drive larger universe of
developers and ISVs
Integrated HPC Envirnment
Clients/Job Submission
Development Tools
Administration
Visual Studio: C#,
C++, WCF, OpenMP,
MPI, MPI.NET
Trace Analysis
Batch Applications
SharePoint
Profiling
Windows® HPC Server 2008
Administration Console:
WCF Applications
Excel
Numerical Libraries
MPI Debugging
System, Scheduling, Networking,
Imaging, Diagnostics
CCS Job Console
Windows Workflow
Foundation
Fortran
MPI Tracing
CCS Scripts
Windows Powershell
System Center
Operations Manager
Existing Cluster
Infrastructure
Windows® HPC Server 2008
Job Submission
APIs
WCF Router
UNIX/Linux
System
Job Scheduler w/ Failover
Administration APIs
HPC
Profile
System Center
Data Protection Manager
Compute Nodes
Node Manager
Applications:
WCF, C#, C++, Fortran
New TCP/IP
MPI w/Network Direct
System Center
Configuration Manager
Windows Server
Update Services
Software Protection Services
3rd Party Systems
Management Utilities
Business Intelligence
Storage
Storage
Storage
Key
Partner
Microsoft
HPC Server 2008
Parallel/Clustered
Storage
SQL Structured
Storage
Windows Storage
Server with DFS
SQL Server
Integration
Services
SQL Server
Analysis/
Reporting
What’s New in the HPC Pack 2008
 New System Center UI
 PowerShell for CLI Management
High Availability for Head Nodes
Windows Deployment Services
Diagnostics/Reporting
Support for Operations Manager
 Support for SOA and WCF
 Granular resource scheduling
 Improved scalability for larger
clusters
 New Job scheduling policies
Interoperability via HPC Profile
Systems
Management
Networking
& MPI
 NetworkDirect (RDMA) for MPI
 Improved Network
Configuration Wizard
 Shared Memory MS-MPI for
multi-core
 MS-MPI integrated with
Windows Event Tracing
Job
Scheduling
Storage
 Improved iSCI SAN Support in
Win2008
 Improved Server Message
Block ( SMB v2)
 New 3rd party parallel system
file support for Windows
 New Memory Cache Vendors
Head Node High Availability
• Eliminates single point of failure with support for high availability
• Requires Windows Server 2008 Enterprise Failover Clustering Services
– Next generation of cluster services
– Major improvement in
configuration validation and
management
• HPC Pack Includes
– Setup integration with Failover
Clustering Services
• Head Node and Failover Node set
up with SQL Failover Cluster
• Job Scheduler services failover
– Management console linked to
Windows Server Failover
Management console
Private
Network
Windows
Failover
Clustered
Head node
Win2008 Enterprise
Clustered SQL
Server
Failover Head node
Win2008 Enterprise
Clustered SQL
Server
Shared
Disk
NetworkDirect
A new RDMA networking interface built for speed and stability
Priorities
– Comparable with hardware-optimized MPI
stacks
•
Socket-Based
App
Focus on MPI-Only Solution for version 2
MS-MPI
– Verbs-based design for close fit with
native, high-perf networking interfaces
– Coordinated w/ Win Networking team’s
long-term plans
•
Implementation
– MS-MPIv2 capable of 4 networking paths:
•
•
•
•
MPI App
Shared Memory
between processors on a motherboard
TCP/IP Stack (“normal” Ethernet)
Winsock Direct (and SDP)
for sockets-based RDMA
New RDMA networking interface
– HPC team partners with networking IHVs
to develop/distribute drivers for this new
interface
Windows Sockets
(Winsock + WSD)
RDMA
Networking
Networking
Networking
WinSock
Direct
Hardware
Hardware
Provider
Networking
Networking
NetworkDirect
Hardware
Hardware
Provider
Networking Hardware
Hardware
Networking
User
Mode Access Layer
TCP/Ethernet
Networking
TCP
Kernel By-Pass
•
IP
NDIS
Networking
Networking
Mini-port
Hardware
Hardware
Driver
Networking Hardware
Hardware
Networking
Hardware Driver
Networking Hardware
Hardware
Networking
Networking
Hardware
(ISV) App
CCP
Component
OS
Component
IHV
Component
User
Mode
Kernel
Mode
Job Scheduling
•
Support for larger clusters
– Create new designs for clusters of size,
including
“heterogeneous” clusters
– Scale deployment and administration
technologies
– Provide interfaces for those accustomed
to *nix
•
Improve interoperability with existing
IT infrastructure
– Interoperability with existing job
schedulers
– High speed file I/O through native support
for parallel and clustered file systems
•
Broader application support
– Simplify the integration of new
applications with the job scheduler
– Addressing needs of in-house and open
source developers
•
Platform Support
– Built for Windows Server 2008
– Cluster nodes with different hardware /
software
Scenario: Broaden Application Support
V1 (focusing on batch jobs)
V2 (focusing on Interactive jobs)
Engineering
Applications
Oil & Gas
Applications
Life Science
Applications
Financial Services
Excel
Structural Analysis
Crash Simulation
Reservoir simulation
Seismic Processing
Structural Analysis
Crash Simulation
Portfolio analysis
Risk analysis
Compliance
Actual
Pricing
Modeling
Job Scheduler
App.exe
App.exe
Your applications
here
WCF Service Router
+
Resource allocation
Process Launching
Resource usage tracking
Integrated MPI execution
Integrated Security
App.exe
Interactive
Cluster
Applications
App.exe
WS Virtual Endpoint Reference
Request load balancing
Integrated Service activation
Service life time management
Integrated WCF Tracing
Service
(DLL)
Service
(DLL)
Service
(DLL)
Service
(DLL)
Interoperability & Open Grid Forum
What is it?
What is its value?
What’s the Status?
•A draft OGSA (Open Grid Services
Architectures) interoperability standard
for batch job scheduler task submission
and management
•Based on web services standards (HTTP,
XML, SOAP)
•Enables integration of HPC applications
executing on different platforms and
schedulers via web services standards
•Passed the public comment period
•Working on new extensions
LSF / PBS / SGE / Condor
Linux, AIX, Solaris
HPUX, Windows
Windows Cluster
Windows Center
Window Center
Spring 2008, NCSA, #23
9472 cores, 68.5 TF, 77.7%
Spring 2008, Umea, #40
5376 cores, 46 TF, 85.5%
Spring 2008, Aachen, #100
2096 cores, 18.8 TF, 76.5%
Fall 2007, Microsoft, #116
2048 cores, 11.8 TF, 77.1%
30% efficiency
improvement
Windows HPC Server 2008
Spring 2007, Microsoft, #106
2048 cores, 9 TF, 58.8%
Windows Compute Cluster 2003
Spring 2006, NCSA, #130
896 cores, 4.1 TF
Winter 2005, Microsoft
4 procs, 9.46 GFlops
Customers
“Ferrari is always looking for the most advanced technological solutions and, of course, the
same applies for software and engineering. To achieve industry leading power-to-weight
ratios, reduction in gear change times, and revolutionary aerodynamics, we can rely on
Windows HPC Server 2008. It provides a fast, familiar, high performance
computing platform for our users, engineers and administrators.”
-- Antonio Calabrese, Responsabile Sistemi Informativi (Head of Information Systems), Ferrari
“The Umeå cluster with Windows HPC Server 2008 performed well enough to
become the fastest academic system in Sweden. We are very happy with that result.”
-- Bo Kågström, Professor and Director,
High Performance Computing Center North Umeå University
“Financial analysts in Europe mainly use Windows systems. As such, the deployment of
a Windows HPC Server 2008 cluster renders our HPC services extremely
attractive to a large potential user base.”
-- Dr. M. Rosati, Manager of the Computational Materials Science and Finance Group, CASPUR
“We are really impressed with many of the new features of Windows HPC
Server 2008. Microsoft is a pretty young player in the HPC market, but this is already
a very solid product.”
-- Christian Terboven, Project Lead for HPC on Windows, Center for Computing and
Communication, RWTH Aachen University
Parallel Programming
•
Available Now
–
–
•
Emerging Technologies – Parallel Framework
–
–
•
Development and Parallel debugging in Visual Studio
3rd party Compilers, Debuggers, Runtimes etc.. available
LINQ/PLINQ – natural OO language for SQL queries in .NET
C# Futures – way to explicitly make loops parallel
For the future: Parallel Computing Initiative (PCI)
–
–
Triple investment with a new engineering team
Focused on common tools for developing multi-core codes from desktops to clusters
Compilers
• Visual Studio
• Intel C++
• Gcc
• PGI Fortran
• Intel Fortran
• Absoft Fortran
• Fujitsu
Profilers and Tracers
• PerfMon
• ETW (for MS-MPI)
• VSPerf /VSCover
• CLRProfiler
• Vampir (Being ported to
Windows)
• Intel
Collector/Analyzer(Runs
on CCS w Intel MPI)
• Vtune & CodeAnalyst
• Marmot (Being ported to
Windows)
• MPI Lint++
Debuggers
• Visual Studio
• WinDbg
• DDT
Runtimes and Libraries
• MPI
• OpenMP
• C# Futures
• MPI.C++ and MPI.Net
• PLINQ
Microsoft approach to HPC
Best of breed software and Partners:
• Partner with every hardware vendor
• Partner with every software vendor (even Novell!)
• One integrated platform (Windows)
• Stick to our competencies: OS, management,
development, tools, user interface, etc.
Cross Microsoft Effort
“Here and Now”
Technologies
Emerging Technologies
Research
Product Teams
Windows HPC Server
SQL
Office
Visual Studio
Microsoft Research
eScience
External Research &
Programs
External Research Office
Industry Focused Solutions
Academia Aerospace Automotive
Financial
Services
Geo
Life
Services Government Sciences
Company Introduction
Cluster Resources (CRI) are based in Utah, USA and Cambridge, UK
CRI’s core product is Moab
Moab’s pedigree goes back more than 10 years to the Maui scheduler
CRI develop, maintain and support 2 open source products:
TORQUE compute resource manager
GOLD resource allocation and accounting suite
Best efforts support for SLURM
Moab is widely installed on the TOP 500 systems, and is the scheduler for the
worlds first petaflop system at LANL
ACADEMIC INSTALLATIONS OF MOAB
•
•
•
•
•
•
•
•
Cambridge
Cardiff
Birmingham
Bristol
UCL
UCD
St. Andrews
ICHEC
Adaptive/Dynamic: Windows/Linux Cluster
Definition: Moab is able to dynamically
monitor & then adjust the operating
system or other environmental factors
to meet the needs of current and
upcoming workload. Moab can
manipulate, grow and shrink the
allocated resources in order to meet
QoS targets.
Examples of Dynamic Adaptations:
• Operating Systems
• Services
• Network / Bandwidth
• Application Resources
• Storage Space
• Etc.
Moab
Linux
RM
Linux
Workload
Windows Workload
Windows
RM
Upcoming Workload
Linux
Windows
Note: RM is responsible for
monitoring and job execution.
Note: Moab's Dynamic
Adaptation capability is
based on its abstracted
workload concept and its
ability to import data from
external resource managers.
Allinea Software
• Offers next generation tools for parallel application
development
– Traditionally for clusters, SMPs and MPPs
– Focus on usability and scalability
– Cross platform with Windows and Linux / UNIX
• Addressing future requirements
– Growth in processors / cores ...
– Growth in parallel and distributed programming ...
• Application of this technology
– Experience from HPC to the desktop
– Experience of embedded applications
High Profile Clients (extract)
• National research
centres
– AWE, BSC, CASPUR, CEA,
CINECA, HLRS, ICHEC,
IDRIS, LLNL, ONERA,
PROUDMAN, RAL, ….
• Universities
– Bristol, Dresden, Edinburgh,
HLRS, IPGP, Jülich,
Karlsruhe, Leicester, LRZ,
North West Grid, Oxford,
PenState, Nottingham,
Sharcnet, TACC, Tokyo,
UFA, Vanderbilt, etc.
• Aerospace research
– CIRA, EADS CCR, DLR,
MBDA, etc.
• Commercial research
– Airbus, AVL, CGGVeritas,
Fujitsu (Japan & UK), IFP,
MTEM, OHM, Total, etc.
DDTLite Plugin for Visual Studio®
• New product from Allinea
– Simplifies Parallel development on Microsoft®
platforms
– Bringing popular features from DDT to Visual
Studio®
– Makes easy path from Linux/Unix to
Microsoft® world
– Available Q4 2007
Resources
• Microsoft HPC Web site – download Beta 1 Today!
– http://www.microsoft.com/hpc
• Windows HPC Community site
– http://www.windowshpc.net
• Windows Server x64 information
– http://www.microsoft.com/x64/
• Windows Server System information
– http://www.microsoft.com/windowsserver
• Get the Facts Web site
– http://www.microsoft.com/getthefacts
© 2008 Microsoft Corporation. All rights reserved.
This presentation is for informational purposes only. Microsoft makes no warranties, express or implied, in this summary.