technology_module1 - NDSU Computer Science
Download
Report
Transcript technology_module1 - NDSU Computer Science
HPC Technology Track:
Foundations of Computational Science
Lecture 1
Dr. Greg Wettstein, Ph.D.
Research Support Group Leader
Division of Information Technology
Adjunct Professor
Department of Computer Science
North Dakota State University
What is Computational Science?
Application of computational methods to
fundamental problems in science and engineering.
Concerned with application of computational
methods to one of three objectives.
Modeling
Simulation
Prediction
Modeling
Involves development of a mathematical model
capable of predicting physical phenomenon.
weather prediction
molecular energy and force fields
properties of materials
Typically involves solving systems of differential
equations expressed in linear form.
Challenges in Modeling
Currently only order 0 systems can be solved
directly.
Computational approaches are frequently
implemented with methods involving truncated
series.
Garbage In/Garbage out (GIGO)
Limitations constantly demand attention to model
parameterization and viability.
The field considers better differential equation
solvers to be essential.
Emerging Opportunities in Modeling
Model parameter evaluation.
Considered important by the national laboratories
Involves computation of first and second
derivatives of the model with respect to finalized
model parameters
Important focus is to determine quality or stability of
model.
** Model evaluation is critical.
Multi-Scale Studies
Defn: Calculation of system behavior or properties
on one level using information from subordinate
levels.
Continuum of levels (physical systems):
quantum mechanical
molecular dynamics
meso or nanoscale levels
level of continuum
level of device
Simulation
Focuses on simulating the behavior of physical
systems.
Usually involves Monte Carlo methods to solve
stochastic systems.
Most commonly employed in computational
physics.
Simulation – con't.
Central to the 'birth' of computational science.
Metropolis, Rosenbluth, Rosenbluth, Teller and
Teller
“Equation of State Calculations by Fast Computing
Machines”
Goal is to develop 'ensembles' or collections of
parameters.
Typically implemented as 'coarse grained'
parallelism.
Prediction
The analysis or 'mining' of large sets of data for the
purpose of predicting future phenomenon.
Centrally important to marketing and e-commerce.
Represents a type of computational problem referred
to as 'embarassingly' parallel.
Most famous example is NetFlix competition.
Challenges in Prediction
Data locality
More processors does not equal more speed.
NetFlix competition demonstrated inadequacy of
improperly 'balanced' computational architectures.
Primary concern of national labs involved in security
based computation.
Current HPC architectures exacerbate data locality
problems.
'The Wettstein Rule of
Computational Reality'
“If filling a cache line is too slow you will
be really unhappy doing a cross-node
lookup to a machine 200 racks away.”
RoadRunner Configuration
1.72 petaflops peak / 1.456 petaflops demonstrated
296 racks covering 6,000 square feet
Massively parallel – hybrid architecture
6,480 Opteron (x86) processors
12,960 IBM PowerXCell processors
122,400 cores
103.6 terrabytes of memory
But when is it fast?
A Tradeoff
Compelling speeds when each node can work on a
discrete element of the problem.
Embarrassingly parallel problems.
Strictly orthogonal decomposition
MIMD
Less efficient when.
Boundary condition dependent problem.
Access to entire memory space is required.
Latency
The Enemy of Prediction
Latency definition:
The amount of time required to retrieve the next
relevant item of data required in a computational or
predictive sequence.
'Achilles Heel' of modern massively parallel systems
such as RoadRunner.
Common problem since the design of the Cray-1.
Wiring optimized to place time critical connections
on the inner portion of the computer.
Reducing Latencies through PTree's
Current area of research interest.
Addition of second order PTree's to optimize data
selection decisions.
Minimizes:
cache line flushing
cross-node data lookups
Exercise
Log into cluster1.chpc.ndsu.nodak.edu.
Use sinfo command to locate an available node.
Use ping command to measure message latency over
standard TCP/IP network.
e.g. node64-49
ping -c 5 node64-49
Use ping command to measure message latency over
Myrinet:
ping -c 5 node64M-49
Exercise – con't.
Bottom of ping command details min, average and
maximum communication latencies.
Compute expected performance change if a
computation is constrained by the length of time
required to pass a message from one node to another.
goto technology_lecture1_2;