MonteCarlo Method Course - Computer Science Department

Download Report

Transcript MonteCarlo Method Course - Computer Science Department

Monte Carlo Methods and Grid
Computing
Yaohang Li
Department of Computer Science
North Carolina A&T State University
Textbook

Monte Carlo Methods
– By J. M. Hammersley and D. C. Handscomb

Markov Chain Monte Carlo in Practice
– By Gilks, Richardson, and Spiegelhalter
Course Goals

Let the students master the theory of the Monte
Carlo methods in Computational Science
 Enable the students to apply the Monte Carlo
methods to simulate and solve real-life problems
 Allow the students to use the parallel and
distributed platform such as the grid to perform
large-scale Monte Carlo simulation
Course Design

Methods
– Basic Monte Carlo Methods
– Analysis of Monte Carlo Methods

Simulation Applications
–
–
–
–
–


Biology
Nuclear Physics
Materials
Engineering
Chemistry
Applications on the Grid
Course Projects
– Understand the theory behind Monte Carlo methods
– Apply Monte Carlo methods to solve real-life problems
– Using the Grid as the computational platform

Term Papers
– Survey of the development of Monte Carlo methods
Course Outline

Monte Carlo Methods

– Nuclear Physics
– Radiation Shielding and
– The General Nature of Monte Carlo
Methods
– Short Resume of Statistical Terms
– Random Number Generation


–
Pseudorandom Number Generation
Quasirandom Number Generation
–
– General Principles of the Monte
–
–
–
–
Carlo Methods
Conditional Monte Carlo
Direct Simulation
Solution of Linear Operator
Equations
Metropolis Method
Simulation Applications
–
–

Reactor Criticality
Polymer Science
– Long Polymer Molecules
Statistical Mechanics
– Markov Chain
Computational Biology
– Protein Folding
– Protein Docking
Material Science
– Ab initio Monte Carlo
Grid Computing
– Characteristics of Grid Computing
– Grid-based Monte Carlo
Applications
Introduction to Monte Carlo Methods

What is Monte Carlo method?
– Experimental mathematics


Experiments on random numbers
General Nature of Monte Carlo Methods
– Understand sophisticated problems
– Extensive use in Simulation






Operational research
Nuclear physics
Chemistry
Biology
Medicine
Engineering
– According to word of mouth, about 50% of CPU time of
Department of Energy is spent on Monte Carlo
Computation
Short Resume of Statistical Terms

Recall the basic terms and theories in statistics and
probability
– Random events and probability
– Random variables


Discrete random variable
Continuous random variable
– Distribution



Probability density function
Cumulative distribution function
Various Distributions
–
–
–
–
–
Uniform distribution
Rectangular distribution
Binomial distribution
Poisson distribution
Normal Distribution
– Expectation
Random Number Generation (I)

Generating high-quality random numbers is critical in Monte Carlo
applications
– How random numbers will affect the accuracy of Monte Carlo applications

Generating Pseudorandom Numbers
– Requirement of Pseudorandom Numbers
 Randomness
 Reproducibility
 Efficiency
 Uniformity
– Pseudorandom Number Generator
 LCG
 LFG
 ICG
– Test of Pseudorandom Number Generator

Quasirandom Number Generator
– What are the advantages of quasirandom number generator?
– Types of Quasirandom Number Generators
 Van der Corput Generator
 Halton Generator
 Faure Generator
 Sobol’ Generator
Random Number Generator (II)

Sampling from non-rectangular distributions
– Using the inverse function
– Using the acceptance-rejection method

Class Project 1
– Build a Pseudorandom Number Generator
– Test the Randomness of this Random Number
Generator
– Generating Gaussian Samples using the Pseudorandom
Number Generator
– Build a Quasirandom Number Generator
Direct Simulation

Direct Simulation Methods
– Base on a probabilistic problem
– Build a Monte Carlo model
– Simple general structure

Miscellaneous examples of Direct Simulations
– Operational research problem

Comparative Simulation
 Course Project II
– Develop a Buffon’s Needle Applet to calculate 
General Principles of the
Monte Carlo Method

Crude Monte Carlo Method
– Integral Estimation
 Expectation and variance
 Convergence rate O(cN-1)
 Error estimation
 Avoiding the curse of dimentionality
– Compare with the deterministic numerical
methods
– Compare with hit-or-miss Monte Carlo
Variance Reduction Techniques

Theory Behind Variance Reduction Techniques
– Reduce the constant c of the convergence rate O(cN-1)
of crude Monte Carlo

Variance Reduction Techniques
– Importance Sampling
– Control Variates
– Antithetic Variates
– Regression Methods
– Use of Orthonormal Functions
Course Project III

Develop a program using Crude Monte
Carlo to estimate an integral
ex 1
0 e  1 dx
1

Compare with deterministic methods as
dimension increases
 Apply a variance reduction technique to
compare the convergence rate
Conditional Monte Carlo

Theory of Conditional Monte Carlo
– A special case of the foregoing theory
– Conditional Monte Carlo
– Compare Conditional Monte Carlo with direct
simulation
Solution of Linear Operation Equation

Simultaneous Linear Equations
– x = a + Hx
– Condition of convergence
– von Neumann and Ulam’s random walk method





Sequential Monte Carlo
Fredholm integral equations of the second kind
The Dirichlet Problem
Eigenvalue problems
Course Project IV
– Use the random walk method to develop a program to
simulate the Google search for finding the eigenvector
corresponding to largest eigenvalue
Radiation Shielding and Reactor Criticality

The Shielding Problem
– When a thick shield of absorbing material is exposed to
-radiation (photons), of specified energy and angle of
incidence, what is the intesity and energy-distribution
of the radiation that penetrates the shield?
– Special Handling

The Criticality Problem
– When a pulse of neutrons is injected into a reactor
assembly, will it cause a multiplying chain reaction or
will it be absorbed, and, in particular, what is the size of
the assembly at which the reaction is just able to sustain
itself?
– Eigenvalue Problem

Matrix Method
Problems in Statistical Mechanics

Markov Chains
– Definition of Markov Chains
– Characteristics of Markov Chains
– Categorizing Markov Chains

Problems in equilibrium statistical mechanics
– Thermal equilibrium
– Statistical mechanics system
– Order-disorder phenomena
– Quantum statistics
Exploring the Equilibrium States

Global Minimum and Local Minimum
 Equilibrium States
 Markov Chain Monte Carlo (MCMC)
–
–
–
–
Metropolis-Hastings Method
Simulated Annealing
Simulated Tempering
Accelerated Simulated Tempering (AST)

Developed by Yaohang Li et al.
– Parallel Tempering
– Accelerated Parallel Tempering (APT)

Developed by Yaohang Li et al.
Course Project V





Develop a Program using Metropolis-Hastings
Method to Explore a rough 1-dimensional energy
landscape
Extend the program using a simulated annealing
method
Extend the program using a simulated tempering
method
Analyze the algorithms when dimension increases
Analyze the algorithms when the landscape
becomes rougher
Long Polymer Molecules

Self-avoiding Random Walks
– Walks on a mesh
– Walks in continuous space

Crude Sampling
 Generation of very long walks
 Recent development
– Protein Folding and Docking
Monte Carlo Method in
Material Science

Monte Carlo Process resembles a Physical
Process
– Ab initio Monte Carlo
– Ising Model
– Complex System
High Performance Monte
Carlo Simulation

Characteristics of Monte Carlo Simulation
 Parallel Random Number Generation
 Natural Parallel Monte Carlo Simulation
 High Performance Monte Carlo Simulation
Monte Carlo Applications
and Grid Computing

Grid Computing
– Widely distributed
– Large-scale resource sharing
– Issues


Performance
Trustworthiness

Monte Carlo Applications
– Approximate solutions to a
variety of mathematical
problems

Performing statistical
sampling experiments
– Slow convergence rate,
approximately O(N-1/2)
– Computationally intensive
and naturally parallel
Grid-based Monte Carlo
Applications

Issues in Grid Computing
– From application point of view



Performance
Trustworthiness
Our Approaches
– Address issues from application level
– Analyze characteristics of Monte Carlo applications


Statistical nature
Cryptographic aspect of underlying random number generator
– Develop strategies and tools
Techniques for Grid-based
Monte Carlo Applications

Improve Performance
– N-out-of-M Strategy
– Bio-inspired Scheduling
– Lightweight Checkpoint

Improve Trustworthiness
– Partial Result Validation Scheme
– Intermediate Value Checking
Conclusion
Stimulate the students’ interest in computational
science
 Allow the students to master the methods and
techniques of high performance statistical
simulation
 Develop interdisciplinary research and educational
areas
 Encourage the students to apply the computational
techniques to real-life applications
