Trees, Scenario Generation
Download
Report
Transcript Trees, Scenario Generation
Scenario Generation
based on
Quantization
Moscow, September 2009
A. Pichler
Department of Statistics and
Decision Support Systems
Outline of this Talk
• Approximation of
Probability measures
• Quantization
• Trees, Scenario Generation
Comparison of Measures
Given 2 measures, how can we link them
and assign a single number to account
for distance?
P1 c P2
maximum
(comonotone)
P1 P2
product measure
P1 a P2
minimum
(antimonotone)
Earth Mover Distance
Distance for Measures
As the initial space obeys a distance
function, we thus may compute
d x, y dx, dy
d x, y dx, dy
p
for every marginal distribution.
A X PA
p
d P, Q : inf d x, y dx, dy :
X B QA
... is a metric, metrizing the
weak topology.
Summary
A X PA
p
d P, Q : inf d x, y dx, dy :
X B QA
r
cost function
Distribution with adapted
marginals
is a metric on measures,
metrizing the weak topology.
Approximation
• Proxy for Expectation reads
dP p q
qQ
q
• and convergence is to be understood in
the weak (*) sense,
Pn P : dPn dP
Metrizable
• The weak Topology is metrizable
Pn P : dPn dP
Pn P d Pn , P 0
• Elaborated since 18th century
G. Monge
Outline of this Talk
• Approximation of Probability
Measures
• Quantization
• Trees, Scenario Generation
Approximation
• Approximation via Dirac Measures
P p q q
qQ
1 q A
q A :
0 q A
• Well defined?
Pq pq
q Q
Discrete Situation
p
d
x, y dx, dy
original
distribution
P p q q
qQ
Approximation
Linking the Concepts
Questions:
1. Where to place qQ?
2. How to choose the weights
pq?
P
P p q q
qQ
Comparison of distances
Example: Approximation of Gaussian Distribution
3
3
1.0
1.0
0.8
0.8
0.6
0.6
0.4
0.4
0.2
0.2
2
3
n 1
-1
1
sup F x n 1, i F x
n
i 1
2
2
1
3
2
1
d x, y
1.0
1.0
0.8
0.8
0.6
0.6
0.4
0.4
0.2
0.2
d 2 x, y
1
2
3
3
2
1
1
2
3
1
2
3
d 10 x, y
Associated Region
1.0
0.8
1.2
1.0
0.6
0.8
0.4
0.6
0.4
0.2
0.2
3
2
1
1
2
3
3
-
2
-
1
-
1
2
min qQ d ( x, q) r
Associated with given sample points are the intervals
containing points being closest to the sample point.
3
Voronoi Diagram
Георгий Феодосьевич Вороной
Борис Николаевич Делоне
in 2 dimensions ...
2
5
1
3
7
6
4
13
8
9
14
10
12
15
11
16
(c) Mathematica
... and in 3 dimensions
Lemma
1
2
pq q
min qQ d x, q Pdx d r P, q
Q
r
Let
pq : P X q
then
3
r
.
with
X Xq
qQ
d r P, pq q d x, q Pdx
qQ
qQ X q
Let Xq be Voronoi and
pq : P X q , then
d r P, pq q min qQ d x, q Pdx
qQ
Solution to 1
Questions:
1. How to choose the weights pq?
2. Where to place qQ?
Answer Part 1:
• Compute the Voronoi
Tessellation Xq and set
r
• The even
pq : P X q
r
d r P, pq q min qQ d x, q Pdx
qQ
Towards Question 2
The problem of finding the best
supporting nodes (Q) now transforms
into the problem of minimizing the
functional f Q
f Q : min qQ d x, q Pdx
r
d r P, pq q
qQ
r
Lloyd Algorithm
f Q : min qQ d x, q Pdx
r
d r P, pq q
qQ
r
Lloyd Algorithm repeats 2 steps:
• for Q, compute Xq and
pq : P X q
• for Xq, compute
Q : arg min
d
x
,
q
P
dx
r
Xq
Example
Optimal
supporting
nodes and
Voronoi
Diagram for
the
Bivariate
Normal
Distribution
Selected Solutions
Optimal supporting nodes are known in a
few selected sitations only. They include
(r=1)
• Uniform distribution
2i 1
qi :
2n
• Exponential distribution
n2 n
qi : 2 log
n 1 i
Towards Question 2
Both solutions have the form (for a
function z)
2i 1
qi z
n
1.2
1.0
0.8
0.6
0.4
0.2
3
-
Differentiating gives (R, r= 1)
2
-
1
-
1
2
q q
0 0 2 F q1 F 1 2
2
q q
q qi 1
0 F i 1 i 2 F qi F i
2
2
F... the
cumulative
distribution
function,
q q
0 F n 1 n 2 F qn 1
2
f... the density
3
Asymptotic, motivation
• For n increasing, the difference
equation transforms into the differential
equation
r 1z" f z z'
z'
z
f
1
1 r
r 1
2
f ' z 0
f z const .
x dx f
1
1 r
x dx
again...
If P has density ...
f
dimension
... then the supporting
nodes have (asymptotic) ~ f
density ...
As a consequence, tails
get higher attention if r
is chosen big enough.
d
d r
Solution to 2
Questions:
1. How to choose the weights pq?
2. Where to place qQ?
Answer Part 2:
• Solve related equations, or
• use, for example, Lloyd Algorithm
• Asymptotically, compute quantiles of ~ f
d
d r
Example
Example: Approximation
of Gaussian Distribution
1.0
0.8
0.6
0.4
z ~ r 1 N 0,1
0.2
3
3
2
1
d x, y
1.0
1.0
0.8
0.8
0.6
0.6
0.4
0.4
0.2
0.2
d 2 x, y
1
2
1
2
3
3
2
1
1
2
3
1
2
3
d 10 x, y
Theorem (Zador):
The error of the best approximation
Vn,r P : inf
Q n
min d x, q Pdx
r
qQ
satisfies
d
lim n Vn,r P Qr 0,1 f
r
d
d
d
Qr 0,1 inf n Vn ,r 0,1
r
d
n
d
d r
d
d
d r
d
Note: this theorem links P to a simple uniform distribution
and discloses the rate of convergence.
The Constant...
... has the following bounds
1
Qr 0,1
d
r
d
d
B0,1d
d r
cube proxy
0.7
r
p
0.6
1 d
r
2 1 p
r
2
d
upper/ lower bound
0.5
d
r
p
0.4
0.3
0.2
0.1
(l2 Norm, r =1)
5
10
15
dimension d
20
25
30
Outline of this Talk
• Approximation of Probability
measures
• Quantization
• Trees, Scenario Generation
A tree is a Quantizer
A tree is a special quantizer on the multidimensional
space
T
X : X t
T
t 0
PT : F T 0,1
Xt-1
Xt
Xt+1
Assume, there exist stochastic kernels describing the
Pt 1 : X t Ft 1 0,1
transition
Theorem
Suppose that
d x , y
d P. | x , Q. | x Q dx
t
d P .| x , P .| y
t
t
then
t
t
t
d P ,Q d P ,Q e
t
t
0
0
t
t
T
T
Xt-1
Xt
Xt+1
Note: the statement gives exact bounds if 0 (independent).
Note: May be used to explicitely construct a tree (scenario)
with a distance given in advance.
Outlook 2010+
> Scenario Reduction
>> Römisch
> Include Decisions at stage t
>> Pflug
> Applications and numerous
implementations
>> Hochreiter
> Portfolio Optimization
>> Kovacevic
Thanks For Your Attention.