PennState-jun06-unfolding
Download
Report
Transcript PennState-jun06-unfolding
Spectrum
Reconstruction of
Atmospheric
Neutrinos with
Unfolding Techniques
Juande Zornoza
UW Madison
Introduction
We will review different approaches for
the reconstruction of the energy
spectrum of atmospheric neutrinos:
Blobel / Singular Value Decomposition
actually, both methods are basically the same,
with only differences in issues not directly related
with the unfolding but with the regularization and
so on
Iterative Method based on Bayes’ theorem
Spectrum reconstruction
In principle, energy spectra can be obtained using the
reconstructed energy of each event.
However, this is not
efficient in our case
because of the
combination of two factors:
Fast decrease (power-law) of
the flux.
Large fluctuations in the
energy deposition.
For this reason, an alternative way has to be used:
unfolding techniques.
Spectrum unfolding
•Quantity to obtain: y, which follows pdf→ftrue(y)
•Measured quantity: b, which follows pdf→fmeas(b)
•Both are related by the Fredholm integral equation of first kind:
f meas (b) R(b | y) ftrue ( y)dy
Matrix notation
Ây b
•The response matrix takes into account three factors:
-Limited acceptance
-Finite resolution
-Transformation
•The response matrix inversion gives useless solutions, due to the
effect of statistical fluctuations
Dealing with instabilities
Regularization:
Solution with minimum curvature
Principle of maximum entropy
Iterative procedure, leading
asymptotically to the unfolded distribution
Single Value Decomposition1
The response matrix is decomposed as:
Aˆ USV T
U, V: orthogonal matrices
S: non-negative diagonal matrix (si, singular values)
This can be also seen as a minimization
problem:
2
ˆ
Aij xi bi min
i 1 j 1
nb
nx
Or, introducing the covariance matrix to take
into account errors:
ˆAx b T B 1 Aˆ x b min
1
A. Hoecker, Nucl. Inst. Meth. in Phys. Res. A 372:469 (1996)
SVD: normalization
Actually, it is convenient to normalize the
unknowns
Ây b
w j y j / y ini j
nx
A w
j 1
ij
j
bi
Aij contains the number of events, not the
probability.
Advantages:
the vector w should be smooth, with small bin-to-bin
variations
avoid weighting too much the cases of 100%
probability when only one event is in the bin
SVD: Rescaling
Rotation of matrices:
B QRQ
T
1
Aij Qim Amj
ri m
bi
1
Qimbm
ri m
allows to rewrite the system with a
covariance matrix equal to I, more
convenient to work with:
~T ~
~
~
( Aw b ) ( Aw b ) min
Regularization
Several methods have been proposed for the
regularization. The most common is to add a
curvature term add a curvature term
~T ~
~
~
( Aw b ) ( Aw b ) (Cw)T Cw min
1
1
C 0
1
2
1
0
1
1
1
0
2
1
0
1
1
Other option: principle of maximum entropy
Regularization
We have transformed the problem in the
optimization of the value of , which tunes how
much regularization we include:
too large: physical information lost
too small: statistical fluctuations spoil the result
In order to optimize the
value of :
Evaluation using MC
information
Maximum curvature of
the L-curve
T
Components of vector d U b
L-curve
Solution to the system
Actually, the solution to the system with
the curvature term can be expressed as
a function of the solution without
curvature:
ny
w'
i 1
di
f i vi
si
A ' AC 1
w ' Cw
where
d
( )
i
si2
di 2
si
2
1
if
s
s
i
fi 2
2
si si / if si2
2
i
(Tikhonov factors)
Tikhonov factors
The non-zero tau is equivalent to change di by
1 if si2
si2
fi 2
2
si si / if si2
And this allows to find a criteria to find a good tau
fun0
fun1
fun2
Components of d
k
= sk2
Differences between SVD and Blobel
More simplified implementation
Possibility of different number of bins in y
and b (non square A)
Different curvature term
Selection of optimum tau
B-splines used in the standard Blobel
implementation
Bayesian Iterative
2
Method
•If there are several causes (Ei) which can produce an effect Xj and we
know the initial probability of the causes P(Ei), the conditional probability of
the cause to be Ei when X is observed is:
P( Ei | X j )
smearing matrix: MC
P( X j | Ei ) P0 ( Ei )
P( X
l 1
j
| El ) P0 ( El )
prior guess:
iterative
approach
•The expected number of events to be assigned to each of the
nX
causes is:
nˆ ( Ei ) n( X j ) P( Ei | X j )
j 1
experimental data (simulated)
•The dependence on the initial probability P0(Ei) can be overcome by
an iterative process.
nˆ ( Ei )
ˆ
P( Ei ) nE
nˆ( Ei )
i 1
2
G. D'Agostini NIM A362(1995) 487-498
Iterative algorithm
1. Choose the initial distribution P0(E). For instance, a good guess
could be the atmospheric flux (without either prompt neutrinos or
signal).
2. Calculate nˆ ( E ) and Pˆ ( E ).
3. Compare nˆ ( E ) to n0 ( E ).
4. Replace P0 ( E ) by Pˆ ( E ) and n0 ( E ) by nˆ ( E ).
5. Go to step 2.
P(Xj|Ei)
Smearing matrix (MC)
no(E)
Initial
guess
P(Ei|Xj)
Reconsructed
spectrum
n(Ei)
Po(E)
n(Xj)
Experimental data
P(Ei)
For IceCube
Several parameters can be investigated:
Number of channels
Number of NPEs
Reconstructed energy
Neural network output…
With IceCube, we will have much better
statistics than with AMANDA
But first, reconstruction with 9 strings will be
the priority
Remarks
First, a good agreement between data and MC
is necessary
Different unfolding methods will be compared
(several internal parameters to tune in each
method)
Several regularization techniques are also
available in the literature
Also an investigation on the best variable for
unfolding has to be done
Maybe several variables can be used in a
multi-D analysis
This is the end
B-splines
Spline: piecewise continuous
and differentiable function that
connects two neighbor points
by a cubic polynomial:
from H. Greene PhD.
B-spline: spline functions can be expressed by a
finite superposition of base functions (B-spilines).
(first order)
(higher orders)
Maximum entropy
In general, we want to minimise ln L S
S can be the spikeness of the solution, or, the
entropy of the system: S P ln P
Probability of any event of being in bin i is fi/N.
N
i 1
i
i
Then, following the maximum entropy principle,
we will minimize:
ln L fi / N ln( f i / N )
i
Useful for sharp gradients, i.e. when the
solution it is not expected to be very smooth