WISSystematics18Dec08 - University of Toronto, Particle Physics
Download
Report
Transcript WISSystematics18Dec08 - University of Toronto, Particle Physics
Systematic Uncertainties:
Principle and Practice
Outline
1. Introduction to Systematic Uncertainties
2. Taxonomy and Case Studies
3. Issues Around Systematics
4. The Statistics of Systematics
5. Summary
Pekka K. Sinervo,F.R.S.C.
Rosi & Max Varon Visiting Professor
Weizmann Institute of Science
&
Department of Physics
University of Toronto
18 Dec 08
Weizmann Institute of Science
1
Introduction
Systematic uncertainties play key role in physics
measurements
– Few formal definitions exist, much “oral tradition”
– “Know” they are different from statistical uncertainties
Random Uncertainties
Arise from stochastic
fluctuations
Uncorrelated with previous
measurements
Well-developed theory
Examples
measurement resolution
finite statistics
random variations in system
Systematic Uncertainties
Due to uncertainties in the
apparatus or model
Usually correlated with
previous measurements
Limited theoretical framework
Examples
calibrations uncertainties
detector acceptance
poorly-known theoretical
parameters
Weizmann Institute of Science
2
Literature Summary
Increasing literature on the topic of “systematics”
A representative list:
–
–
–
–
–
–
–
–
–
–
R.D.Cousins & V.L. Highland, NIM A320, 331 (1992).
C. Guinti, Phys. Rev. D 59 (1999), 113009.
G. Feldman, “Multiple measurements and parameters in the unified approach,”
presented at the FNAL workshop on Confidence Limits (Mar 2000).
R. J. Barlow, “Systematic Errors, Fact and Fiction,” hep-ex/0207026 (Jun 2002), and
several other presentations in the Durham conference.
G. Zech, “Frequentist and Bayesian Confidence Limits,” Eur. Phys. J, C4:12 (2002).
R. J. Barlow, “Asymmetric Systematic Errors,” hep-ph/0306138 (June 2003).
A. G. Kim et al., “Effects of Systematic Uncertainties on the Determination of
Cosmological Parameters,” astro-ph/0304509 (April 2003).
J. Conrad et al., “Including Systematic Uncertainties in Confidence Interval
Construction for Poisson Statistics,” Phys. Rev. D 67 (2003), 012002
G.C.Hill, “Comment on “Including Systematic Uncertainties in Confidence Interval
Construction for Poisson Statistics”,” Phys. Rev. D 67 (2003), 118101.
G. Punzi, “Including Systematic Uncertainties in Confidence Limits”, CDF Note in
preparation.
Weizmann Institute of Science
3
I. Case Study #1: W Boson Cross
Section
Rate of W boson production
– Count candidates Ns+Nb
– Estimate background
Nb & signal efficiency e
N c N b (e L)
– Measurement reported as
2.64 0.01 (stat)
0.18 (syst) nb
– Uncertainties are
stat 0 1/N c
syst 0 N b /N b e /e L /L
2
2
2
Weizmann Institute of Science
4
Definitions are Relative
Efficiency uncertainty estimated using Z
boson decays
– Count up number of Z candidates NZcand
Can identify using charged tracks
Count up number reconstructed NZrecon
recon
N
e Z cand e
NZ
NZ
recon
N
cand
Z
NZ
N Z cand
– Redefine uncertainties
2
stat 0 1/N c e /e
–
2
2
syst 0 N b /N b L /L
recon
Lessons:
• Some systematic uncertainties
are really “random”
• Good to know this
• Uncorrelated
• Know how they scale
• May wish to redefine
• Call these
“CLASS 1” Systematics
Weizmann Institute of Science
5
Top Mass Good Example
Top mass uncertainty in template analysis
– Statistical uncertainty from shape of
reconstructed mass distribution and
statistics of sample
– Systematic uncertainty coming from jet
energy scale (JES)
Determined by calibration studies,
dominated by modelling uncertainties
5% systematic uncertainty
Latest techniques determine JES
uncertainty from dijet mass peak (W->jj)
– Turn JES uncertainty into a largely
statistical one
– Introduce other smaller systematics
M top 171.8 1.9(stat + JES) 1.0 (syst) GeV/c 2
171.9 2.1 GeV/c 2
Weizmann Institute of Science
6
Case Study #2: Background
Uncertainty
Look at same W cross section analysis
– Estimate of Nb dominated by QCD backgrounds
Candidate event
– Have non-isolated leptons
– Less missing energy
Assume that isolation
and MET uncorrelated
Have to estimate the
uncertainty on NbQCD
– No direct measurement
has been made to verify the model
– Estimates using Monte Carlo modelling have large
uncertainties
Weizmann Institute of Science
7
Estimation of Uncertainty
Fundamentally different class of uncertainty
– Assumed a model for data interpretation
– Uncertainty in NbQCD depends on accuracy of model
– Use “informed judgment” to place bounds on one’s
ignorance
Vary the model assumption to estimate robustness
Compare with other methods of estimation
Difficult to quantify in consistent manner
– Largest possible variation?
Asymmetric?
– Estimate a “1 ” interval?
– Take
?
12
Lessons:
• Some systematic uncertainties
reflect ignorance of one’s data
• Cannot be constrained by
observations
• Call these
“CLASS 2” Systematics
Weizmann Institute of Science
8
Case Study #3: Boomerang CMB
Analysis
Boomerang is one of several
CMB probes
– Mapped CMB anisoptropy
– Data constrain models of the
early universe
Analysis chain:
– Produce a power spectrum for
the CMB spatial anisotropy
Remove instrumental effects through a complex
signal processing algorithm
– Interpret data in context of many models with
unknown parameters
Weizmann Institute of Science
9
Incorporation of Model
Uncertainties
Power spectrum extraction
includes all instrumental
effects
– Effective size of beam
– Variations in data-taking
procedures
Use these data to extract
7 cosmological parameters
– Take Bayesian approach
Family of theoretical models defined by 7 parameters
Define a 6-D grid (6.4M points), and calculate likelihood
function for each
Weizmann Institute of Science
10
Marginalize Posterior Probabilities
Perform a Bayesian
“averaging” over a grid
of parameter values
– Marginalize w.r.t. the
other parameters
NB: instrumental
uncertainies included
in approximate manner
– Chose various priors
in the parameters
Comments:
– Purely Bayesian analysis with
no frequentist analogue
– Provides path for inclusion of
additional data (eg. WMAP)
Lessons:
• Some systematic uncertainties
reflect paradigm uncertainties
• No relevant concept of a
frequentist ensemble
• Call these
“CLASS 3” Systematics
Weizmann Institute of Science
11
Proposed Taxonomy for Systematic
Uncertainties
Three “classes” of systematic uncertainties
– Uncertainties that can be constrained by ancillary
measurements
– Uncertainties arising from model assumptions or
problems with the data that are poorly understood
– Uncertainties in the underlying models
Estimation of Class 1 uncertainties straightforward
– Class 2 and 3 uncertainties present unique challenges
– In many cases, have nothing to do with statistical
uncertainties
Driven by our desire to make inferences from the data
using specific models
Weizmann Institute of Science
12
II. Estimation Techniques
No formal guidance on how to define a systematic
uncertainty
– Can identify a possible source of uncertainty
– Many different approaches to estimate their magnitude
Determine maximum effect
?
2
General rule:
?
– Maintain consistency with definition of
12
statistical intervals
– Field is pretty glued to 68% confidence intervals
– Recommend attempting to reflect that
in magnitudes of
systematic uncertainties
– Avoid tendency to be “conservative”
Weizmann Institute of Science
13
Estimate of Background
Uncertainty in Case Study #2
Look at correlation of Isolation and MET
– Background estimate
increases as isolation
“cut” is raised
– Difficult to measure or
accurately model
Background comes
primarily from very
rare jet events with
unusual properties
Very model-dependent
Assume a systematic uncertainty representing
the observed variation
– Authors argue this is a “conservative” choice
Weizmann Institute of Science
14
Cross-Checks Vs Systematics
R. Barlow makes the point in Durham(PhysStat02)
– A cross-check for robustness is not an invitation to introduce
a systematic uncertainty
Most cross-checks confirm that interval or limit is robust,
– They are usually not designed to measure a systematic
uncertainty
More generally, a systematic uncertainty should
– Be based on a hypothesis or model with clearly stated
assumptions
– Be estimated using a well-defined methodology
– Be introduced a posteriori only when all else has failed
Weizmann Institute of Science
15
III. Statistics of Systematic
Uncertainties
Goal has been to incorporate systematic uncertainties
into measurements in coherent manner
– Increasing awareness of need for consistent practice
Frequentists: interval estimation increasingly sophisticated
– Neyman construction, ordering strategies, coverage properties
Bayesians: understanding of priors and use of posteriors
– Objective vs subjective approaches, marginalization/conditioning
– Systematic uncertainties threaten to dominate as precision
and sensitivity of experiments increase
There are a number of approaches widely used
– Summarize and give a few examples
– Place it in context of traditional statistical concepts
Weizmann Institute of Science
16
Formal Statement of the Problem
Have a set of observations xi, i=1,n
– Associated probability distribution function (pdf) and
likelihood function
p x |q L q
p x |q
i
i
i
Depends on unknown random parameter q
Have some additional uncertainty in pdf
– Introduce a second unknown parameter l
L q, l px i | q, l
i
In some cases, one can identify statistic yj that
provides information about l
L q, l px i , y j | q, l
i, j
– Can treat l as a “nuisance parameter”
Weizmann Institute of Science
17
Bayesian Approach
Identify a prior pl for the “nuisance parameter” l
– Typically, parametrize as either a Gaussian pdf or a flat
distribution within a range (“tophat”)
– Can then define Bayesian posterior
L q, l p l dq dl
– Can marginalize over possible values of l
Use marginalized posterior to set Bayesian credibility
intervals,
estimate parameters, etc.
Theoretically straightforward ….
– Issues come down to choice of priors for both q,l
No widely-adopted single choice
Results have to be reported and compared carefully to
ensure consistent treatment
Weizmann Institute of Science
18
Frequentist Approach
Start with a pdf for data px i, y j | q, l
– In principle, this would describe frequency
distributions of data in multi-dimensional space
– Challenge is take account
of nuisance parameter
– Consider a toy model
px, y | ,n Gx n ,1Gy n ,s
Parameter s is Gaussian
width for n
Likelihood function (x=10, y=5)
– Shows the correlation
– Effect of unknown n
Weizmann Institute of Science
19
Formal Methods to Eliminate
Nuisance Parameters
Number of formal methods exist to eliminate
nuisance parameters
– Of limited applicability given the restrictions
– Our “toy example” is one such case
Replace x with t=x-y and parameter n with
2
s
v' n
1 s2
2
2
ts
s
pt, y | ,n ' G t , 1 s Gy n '
2,
1
s
1 s2
Factorized pdf and can now integrate over n’
Note that pdf for has larger width, as expected
– In practice, one often loses information using this
technique
Weizmann Institute of Science
20
Alternative Techniques for
Treating Nuisance Parameters
Project Neyman volumes onto parameter of
interest
– “Conservative interval”
– Typically over-covers,
possibly badly
Choose best estimate of
nuisance parameter
– Known as “profile method”
– Coverage properties
From G. Zech
require definition of ensemble
– Can possible under-cover when parameters strongly
correlated
Feldman-Cousins intervals tend to over-cover slightly
(private communication)
Weizmann Institute of Science
21
Example: Solar Neutrino Global
Analysis
Many experiments have measured solar neutrino flux
– Gallex, SuperKamiokande, SNO, Homestake, SAGE, etc.
– Standard Solar Model (SSM) describes n spectrum
– Numerous “global analyses” that synthesize these
Fogli et al. have detailed one such analysis
– 81 observables from these experiments
– Characterize systematic uncertainties through 31 parameters
12 describing SSM spectrum
11 (SK) and 7 (SNO) systematic uncertainties
Perform a c2 analysis
– Look at c2 to set limits on parameters
Hep-ph/0206162, 18 Jun 2002
Weizmann Institute of Science
22
Formulation of c2
In formulating c2, linearize effects of the systematic
uncertainties on data and theory comparison
R exp t R theor (c kx )
n
n
n k
N
c 2pull min x
un
n1
2
K
2
x k
k1
Uncertainties un for each observable
– Introduce “random” pull xk for each systematic
Coefficients ckn to parameterize effect on nth observable
Minimize c2 with respect to xk
Look at contours of equal c2
Weizmann Institute of Science
23
Solar Neutrino Results
Can look at “pulls” at c2
minimum
– Have reasonable distribution
– Demonstrates consistency of
model with the various
measurements
– Can also separate
Agreement with experiments
Agreement with systematic
uncertainties
Weizmann Institute of Science
24
Pull Distributions for Systematics
Pull distributions for xk
also informative
– Unreasonably small variations
– Estimates are globally too
conservative?
– Choice of central values
affected by data
Note this is NOT a
blind analysis
But it gives us some
confidence that intervals
are realistic
Weizmann Institute of Science
25
Typical Solar Neutrino Contours
Can look at probability
contours
– Assume standard c2 form
– Probably very small
probability contours have
relatively large
uncertainties
Weizmann Institute of Science
26
Hybrid Techniques
A popular technique (Cousins-Highland) does an
“averaging” of the pdf
– Assume a pdf for nuisance parameter g(l)
– “Average” the pdf for data x
pCH x | q
px | q, lgl dl
– Argue this approximates an ensemble where
Each measurement uses an apparatus that differs in
parameter l
– The pdf g(l) describes the frequency distribution
Resulting distribution for x reflects variations in l
Intuitively appealing
See, for example, J. Conrad et al.
– But fundamentally a Bayesian approach
– Coverage is not well-defined
Weizmann Institute of Science
27
Summary
HEP & Astrophysics becoming increasingly
“systematic” about systematics
– Recommend classification to facilitate understanding
Creates more consistent framework for definitions
Better indicates where to improve experiments
– Avoid some of the common analysis mistakes
Make consistent estimation of uncertainties
Don’t confuse cross-checks with systematic uncertainties
Systematics naturally treated in Bayesian framework
– Choice of priors still somewhat challenging
Frequentist treatments are less well-understood
– Challenge to avoid loss of information
– Approximate methods exist, but probably leave the “true
frequentist” unsatisfied
Weizmann Institute of Science
28