Descriptive Epidemiology Dr. KANUPRIYA CHATURVEDI
Download
Report
Transcript Descriptive Epidemiology Dr. KANUPRIYA CHATURVEDI
Descriptive
Epidemiology
Dr. KANUPRIYA CHATURVEDI
How we view the world…..
Pessimist: The glass is
half empty.
Optimist: The glass is
half full.
Epidemiologist: As
compared to what?
Epidemiology is...
Epidemiology is...
"The worst taught course in Medical
school."
Medical
Student
Epidemiology is...
"The science of making the obvious
obscure."
Clinical
Professor
Epidemiology is...
"The science of long division....
I'=[(480)(log2)(10E6)]/[(9.1)(0.955po)+0.45
n]"
Statistician
Definition of Epidemiology*
"The STUDY of the DISTRIBUTION and
DETERMINANTS of HEALTHRELATED STATES in specified
POPULATIONS, and the application of this
study to CONTROL of health problems."
*Last, J.M. 1988. A Dictionary of Epidemiology, 2nd ed.
Epidemiology: Definition
Dynamic study of the
Determinants
Occurrence
Distribution
Control
Pattern
Of health and disease in a population
Epidemiology
EPI
DEMO
Upon,on,befall
LOGOS
People,population,man
The study of anything that happens to
people
“That which befalls man”
the Study of
Definition of Epidemiology
A quantitative basic science, built on a working
knowledge of probability, statistics and sound
research methods.
A method of causal reasoning, based on
developing and testing biologically plausible
hypothesis pertaining to occurrence and
prevention of morbidity and mortality.
A tool for public health action to promote and
protect the public's health based on science, causal
reasoning, and a dose of practical common sense.
Epidemiology is a Quantitative
Discipline
Measures of frequency
Counts and rates
Measures of association
Relative risk
Odds ratio
Statistical inference
P-value
Confidence limits
Clinician
Epidemiologist
Patient’s
diagnostician
Community’s
diagnostician
Investigations
Investigations
Diagnosis
Predict trend
Therapy
Control
Cure
Prevention
Epidemiology
Describes
health events
cause and risk factors of disease
clinical pattern of disease
Identify
syndromes
Identify control and/or preventive measures
So, Epidemiology
Is the basic science of public health
Provides insight regarding the nature, causes,
and extent of health and disease
Provides information needed to plan and target
resources appropriately
Kinds of Epidemiology
Descriptive
Study of the occurrence and
distribution of disease
Analytic
Further studies to determine the
validity of a hypothesis concerning
the occurrence of disease.
Experimental
Deliberate manipulation of the
cause is predictably followed
by an alteration in the effect
not due to chance
Overview of epidemiologic design
strategies
Descriptive
Populations{Correlational studies}
Individual
Case report
Case series
Cross sectional studies
Analytic studies
Observational
Case control
Cohort
Retrospective
Prospective
Interventional/Experimental
Randomized controlled trial
Field trial
Clinical trial
Descriptive vs. Analytic Epidemiology
Descriptive
Used when little is
known about the
disease
Analytic
Used when insight about
various aspects of disease is
available
Rely on preexisting
data
Rely on development of new
data
Who, where, when
Why
Illustrates potential
associations
Evaluates the causality of
associations
Both are important!
Descriptive Studies
Relatively inexpensive and less time-consuming
than analytic studies, they describe,
Patterns of disease occurrence, in terms of,
Who gets sick and/or who does not
Where rates are highest and lowest
Temporal patterns of disease
Data provided are useful for,
Public health administrators (for allocation of resources)
Epidemiologists (first step in risk factor determination)
Descriptive Epidemiology
Correlational studies
Case reports
Case series
Cross sectional studies
Correlational Studies (Ecological Studies)
Uses measures that represent characteristics of
entire populations
It describes outcomes in relation to age, time,
utilization of services, or exposures
ADVANTAGES
We can generate hypotheses for case-control studies and
environmental studies
We can target high-risk populations, time-periods, or
geographic regions for future studies
Correlational Studies
LIMITATIONS
Because data are for groups, we cannot link disease and
exposure in individual
We cannot control for potential confounders
Data represent average exposures rather than individual
exposures, so we cannot determine a dose-response
relationship
Caution must be taken to avoid drawing inappropriate
conclusions, or ecological fallacy
Patterns of disease Occurrence :
Correlation of Population statistics
Ecologic ( correlation ) studies
Used as first step in determining association
plot : disease (population) burden [ Y axis ]
vs. prevalence of “risk factor” [ X axis ]
e.g. smoking vs. lung cancer
-- correlation coefficient : r ; + 1 to -1
Quantifies linear relationship between exposure & disease
Case Reports (case series)
Report of a single individual or a group of
individuals with the same diagnosis
Advantages
We can aggregate cases from disparate sources to generate
hypotheses and describe new syndromes
Example: hepatitis, AIDS
Limitations
We cannot test for statistical association because there is no
relevant comparison group
Based on individual exposure {may simply be coincidental}
Case report/Case series(contd.)
Important interface between clinical medicine &
epidemiology
Most common type of studies published in
medical journals{1/3rd of all}
e.g. Frisbee finger , break dancing neck
AIDS ~ b/w oct1980-may81, 5 cases of P.carinii
pneumonia were diagnosed among previously healthy
young homosexual males in L.A.
Cross-Sectional Studies (prevalence studies)
Measures disease and exposure simultaneously in a
well-defined population
Advantages
They cut across the general population, not simply those
seeking medical care
Good for identifying prevalence of common outcomes, such
as arthritis, blood pressure or allergies
Limitations
Cannot determine whether exposure preceded disease
It considers prevalent rather than incident cases, results
will be influenced by survival factors
Remember: P = I x D
Cross-Sectional Studies
Can be used as a type of analytic study for testing
hypothesis, when;
Current values of exposure variables are unalterable over
time
Represents value present at initiation of disease
E.g. eye colour or blood group
If risk factor is subject to alterations by disease, only
hypothesis formulation can be done
The epidemiologic approach:
Steps to public health action
DESCRIPTIVE
What (case
definition)
Who (person)
Where (place)
When (time)
How many
(measures)
ANALYTIC
Why (Causes)
How (Causes)
MEASURES
Counts
Times
Rates
Risks/Odds
Prevalence
METHODS
Design
Conduct
Analysis
Interpretation
ALTERNATIVE
EXPLANATIONS
Chance
Bias
Confounding
INFERENCES
Epidemiologic
Causal
ACTION
Behavioural
Clinical
Community
Environmental
Key questions
Why now?
Why here?
Why in this group?
Descriptive Epidemiology
Study of the occurrence and distribution of
disease
Terms:
Time
Place
Person
What are the three categories of
descriptive epidemiologic clues?
□ Person: Who is getting sick?
□ Place:
Where is the sickness occurring?
□ Time:
When is the sickness occurring?
PPT = person, place, time
Time
Secular
Periodic
Seasonal
Epidemic
Secular Trend
The long-time trend of disease
occurrence
Tetanus – by year, USA, 1955-2000
During 2000, a total of 35 cases of tetanus were reported. The percentage of cases among persons aged 25-59 years
Has increased in the last decade. Note: A tetanus vaccine was first available in 1933.
900
800
700
600
500
400
300
200
100
0
1955
1960
1965
1970
1975
1980
Year
1985
1990
1995
2000
Possible Reasons for Changes in
Trends
Artifactual
Errors
in numerator due to
Changes
in the recognition of disease
Changes in the rules and procedures for
classification of causes of death
Changes in the classification code of causes of
death
Changes in accuracy of reporting age at death
Errors in the denominator due to error in the
enumeration of the population
Possible Reasons for Changes in
Trends (cont.)
Real
Changes in age distribution of the population
Changes in survivorship
Changes in incidence of disease resulting
from
Genetic factors
Environmental factors
Other phrases
Cyclic trends ~ recurrent alterations in
occurrence , interval or frequency of disease
Secular cyclicity
Levels
of immunizations
Build up of susceptibles
e.g.
Short
Hep A-7 yr cycle,Measles-2yr cycle
term cyclicity
Chickenpox,salmonella(yearly
basis)
Periodic Trend
Temporal interruption of the general
trend of secular variation
Whooping Cough - Four-monthly
admissions, 1954-1973
Seasonal
A
cyclic variation in disease frequency
by time of year & season.
Seasonal
fluctuations in,
Environmental factors
Occupational activities
Recreational activities
Seasonal Trend
Pneumonia-Influenza Deaths – By year,
1934-1980
Epidemic
An increase in incidence above the expected
in a defined geographic area within a
defined time period
Endemic, Epidemic and Pandemic
Endemic - The habitual presence (or usual occurrence) of a
disease within a given geographic area
Epidemic - The occurrence of an infectious disease clearly in
excess of normal expectancy, and generated
from a common or propagated source
Pandemic - A worldwide epidemic affecting an exceptionally
high proportion of the global population
Number
of Cases
of
Disease
Time
Time clustering
Time Place Cluster/disease cluster
A
group of cases occur close together
& have a well aligned distribution
pattern {in terms of time and place}
Cluster analysis-used for rare or special disease
events.
Time/Place clustering analysis using the
Poisson model
{Poisson spatial/nearest neighbor distribution}
Poisson probability distribution is an inferential statistics probability
measure.
Describes objects/events as they are distributed geographically.
Geographical area divided into a series of equal square areas.
Randomization i.e. each case has equal probability of falling into each
square.
If clustering occurs, probability of cause-effect relationship goes up &
vice versa.
Place
Geographic Area
Diagnosis is Made
Contact occurred
between agent
and host
Source became
infected
Example
Action Level
Home – Patient ill
Investigation
Restaurant – Food
Eaten
Control
Farm – Eggs Infected
Prevention
Person
Age
Sex
Occupation
Immunization status
Underlying disease
Medication
Nutritional status
Socioeconomic factors
Crowding
Hobbies
Pets
Travel
Personal Habits
Stress
Family unit
School
Genetics
Religion
Descriptive epidemiology :
Patterns of Disease Occurrence
distribution of disease in populations
numerator ( “event” count ) / denominator ( group “at
risk” )
by “person” : age , race / ethnicity , gender ,
occupation , education , marital status , genetic
marker , sexual preference
by “place” : residence (urban vs. rural) , worksite ,
social event
by “time” : week , month , year ; sporadic , seasonal
, trends
--- incubation period ; latency
Sources of information
Census data
Vital statistical records
Employment health examinations
Clinical records from hospitals
National figures on food consumption ,
medications, health events etc
Epidemiologic ( scientific ) Approach
1. Identify a PROBLEM :
clinical suspicion ; case series ; review of medical literature
2. Formulate a HYPOTHESIS ( asking the right question ) ;
good hypotheses are: Specific, Measurable, and Plausible
3. TEST that HYPOTHESIS ( assumptions vs. type of data )
4. always Question the VALIDITY of the result(s) :
Chance ; Bias ; and Causality
Epidemiologic Study: threats to Validity
Chance : role of random error in outcome measure(s)
( p - value ; power of the study and the confidence interval )
--- largely determined by sample size
Bias : role of systematic error in outcome measure(s)
Selection bias - subjects not representative
Information bias - error(s) in subject data / classification
Confounding - 3rd variable (causal) assoc. w/ both X and Y
What is a hypothesis?
An educated guess
an unproven idea
based on observation or reasoning, that can be
proven or disproven through investigation.
What goes into a hypothesis?
Characteristics of the disease
The illness
Established modes of transmission
Distribution
In time
By place
By person
Hypothesis formulation
4 methods {derived from 5 canons of inductive
reasoning by John Stuart Mill}
Method
of difference
Method of agreement
Method of concomitant variation
Method of analogy
Measures
Morbidity: Refers to the presence of disease in a
population
Mortality: Refers to the occurrence of death in a
population
Methods for Measuring
How do we determine disease frequency for a
population?
Rate = Frequency of defined events in specified
population for given time period
Rates allow comparisons between two or more
populations of different sizes or of a population
over time
Compute Disease Rate
Number of persons at risk = 5,595,211
Number of persons with disease = 17,382
Rate = 17,382 persons with heart disease
5,595,211 persons
= .003107 heart disease / resident / year
Rates
Rates are usually expressed as integers and
decimals for populations at risk during specified
periods to make comparisons easier.
.003107 heart disease / resident / year x 100,000
= 310.7 heart disease / 100,000 residents / year
Prevalence vs. Incidence
Prevalence is the number of existing cases of
disease in the population during a defined
period.
Incidence is the number of new cases of
disease that develop in the population during a
defined period.
Incidence
Incidence rate is a measure of the
probability of the event among persons at
risk.
Incidence Rates
Population denominator:
IR = # new cases during time period X K
specified population at risk
Example (Incidence Rate)
During a six-month time period, a total of 53 nosocomial
infections were recorded by an infection control nurse
at a community hospital. During this time, there were
832 patients with a total of 1,290 patient days. What is
the rate of nosocomial infections per 100 patient days?
Mortality Rates
A special type of incidence rate
Number of deaths occurring in a specified
population in a given time period
Use of Mortality rates
Mortality rates are used to estimate disease
frequency when…
incidence data are not available,
case-fatality rates are high,
goal is to reduce mortality among screened or
targeted populations
Mortality Rates: Examples
Crude mortality: death rate in an entire
population
Rates can also be calculated for sub-groups within
the population
Cause-specific mortality: rate at which deaths
occur for a specific cause
Mortality Rates: Examples
Case-fatality: Rate at which deaths occur from a
disease among those with the disease
Maternal mortality: Ratio of death from
childbearing for a given time period per number
of live births during same time period
Mortality Rates: Examples
Infant mortality: Rate of death for children less
than 1 year per number of live births
Neonatal mortality: Rate of death for children
less than 28 days of age per number of live
births
Prevalence
Prevalence: Existing cases in a specified
population during a specified time period (both
new and ongoing cases)
Prevalence is a measure of burden of disease or
health problem in a population
Prevalence
Prevalence: The number of existing cases in the
population during a given time period.
PR
=
# existing cases during time period
population at same point in time
Prevalence rates are often expressed as a percentage.
Factors Influencing Prevalence
Increased by:
Longer duration of the
disease
Prolongation of life of
patients without cure
Increase in new cases
(increase in incidence)
In-migration of cases
Out-migration of
healthy people
In-migration of
susceptible people
Improved diagnostic
facilities
(better reporting)
Decreased by:
Shorter duration of
disease
High case-fatality
rate from disease
Decrease in new
cases (decrease in
incidence)
In-migration of
healthy people
Out-migration of
cases
Improved cure rate
of cases
Basic Measures of Association
Relative risk& odds ratio
We often need to know the relationship between
an outcome and certain factors (e.g., age, sex,
race, smoking status, etc.)
Used to guide planning and intervention
strategies
2 x 2 contingency table for Calculation of
Measures of Association
Outcome
Exposure
Present
Absent
TOTAL
Present
a
b
a+b
Absent
c
d
c+d
TOTAL
a+c
b+d
a+b+c+d
Note: “Exposure” is a broad term that represents any
factor that may be related to an outcome.
Relative Risk
Ratio of the incidence rates between two groups
Can only be calculated from prospective studies
(cohort studies)
Interpretation
RR > 1: Increased risk of outcome among “exposed”
group
RR < 1: Decreased risk, or protective effects, among
“exposed” group
RR = 1: No association between exposure and
outcome
Calculation of Relative Risk
incidence rate among exposed
RR =
incidence rate among non-exposed
Calculation of Relative Risk
Outcome
Exposure
Present
Absent
TOTAL
Present
a
b
a+b
Absent
c
d
c+d
TOTAL
a+c
b+d
a+b+c+d
Relative Risk =
a
a b
c
c d
Relative Risk Case Study
Smoking and low birth weight
Birth Weight
Smoking status
<2500 g
>2500 g
TOTAL
Smoker
120
240
360
Non-smoker
60
580
640
TOTAL
180
820
1000
Answers to Relative Risk Case Study
1. Incidence of LBW among
smokers
120
x1,000 3333
.
360
2. Incidence of LBW among
non-smokers
60
x1,000 938
.
640
3. Relative risk for having a
LBW baby among smokers
versus non-smokers
333.3
3.6
93.8
Understanding Probability and Odds
Probability: Chance or risk of an event occurring (a
proportion)
Probability= no. of times an event occurs
no. of times an event can occur
Odds: ratio of the probability of an event occurring to
the probability of an event not occurring
Odds = P/(1-P)
Calculation of Odds Ratio
Outcome
Exposure
Present
Absent
TOTAL
Present
a
b
a+b
Absent
c
d
c+d
TOTAL
a+c
b+d
a+b+c+d
Odds Ratio =
ad
bc
Odds Ratio
The odds ratio (OR) is a ratio of two odds.
The OR can be calculated for all three study
designs
Cross-sectional
Case-control
Cohort.
Various approaches to Odds ratio
Cross product/odds ratio
Prevalence odds ratio
cross sectional studies
Exposure odds ratio( odds of exposure in diseased vs. nondiseased)
2 x 2 contingency table (ad/bc)
In rare cases or exotic diseases
Disease odds/Rate odds ratio(odds of getting a disease if exposed
or unexposed)
Cohort
& cross sectional
Risk odds ratio
Cross
sectional ,cohort & case control
Odds Ratio
For cohort & cross sectional studies: OR is a
ratio of the odds of the outcome in exposed
persons to the odds of the outcome in nonexposed persons.
For case-control studies: OR is a ratio of the
odds of exposure in cases to the odds of
exposure in controls.
Provides an estimate of the relative risk when
the outcome is rare
Interpretation of Odds Ratio
OR > 1: Increased odds of exposure among those
with outcome
OR < 1: Decreased odds, or protective effects,
among those with outcome
OR = 1: No association between exposure and
outcome
Keeping the Terms Straight
“Risk ratio” = “relative risk”
“Relative odds” = “odds ratio”
Remember – the key is recognizing the terms
“risk” and “odds”
Appropriateness of Measures
Remember that the relative risk can only be
calculated in prospective studies
Odds ratio can be calculated for any design
Cohort / prospective
Case-control
Cross-sectional
Inference
The relative risk and odds ratio provide the
magnitude of difference between some factor
and an outcome
How do we know if the magnitude is statistically
significant?
Confidence Intervals
A confidence interval is a range of values that is
likely (e.g., 95%) to contain the true value in the
underlying population
The 10 Steps of Outbreak Investigation
Prepare for field work
Establish the existence of an outbreak
Verify the diagnosis
Define & identify cases
Perform descriptive epidemiology
Develop hypotheses
Perform analytic epidemiology
Refine hypotheses & conduct additional studies
Implement control & prevention measures
Communicate findings
Objectives of Descriptive Epidemiology
To evaluate trends in health and disease and allow
comparisons among countries and subgroups within
countries
To provide a basis for planning, provision and
evaluation of services
To identify problems to be studied by analytic methods
and to test hypotheses related to those problems