MARINE SCIENCE - Phoenix College
Download
Report
Transcript MARINE SCIENCE - Phoenix College
Data
Handbook Chapter 4 & 5
Data
A series of readings that represents a
natural population parameter
It provides information about the
population itself
Organizing Data
Important prelude to describing and
interpreting data
Charting Data
Tables
– Organized by rows and columns
Column 1 Column 2 Column 3
Row 1
Row 2
Row 3
Charting Data
Graphs
– Organized by horizontal (abscissa) and vertical
(ordinate) axes
Charting Data
Graphs
– Proper legend
– Properly labeled axes
Graphs
Multiple graphs used for comparing data should
map the same variables on the ordinate and
abscissa and use the same scale for each graph.
Describing data
Descriptions of data indirectly describes
actual population parameters
Describing the data distribution is a first
step in this process
Data Distributions
Pattern of frequency
Frequency is how often a particular
value or set of values occurs in a data
set
Histogram Frequency Graph
Types of distributions
Uniform
Unimodal
Bimodal
Normal
Skewed
Uniform
The distribution has an equal frequency
(number of occurrences) of each value
or category of values
Uniform Distribution of Tree
Heights
Uniform Frequency Graph
This is not a uniform distribution.
Unimodal
The distribution has an unequal
frequency (number of occurances) of
each value or category of values
The distribution has distinct central
values that have a greater frequency
than the others
Unimodal Distribution of Tree
Heights
Unimodal Frequency Graph
Skewed
The distribution has distinct central
values that have a greater frequency
than the others
The less frequent values are not evenly
distributed on either side of the high
point
Skewed Distribution of Tree
Heights
Skewed Frequency Graph
Bimodal
The distribution has two distinct values
or sets of values that have greater
frequencies than the others
These values are separated from one
another by less frequent values
Often indicative of two populations
Bimodal Distribution of Tree
Heights
Bimodal Frequency Graph
Two Populations of Trees
Normal
Frequencies are equally spread out on
either side of a central high point
Bell shaped
Most frequent type of distribution
Normal Frequency Graph
Interpreting Data
Descriptive statistics are used to
summarize data
Several descriptive statistics are used to
describe two important aspects of data
distributions:
– Central Tendency
– Dispersion
Central Tendency
Most data are spread out around a
central high point
The central values are the ones that
occur most often and thus important to
report
Measures of Central Tendency
Three common measurements
– Mean
»Average value
– Median
»Center value
– Mode
»Most frequent value
Mean
“Typical Value”
N
Mean =
S
Xi
i=1
N
Normal Distribution and
Central Measures
In a perfectly normal distribution the
mean, median and mode are all the
same
Perfectly Normal Distribution
Dispersion
The distribution of values that occur less
often
The spread of the data around the
central values is important to report
Dispersion is about the degree of
clustering of the data
Measures of Dispersion
Two common measurements
– Range
»Distance between the lowest and
highest values
– Standard Deviation
»Average deviation from the mean
Calculate Mean and Range
Ranges
Calculating Standard Deviation
Normal Distribution and
Dispersion
68.26% of values fall within one
standard deviation on either side of the
mean
95.44% of values fall within two
standard deviations on either side of the
mean
99.74% of values fall within three
standard deviations on either side of the
mean
Normal Distribution and
Standard Deviation
Graph of
Dispersion
Accuracy & Precision
Accuracy
Error
Error = Accuracy of a particular data point
relative to an accepted value
Absolute Error = I Accepted – Data I
Percent Error = I Accepted – Data I x 100
Accepted
Precision
Precision is a measure of how
consistent the data within a data set are
relative to each other
One measure of precision of a data set
is the standard deviation SD provided
that m (the mean) is the accepted value
m + SD
Calculation of SD of a Data Set
1/2
N
SD = S (m –
i=1
N-1
2
X)
i