Pitch - Department of Psychology
Download
Report
Transcript Pitch - Department of Psychology
Psy280: Perception
Prof. Anderson
Department of Psychology
Audition 1 & 2
1
Hearing: What’s it good for?
Remote sensing
Not restricted like visual field
Can sense object not visible
2
Hearing: The sound of silence
A tree in the forest
One hand clapping
No physical signal, no perception
Separate physical quantity from perceptual
quality
Sound is the perceptual correlate of the physical
changes in air pressure
Physical signal but no perception
Or water pressure when under water
John Cage’s 4:33 No. 2, 1962
3
What are the physical attributes
associated with sound?
Loudness
Amplitude or height of pressure wave
Pitch
Frequency of times per second (Hz) a pressure wave
repeats itself
4
What is sound quality?
Pure tones
Single frequency (f)
Rarely exist in real world
Complex tones
More than one f
Due to resonance
Air pressure causes reverberations
E.g., tuning forks
E.g., Plucking the A string on a guitar
Fundamental frequency 440 Hz (cycles/s)
Harmonics
Reverberations at multiples of the fundamental
E.g., 880, 1320
Creates fullness of complex sounds
Timbre is the relative amplification of harmonics
5
The human ear
Outer ear
Focusing of sound
Resonance amplifies 20005000 Hz range
Converts from air to
mechanical vibration
Middle ear
Amplification
Fluid denser than air
Focus vibrations onto
stapes/oval window
Increased leverage from
ossicles
Inner ear
Sensory transduction
Physical to neural energy
Fluid pressure changes
Bending of hair cells
6
Auditory sensory
transduction: The inner ear
Cochlea
3 layers
Cochlear partition
Contains organ of corti
Organ of corti
Coiled and liquid filled
Cilia (hair) cells
Between basilar and
tectorial membranes
Transduction
Movement of cilia
between membranes
7
Auditory transduction
Bending—>physical energy
Converted to neural signals
Bend one direction —> depolarization
More likely to fire AP
Other direction —> hyperpolarization
Less likely to fire AP
8
Auditory pathways
QuickTime™ and a
GIF decompressor
are needed to see this picture.
9
Audition: What and where
What is it?
*Pitch
Identification
Surprisingly, little is
known beyond speech
Where is it?
*location
10
What: Pitch
How does neural firing signal different
pitches?
1) Timing codes
2) Place codes
11
Pitch: Temporal coding
Idea: Diff f’s signaled
by rate of neuronal
firing
Hair cell response
Bend one direction —>
depolarization
Other direction —>
hyperpolarization
Result?
Bursting pattern of
neural response
related to frequency of
oscillation
12
Problems with temporal
coding
Problem: A single neuron can’t fire at the rate necessary to
represent higher f tones
Solution: volley principle
E.g., 1000-20,000 Hz (i.e., 1000-20000 per second)
Max neuron firing rate: 500-800 per second
No single neuron represents f
Coding across many neurons with staggered firing rates
Evidence: Phase locking
Diff neurons respond to
diff peaks
Not every peak
Pool across multiple neurons to
represent high f’s
13
Pitch: Place coding
Related to doctrine of specific
nerve energies
What is pitch?
Activation of different places in
auditory system
Frequency specific
Tonotopy
Owl
brainstem
Cochlear
Brainstem
Cortical
Stimulate these regions
Should result in pitch
perception
Human auditory cortex
14
Place coding starts in cochlea
Von Bekesy studied
basilar membrane in
cadavers
Observed traveling waves
Diff frequencies (f) result
in waves w/ diff envelopes
Base more narrow and
stiffer
Apex wider and more
flexible
Higher f: Peak closer to
base
Lower f: Peak closer to
apex
Thus, f related to “place”
where peak fluctuation
15
Frequency tuning:
Neural place coding
Tonotopic arrangement of hair cell nerves
Diff nerves innervate diff parts of basilar membrane
Allows for “place” code for frequency
Frequency tuning curves of
single hair cells
16
Complex tones:
Fourier decomposition
Basilar
membrane acts
as f analyzer
Breaks down
complex f inputs
into constituent
pure tone
components
17
Auditory masking: Evidence
for cochlear place coding
Auditory masking
Presence of certain tones
decreases perception of
nearby tones
Similar f result in greater
masking
Asymmetry in spread of
masking
400 Hz mask
Increases threshold
for 800 more than
200 Hz
Consistent with basilar
vibrational overlap
E.g. 400 Hz mask overlaps
more with 800 than 200 Hz
18
Mystery of the missing
fundamental
400 Hz fundamental plus harmonics
(800, 1200, 1600, 2000)
What if remove fundamental f (400Hz)?
Perceived pitch doesn’t change!
Hence: The missing fundamental
Problem for place coding
Sounds like 400 Hz pitch with complex timbre
No direct stimulation of 400 Hz on basilar
membrane
Harmonic structure determines perceived pitch
Not what is present on basilar membrane
What we hear is not what the basilar membrane tell us, but what
19
our brain does
f
What does Barry White sound
like on the telephone?
Telephone carries 3003400Hz
Typical male voice
Barry white
Fundamental f = 120 Hz
30 Hz?
Can’t speak to Barry on the
telephone?
Missing fundamental allows
us to hear “virtual” pitch of
voice
20
If its too loud your too old
Db (SPL) scale
Attenuated low and high f
relative to midrange
High volume
Loudness varies with f
Low volume
Pain and pleasure
Audibility curves
Loudness doubles about
every 10 db at 1000 Hz
Less frequency attenuation
Low volume sounds muddy
Mostly mid range
I like my music loud
Each curve represents equal
loudness
21
Otoacoustic emissions:
Talking ears
Ears don’t only receive sounds, they make
them!
Occur spontaneously and also in response to
sound
It like your ears are talking back!
Created by movement of outer hair cells (ohc)
Discovered in 1978
Tiny microphones
Part of auditory sensitivity is movement of ohc to
change region specific flexibility of basilar membrane
Allows tuning curves to be so narrow
Hearing impairments often start with loss of ohc
function
22
Auditory localization
Where is the sound coming from?
Distance
Elevation (vertical)
Azimuth (horizontal)
Localization not nearly as precise as vision
Localization within 2-3.5 degrees in front of head
20 degrees behind head
Suggests important role of vision
Tunes auditory localization
23
Why is is auditory localization
not obvious?
Vision
Stimulate different photoreceptors in eye
Audition
No such separation of sounds sources on
sensory surface
Sources combine to equally stimulate ear
receptors
24
Why have two ears?
Two aural perspectives on the world
Like vision, can be used to get different
sound pictures of environment
Binaural cues
The disparities between ears is used for
localization
25
Azimuth
Interaural (between ears) Time
Difference (ITD)
Air pressure changes are very slow
relative to speed of light
ITD at side = max 600 µS
ITD at front = 0
Can induce perception of location by
varying ITD using headphones
Interaural Level (intensity) Difference
(ILD)
Amplitude decreases w/ distance
Head casts sound/acoustic shadow
Reduced amplitude due to reflection
Measure w/ tiny microphones
f dependent
Greater shadow for higher f
26
Elevation
ITD/ILD not very useful
Use spectral cues
Frequency information
can result in different
perceptual qualia
Monaural: f serves as
signal for pitch
Binaural: f serves as
signal for location
Pinna differentially
absorb f
Result: Notches in
frequency spectra
Above
Level
Below
27
Distance
At close distances (< 1 meter)
ILD can discriminate near and far
At very close distances ILD is very large (e.g. 20 Db)
But what’s that going to do for us?
At far distances
We are very poor judges for unfamiliar sounds
Use sound level for familiar sources
Frequency: Auditory atmospheric haze
Suggests that sound serves as signal for visual search
Absorption of high f
Sound muffled
Auditory parallax
Sounds move faster across ears at near relative to far
distances
28
Brain basis for localization
Sound to right
ITD detectors
Brainstem: Superior
olivary nucleus
Primary auditory cortex
Coincidence detection
Neurons fire maximally
when signals arrive at
same time
Thus: “coincidence”
Axonal distance create
input delays
Sound to left
29
Auditory scene analysis
How do we segregate different sounds being
produced by many sources simultaneously?
How do we tell what frequencies belong to what
source?
E.g., Cocktail party
Don’t perceive an unorganized jumble of frequencies
Not simply high vs low f
Most f ranges overlap
How do we segregate information as belonging to
distinct auditory objects?
30
Principles of auditory
grouping
Like gestalt visual principles
Auditory stream segregation
Similarity
Timbre
Location
Pitch
Time
1 stream
2 streams
31
Auditory-visual interactions:
Location and pitch
Visual capture of sound
Location: Ventriloquism
effect
Pitch: McGurk effect
QuickTime™ and a
Cinepak decompressor
are needed to see this picture.
“Ba”
“Va”
“Tha”
“Da”
Visual information is
integrated with audition
Creates fused auditory visual
perception
32
Auditory-visual interactions:
Location and pitch
Auditory experience is much more than
pressure level changes
33