Transcript Session 6

Perception and VR
MONT 104S, Fall 2008
Lecture 6
Seeing Motion
1
Tips for Scientific Writing
1.
2.
3.
4.
5.
Stay focused: Writing should address question.
Good: People in the distance appeared to be small. Their size helped
me to judge their distance.
Not-so-good: The scenery was spectacular. The lakes were small so I
could not see any details.
Provide specific examples as evidence.
Write it all out:
Good: When walking downhill, things that were more distant were
lower in the visual field. This is a situation in which height in the
visual field would not be a useful cue to distance.
Not-so-good: I misjudged the size of the rocks.
Answer each part of the question: When size works, when size
doesn't work, when height works, when height doesn't work.
Be as clear as you can be. Simple and clear is better than flowery and
obtuse.
2
Causes of Image Motion
Image motion can result from numerous causes:
• A moving object in the scene
• Eye movements
• Motion of the observer
3
Uses of Image Motion
Image motion on the retina can be used to compute a variety of
scene properties. Among them are:
• Image segmentation (dividing up the scene into individual
objects or surfaces)
• 3D structure of an object (structure from motion)
• Depth (motion parallax)
• Time to collision
• Heading direction
• Moving object direction
• Speed of eye movements (for smooth pursuit)
4
Two stages of Motion
processing
Visual motion processing is thought to occur in two stages:
1) Extract the 2D image velocity field.
2) Use the 2D velocity field to compute properties of the scene
(as listed in the previous slide).
5
Motion Detection by Neurons
Problem:
• A single photoreceptor (or retinal ganglion cell) cannot detect
motion unambiguously.
•
A spot of light moving across its receptive field will cause a
temporary increase in light followed by a decrease.
•
The photoreceptor cannot distinguish between motion or
changes in ambient lighting.
6
A Neural Circuit for Motion
Barlow and Levick proposed a model to compare the response at one
location with a delayed response at a neighboring position.
1.
2.
3.
4.
Prefers right motion
Each neuron has a receptive field that
is shifted to the left with respect to
the receptive field of the previous
neuron.
Each neuron causes the neuron with a
RF to its left to be inhibited a short
time after the first neuron is
stimulated.
The neurons in the circuit will not
respond to leftward motion because
of this inhibition.
They will still respond to rightward
motion.
7
Mapping of Visual Areas
This map shows a flattened cortex with the
known visual areas mapped onto it.
There are a large number of
distinct visual areas (probably
at least 20).
Each area appears to have a
specific function.
The areas show a roughly
hierarchical organization
(although most areas have
reciprocal connections).
8
Two Major Processing Streams
There appear to be 2 major processing streams (although there
are cross connections between them):
1. The Dorsal Stream:
Includes areas MT, MST, VIP, 7a, etc.
Processes motion, stereo, spatial relationships
The "where" pathway.
2. The Ventral Stream:
Includes areas V4, IT, etc.
Processes color, form, objects.
The "what" pathway.
9
Dorsal and Ventral Streams
10
Motion Processing in V1
In V1, some simple cells and complex cells are tuned to
direction of motion. I.e. they respond most strongly to motion
in a given direction and their response falls off as the motion
deviates from that direction.
Tuning for 180 deg
Firing
Rate
120o
180o 240o
Direction of Motion
Direction Tuning
Polar Plot
(tuning for zero deg)
11
Motion Processing in MT
MT (The Middle Temporal Area) is thought to be important for
processing motion information.
Characteristics of MT neurons:
1) Cells tuned for direction of motion (more broadly tuned than
V1 cells.
2) Cells tuned for speed.
3) Large receptive field sizes. (Some are 100x bigger than V1
receptive fields). They range from 1-2deg in diameter in the
foveal region and increase in the periphery.
12
Speed Selectivity
McKee and Nakayama have shown that people are very good at
discriminating two different speeds.
The Weber fraction gives a measure of how big a change in speed
is necessary to distinguish two different speeds. It is fairly
constant over a broad range of speeds:
DV/V = .05
MT may be the area that first computes speed.
13
The Aperture Problem
•If our view is limited to an edge seen through an aperture, we can
only find the component of motion perpendicular to the edge.
•The aperture problem is a fundamental problem when one is trying
to measure image velocity using local detectors.
•This is true in biological vision (neurons have local receptive fields).
•In the "barber pole" illusion, the edges of the aperture affect
perception.
http://www.aceviper.net/Optical2/barber_pole_illusion.htm
Aperture
Edge
Perpendicular velocity
component
14
Motion Adaptation
•MT cells do not respond if opposite directions of motion (e.g.
left and right) presented at the same time. This is called motion
opponency.
•The detectors for opposite directions balance each other.
The waterfall effect:
•If you view one direction of motion for a long time, the detectors
for that direction become fatigued.
•If you then look at a stationary surface, it will appear to move in
the opposite direction.
•This works for expansion and contraction as well.
•Demo:
http://www.michaelbach.de/ot/mot_adaptSpiral/index.html
15
2D Motion is just the Beginning
2D image motion contains information about:
• Relative depth of surfaces
• 3D motion of objects
• 3D structure of objects
• Direction of observer motion
Among other things.
Many of these tasks require local comparisons of neighboring motions.
16
Motion Parallax
•For an observer moving in a straight line, the images of objects that are
nearby move faster than the images of objects that are far away.
•This is the result of perspective projection.
•The difference in image speed, known as motion parallax, is a strong
cue for the relative depth of objects.
•A moving observer can find changes in depth in the scene by finding
changes in image speed.
object 1
object 2
17
MT cells have inhibitory
surround
-
Many MT cells have an inhibitory surround.
+
Motion in the surround inhibits the response to
motion in the center.
The inhibitory surround may be involved in:
• Figure-ground segmentation based on motion.
• Motion parallax
• Heading judgments (judging where one is going).
18
Structure From Motion
•Relative motion can give information about the 3D structure of
objects.
•Structure from motion originally studied rigorously by Wallach
and O'Connell (1953).
•They studied wire-frame objects and examined peoples ability to
judge the structure of the objects when moving.
•The ability to see a 3D structure from a moving 2D image is
known as the Kinetic Depth Effect.
Demo: http://www.michaelbach.de/ot/mot_ske/index.html
Quick Time™a nd a
TIFF ( Unco mpre ssed ) dec ompr esso r
ar e nee ded to see this pictur e.
19
Detecting Biological Motion
Johansson attached points of light to the joints of people and
filmed them walking.
He showed that when the lights are in motion, people can
recognize the actions being performed.
Demo: http://www.michaelbach.de/ot/mot_biomot/index.html
Relative motion of the points of light is certainly important for
recognizing the actions.
20
Using Motion to See Where
You're Going
When one moves in a straight line, the images on the retina move in a radial
pattern.
The center of the pattern coincides with the direction you're going.
When we move on a curved path, the computation is more difficult (and
involves relative motion), but we can still judge our path of motion.
21