Transcript Learning
Learning
Classical Conditioning
The Fathers of Behaviorism
(1st half of 20th century)
What is learning?
Long-lasting
change in behavior resulting
from experience. (behavior as
measurement of learning)
Classical Conditioning
Ivan Pavlov
1849-1936
30+ years in research
on Learning
Tripped upon theory
of learning while
studying digestion
Classical Conditioning
Pavlov’s Experiment
Classical Conditioning
Learning
to associate a neutral stimuli with
another stimuli that produces reflexive,
involuntary responses. (Bell with food…)
Key principles
Unconditioned
Something that elicits a natural, reflexive
response (food)
Unconditioned
stimulus (NS)
NS (bell) paired with the US (food) to form
association between US and CS
Conditioned
response: (UR=unlearned!)
natural, involuntary response (salivation)
Neutral
stimulus: US
Stimulus: (bell)
When CS elicits CR (NS becomes CS)
Conditioning Processes
Acquisition
Learning: when animal responds to CS
without US (bell = salivation)
Strength
of conditioning affected by order
and timing of US and CS
So, what’s the most effective method of
conditioning? (In Pavlov’s experiment)
Ring bell, while it is still ringing, present food
(Delayed conditioning)
What
if you ring bell after the food?
Backward conditioning = ineffective
Extinction
To
unlearn a behavior (suppressed)
Qualifies when CS no longer elicits the CR
How
is it done?
Present CS without US
(Bell without food)
Spontaneous Recovery
Sudden
reappearance of a CR upon
presentation of CS (after extinction)
Renewal
Effect: sudden reappearance of
CR after extinction when return to
environment where acquisition took place
Generalization / Discrimination
Generalization
When a stimulus similar to CS will elicit a CR.
(similar to bell sound- tapping glass with
spoon)
Discrimination
To distinguish between various stimuli
(between animals, objects. Sounds etc.)
John Watson
“Tabula Rasa”
1913: Publication in Psychology Review
“behaviorist manifesto”
1) Psychology’s content should be behavior
2) Method should be objective rather than introspective
3) Its goal should be the “prediction of and control of
behavior” rather than the fundamental understanding of
mental events
Age 30: chair of John Hopkins Psychology department
1915: president of American Psychological Association
John Watson: Aversive Conditioning
Little Albert
Experiment (11 months)
Aversive conditioning
(conditioning for
negative response)
US = loud noise
UR = fear
CS =rat
CR= fear of rat
Life examples?
Second-order Conditioning
Second-Order,
or higher order conditioning
Once a CS elicits a CR, the CS can be used
(as a US) to condition a response to a new
stimulus
Example:
Dog
salivates to bell (first order conditioning)
Light is paired with bell (second-order)
Light = salivation
Biology and Classical Conditioning
Animals / humans are biologically wired to make
certain associations more easily than others
Examples:
Learned taste aversions (adaptive response)
Disgust reactions (Rozin and Colleagues)
Fudge: shaped in squares & dog feces
Bottle: labeled sucrose & cyanide
Classical conditioning, but biologically predisposed
Garcia and Koelling’s Experiment
Biological
Preparedness in Classical
Conditioning
CS
US
Learned Response
Loud noise
shock
fear
Loud noise
radiation (nausea) nothing
Sweet water
Shock
nothing
Sweet water
radiation (nausea) avoid water
What conclusions can be drawn from this?
Results appear adaptive. (each animal has different
biological predispositions to learning that enhance survival)
Thus, significance?
Classical conditioning is a critical way in which
all organisms learn to adapt to their environment
Classical Conditioning in today’s world? Provide
an example for each of the following:
Drug addiction / quitting
politicians
Advertising
Fears / phobias
Operant Conditioning
Learning based on consequences…
Association made between consequences and one’s
behavior
Thorndike’s Law of Effect:
(Early pioneer of Operant Learning)
Positive consequences results in strong stimulusresponse connection = increased behavior
Negative consequence results in weakened stimulusresponse connection = decreased behavior
Instrumental Learning: Consequence shapes
behaviors
Thorndike’s Cat and Puzzle Box Study
Cat’s
learn gradually, not by insight.
Suggests stimulus-response process
Classical vs. Operant
Remember…
Classical
= stimuli
Operant = consequences
Introducing B.F. Skinner….
“Cognitive science is the creationism of psychology.”
“External influences, not internal thoughts feelings,
influence behavior.”
“Recognizing that behavior is shaped by its
consequences is the first step in taking control of the
environment and ensuring that it delivers consequences
promoting desirable behavior.”
“The movement toward a better society demands giving
up the belief in dignity.” (Dignity is an illusion..)
The Skinner Box
Skinner’s Reinforcement
Reinforcement
: A consequence that
encourages behavior
Types
Effects
Examples?
Positive R
Add something pleasant
Negative R
Remove something unpleasant
Skinner’s Punishment
Punishment: A consequence
that
discourages behavior
Types
Effects
Examples?
Positive Punishment
Adds something negative
Negative Punishment
(omission training)
Removes something pleasant
Application:
Reinforcement or Punishment?
Indicate for the following which type of
Reinforcement or Punishment applies.
1.
Taking aspirin for a headache.
Water boarding to force a confession
Running home to get out of the cold.
Having your license revoked for speeding.
Spanking a child for a tantrum.
$25 for each A you make…. (Insane!)
2.
3.
4.
5.
6.
Learning by Operant Conditioning
Shaping:
Reinforcing the steps used to reach a desired
behavior. (single behavior: Press bar for
food)
Chaining:
Reinforcing a number of separate behaviors
for a more complex activity. (Obstacle
course)
Operant Conditioning
1.
2.
3.
4.
5.
The following terms can also be applied
to Operant Conditioning. Explain an
example for each using the Skinner Box.
Acquisition
Extinction
Spontaneous recovery
Generalization
Discrimination
A Quick Review!!!
1.
2.
3.
4.
5.
Brain defrost……
Pavlov is to _________ , as Skinner is to
_________ .
Who coined the phrase behaviorism? (Baby
Albert and aversive conditioning…)
Pavlov’s classical conditioning involves
natural, or uncontrolled responses (T-F)
Identify Skinner’s four major consequences of
operant learning and give an example for
each…
Classical conditioning is shaped by ________ ,
while operant is shaped by _________ .
Types of Reinforcers
Primary
Reinforcers
Natural reinforcers: Examples?
• Food, water, rest (innately pleasing)
Secondary
Reinforcers
Things we’ve learned to value : Examples?
• Praise, treasure box, to play video games
Money as a Reinforcer
Money
Can be used for anything
Token
= generalized reinforcer
Economy
Tokens as positive reinforcement
Cash in for other reinforcers
Sound familiar?
• Schools, mental institutions, prisons
Challenges of Parenthood
What’s the Best Consequence…?
For each of the following, choose only one of
Skinner’s 4 consequences and how you would
specifically enforce it.
1.
2.
3.
4.
Your ten year old, in a fit of anger, accidentally
breaks the living room window.
Your 11th grade teenager tells you she is going to
a movie and heads to a party.
Your four year old kicks you in the shin.
Your middle school daughter makes straight A’s.
Questions for Consideration…
Is
spanking an advisable punishment for
children?
To what extent should we reinforce our kids?
For each of the following provide a specific
reinforcement, if one is necessary.
Making good grades on your report card
Completing your chores all week
Is buying a car for your 16 year old reinforcement?
Biology and Operant Conditioning
Reinforcers: Effects can Vary
Effect of reinforcer can vary depending on
animal, its instincts, and situation
Instinctive Drift: ignore rewards to follow natural
(instinctive) behavior
Premack Principle
If two activities- the one preferred can be used to
reinforce the one not preferred.
Example?
• Eat your lima beans = “may be excused”
Reinforcement Schedules
Reinforcement
Schedules = pattern of
reinforcing behavior
Administered in 2 ways
• Ratio = number of responses made
• Interval= passage of time
Reinforcement Schedules
4
major reinforcement schedules
FR (Fixed ratio) = Reinforcement after set
number of responses- FR-5
VR (Variable ratio) = Reinforcement after
varied number of responses (average number
of responses set- VR-5)
FI (Fixed Interval) =Fixed amount of time set
before reward for behavior- FI 3
VI (variable interval) =varied amount of time
before reward (average time set- VI-3)
Learning and Extinction
FR
and FI = faster acquisition (learning),
but faster extinction
VR and VI: Slower learning but slower
extinction
Which of these four yields the highest rate
of response?
FR
Why
slower extinction with VR and VI?
• Noticing a break in pattern is more difficult
• “always that chance…”
Activity: Identifying reinforcement schedules..
Learning with Punishment
Escape
learning
To terminate an aversive stimulus: Example?
• To disrupt English class so as to “get out”
Avoidance
learning
To avoid stimulus all together: Example?
• Cut English class
Pitfalls of Punishment…
According
to behaviorists, what are the
potential pitfalls of punishment?
Tells only what not to do, not what to do…
Creates anxiety which interferes with learning
Only suppresses behavior, doesn’t eliminate
(discrimination)
Physical punishment = aggressive behavior
(correlation, not causation)
Classical v. Operant
Compare
and contrast Classical and Operant
Conditioning.
Similarities
Both forms of associative learning
Both involve acquisition, extinction, spontaneous
recovery, generalization, discrimination
Both influenced by biology, cognition
Differences
Classical
Response = automatic
Reward independent of action
learning = autonomic response
Operant
Response = voluntary
Reward contingent on action
learning = voluntary behavior
Behavioral Legacy
Pavlov:
Skinner
Classical Conditioning- how all organisms learn to
adapt to their environment
Practical applications for fears, phobias, etc.
Definitive insight into learned behavior
Practical applications abound
Both asserted that learning occurs without
thought (cognition)
Focused only on observable behavior
Cognitive Learning
How
could cognitive theorists argue that
cognition is influential in both classical and
operant conditioning?
Classical: CS triggers anticipation of US
Operant: awareness that responses =
consequences and thus act to maximize
reinforcement (minimize punishment)
Cognitive Learning
Observational Learning
AKA Modeling
Observation / imitation
Mirror neurons (frontal lobe / neural basis for
observational learning)
Albert Banduras: Bobo Doll Experiment
• Social learning theory (species specific)
Prosocial behavior (role modeling)
Antisocial behavior (Bobo Doll Experiment)
• Implications for television and youth?
By age 75 in U.S. 9 yrs of T.V.! (9 of 10 homes)
World Pop Culture (billion tv sets)
MTV 17 languages / CNN 150 countries
Cognitive Learning
Latent Learning (“hidden”)
Learning that is not directly observable
Tolman’s Rat maze study
Group 1: reward every time reached goal
Group 2: no reward when reached goal
Group 3: no reward 1st 10 days, reward on 11th
Finding: Latent learning (3rd group had learned
cognitive map in 1st trial, but didn’t show it until
reward)
Thus learning takes place without reinforcement
Cognitive Learning
Learning (“aha!”)
Sudden grasp of problem
Wolfgang Kohler: Chimpanzees
sudden insight, not gradual strengthening
of S-R association
Insight
Cognitive Learning
Abstract
Learning
Higher order thinking (inferring relationships,
complex problem solving)