Transcript File
WARMUP
1. What happens when Pavlov’s dogs have gone
through acquisition?
2. How could extinction occur with Pavlov’s dogs?
3. How could spontaneous recovery occur with
Dwight in “The Office?”
4. How could generalization occur with the squirt
bottle activity?
5. How could discrimination occur with Little
Albert?
LEARNING STYLE #2:
OPERANT
CONDITIONING
DIFFERENCE BETWEEN CC AND OC
In Classical Conditioning, do the
organisms have to DO anything to learn
a response?
No, it’s just automatic
In Operant Conditioning, the organisms
must “operate” in order to learn.
They have to DO something
LAW OF EFFECT
Thorndike’s
law of effect – Rewarded behavior is
likely to occur again
Thorndike used a puzzle box
Rewarded with a fish if they got
Cats improved on their time
out
Yale University Library
4
OPERANT CHAMBER
Skinner
Developed the Operant chamber, or the Skinner box
Used this to teach animals behaviors that were unlike
their natural behavior
http://www.youtube.com/watch?v=vGazyH6fQQ4
https://www.youtube.com/watch?v=jAQSEO25fa4
Walter Dawn/ Photo Researchers, Inc.
From The Essentials of Conditioning and Learning, 3rd
Edition by Michael P. Domjan, 2005. Used with permission
by Thomson Learning, Wadsworth Division
B.F.
5
6
I NEED A VOLUNTEER
SHAPING
Reinforcers
guide behavior towards the desired target
behavior through successive approximations.
“Little by little”
8
REWARDS AND PUNISHMENTS
Reinforcement
Any event that strengthens the behavior it follows
How could I strengthen your studying habits?
Punishment
Any event that decreases the behavior that it follows
What could I do to decrease your talking in class?
DIFFERENT TYPES OF
REINFORCERS AND PUNISHMENT
Basic thing to know:
Positive does not mean GOOD
Negative does not mean BAD
Positive means to “GIVE”
Negative means to “TAKE AWAY”
10
TYPES OF REINFORCERS
Reuters/ Corbis
11
PUNISHMENT
12
Operant Conditioning
Positive Reinforcement
Positive Punishment
*“Give something good”
*“Give bad”
Cookies for A’s
Spanking for lying
Negative Reinforcement
*“Take away something bad”
Negative Punishment
*“Take away good”
Take away a lap of running for
good practice
Take away cell phone for F’s
13
PUNISHMENT PROBLEMS
Although there may be some justification for occasional
punishment, it usually leads to negative effects.
1.
2.
3.
4.
5.
6.
Results in unwanted fears.
Conveys no information to the organism.
Justifies pain to others.
Causes unwanted behaviors to reappear in its
absence.
Causes aggression towards the agent.
Causes one unwanted behavior to appear in
place of another.
14
WARMUP!
1 – Positive Reinforcement
2 – Negative Reinforcement
3 – Positive Punishment
4 – Negative Punishment
PRIMARY & SECONDARY REINFORCERS
1.
2.
Primary Reinforcer: An innately reinforcing
stimulus like food or drink.
Conditioned Reinforcer: (Secondary Reinforcer)
A learned reinforcer that gets its reinforcing power
through association with the primary reinforcer.
16
IMMEDIATE & DELAYED REINFORCERS
1.
Immediate Reinforcer: A reinforcer that occurs
instantly after a behavior.
*A rat gets a food pellet for a bar press.
2.
Delayed Reinforcer: A reinforcer that is
delayed in time for a certain behavior.
* A paycheck that comes at the end of a week.
We may be inclined to engage in small immediate
reinforcers (watching TV) rather than large delayed
reinforcers (getting an A in a course) which require
consistent study.
17
REINFORCEMENT SCHEDULES
1. Continuous Reinforcement: Reinforces the desired
response each time it occurs.
Learning occurs quickly
2. Partial (Intermittent): Reinforces a response only
part of the time.
Though this results in slower acquisition in the beginning,
it shows greater resistance to extinction later on.
18
PARTIAL REINFORCEMENT SCHEDULES
Fixed-ratio
Variable-ratio
Fixed-interval
Variable-interval
RATIO SCHEDULES
Fixed-ratio schedule: Reinforces a response only after a
specified number of responses. (paid on commission)
1.
Positive - Will continue working again right when they finish
Variable-ratio schedule: Reinforces a response after an
unpredictable number of responses.
2.
Positive - This is hard to extinguish because of the unpredictability.
20
INTERVAL SCHEDULES
1.
Fixed-interval schedule: Reinforces a response only after a
specified time has elapsed.
2.
Negative: This produces a choppy start-stop pattern
Variable-interval schedule: Reinforces a response at
unpredictable time intervals
Positive/Negative: Produces slow, steady responses
21
SCHEDULES OF REINFORCEMENT
22
WARMUP
1. Give an example of positive punishment.
2. Give an example of negative reinforcement.
3. What is the difference between fixed and variable?
4. What is the difference between ratio and interval?
5. Give an example of a fixed ratio reinforcement schedule.
6. Give an example of a variable interval reinforcement
schedule.
COGNITION & OPERANT CONDITIONING
Can
we learn without rewards?
How do we know?
Latent
learning
Cognitive maps
24
MOTIVATION
Intrinsic
Motivation: The
desire to perform a behavior
for its own sake.
Extrinsic
Motivation: The
desire to perform a behavior
due to promised rewards or
threats of punishments.
25
OPERANT CONDITIONING IN OTHER FIELDS
OPERANT VS. CLASSICAL CONDITIONING
27
LEARNING TO COPE
Problem-focused coping
Alleviating stress directly – dealing with the problem
Emotion-focused coping
Alleviating stress by attending to emotional needs
LOCUS OF CONTROL
External Locus of Control
Perception that
outside forces control
your fate
“My teacher
sucks. I’m gonna
fail this class”
Internal Locus of Control
Perception that you
control your fate
“I gotta work
harder. I didn’t
study for that.”
“I CAN’T HELP IT”
Is this something that can be
learned?
Martin Seligman
Dogs strapped in harness
Given repeated shocks
Could not avoid them
What did they learn?
They are helpless
30
LEARNED HELPLESSNESS
Later, dogs were placed in a situation where they could
escape the shocks
The dogs cowered and did not escape shocks
What does this tell us?
31
LEARNED HELPLESSNESS
The hopelessness and passive resignation learned
when unable to avoid repeated aversive events