Transcript File

WARMUP
1. What happens when Pavlov’s dogs have gone
through acquisition?
 2. How could extinction occur with Pavlov’s dogs?
 3. How could spontaneous recovery occur with
Dwight in “The Office?”
 4. How could generalization occur with the squirt
bottle activity?
 5. How could discrimination occur with Little
Albert?

LEARNING STYLE #2:
OPERANT
CONDITIONING
DIFFERENCE BETWEEN CC AND OC
In Classical Conditioning, do the
organisms have to DO anything to learn
a response?
 No, it’s just automatic


In Operant Conditioning, the organisms
must “operate” in order to learn.

They have to DO something
LAW OF EFFECT
 Thorndike’s
law of effect – Rewarded behavior is
likely to occur again
 Thorndike used a puzzle box
 Rewarded with a fish if they got
 Cats improved on their time
out
Yale University Library
4
OPERANT CHAMBER
Skinner
 Developed the Operant chamber, or the Skinner box
 Used this to teach animals behaviors that were unlike
their natural behavior
 http://www.youtube.com/watch?v=vGazyH6fQQ4
 https://www.youtube.com/watch?v=jAQSEO25fa4
Walter Dawn/ Photo Researchers, Inc.
From The Essentials of Conditioning and Learning, 3rd
Edition by Michael P. Domjan, 2005. Used with permission
by Thomson Learning, Wadsworth Division
 B.F.
5
6
I NEED A VOLUNTEER
SHAPING
 Reinforcers
guide behavior towards the desired target
behavior through successive approximations.

“Little by little”
8
REWARDS AND PUNISHMENTS

Reinforcement
Any event that strengthens the behavior it follows
 How could I strengthen your studying habits?


Punishment


Any event that decreases the behavior that it follows
What could I do to decrease your talking in class?
DIFFERENT TYPES OF
REINFORCERS AND PUNISHMENT

Basic thing to know:
Positive does not mean GOOD
 Negative does not mean BAD

Positive means to “GIVE”
 Negative means to “TAKE AWAY”

10
TYPES OF REINFORCERS
Reuters/ Corbis
11
PUNISHMENT
12
Operant Conditioning
Positive Reinforcement
Positive Punishment
*“Give something good”
*“Give bad”
Cookies for A’s
Spanking for lying
Negative Reinforcement
*“Take away something bad”
Negative Punishment
*“Take away good”
Take away a lap of running for
good practice
Take away cell phone for F’s
13
PUNISHMENT PROBLEMS
Although there may be some justification for occasional
punishment, it usually leads to negative effects.
1.
2.
3.
4.
5.
6.
Results in unwanted fears.
Conveys no information to the organism.
Justifies pain to others.
Causes unwanted behaviors to reappear in its
absence.
Causes aggression towards the agent.
Causes one unwanted behavior to appear in
place of another.
14
WARMUP!
1 – Positive Reinforcement
 2 – Negative Reinforcement
 3 – Positive Punishment
 4 – Negative Punishment

PRIMARY & SECONDARY REINFORCERS
1.
2.
Primary Reinforcer: An innately reinforcing
stimulus like food or drink.
Conditioned Reinforcer: (Secondary Reinforcer)
A learned reinforcer that gets its reinforcing power
through association with the primary reinforcer.
16
IMMEDIATE & DELAYED REINFORCERS
1.
Immediate Reinforcer: A reinforcer that occurs
instantly after a behavior.
*A rat gets a food pellet for a bar press.
2.
Delayed Reinforcer: A reinforcer that is
delayed in time for a certain behavior.
* A paycheck that comes at the end of a week.
We may be inclined to engage in small immediate
reinforcers (watching TV) rather than large delayed
reinforcers (getting an A in a course) which require
consistent study.
17
REINFORCEMENT SCHEDULES
1. Continuous Reinforcement: Reinforces the desired
response each time it occurs.

Learning occurs quickly
2. Partial (Intermittent): Reinforces a response only
part of the time.

Though this results in slower acquisition in the beginning,
it shows greater resistance to extinction later on.
18
PARTIAL REINFORCEMENT SCHEDULES
Fixed-ratio
Variable-ratio
Fixed-interval
Variable-interval
RATIO SCHEDULES
Fixed-ratio schedule: Reinforces a response only after a
specified number of responses. (paid on commission)
1.

Positive - Will continue working again right when they finish
Variable-ratio schedule: Reinforces a response after an
unpredictable number of responses.
2.

Positive - This is hard to extinguish because of the unpredictability.
20
INTERVAL SCHEDULES
1.
Fixed-interval schedule: Reinforces a response only after a
specified time has elapsed.

2.
Negative: This produces a choppy start-stop pattern
Variable-interval schedule: Reinforces a response at
unpredictable time intervals

Positive/Negative: Produces slow, steady responses
21
SCHEDULES OF REINFORCEMENT
22
WARMUP
1. Give an example of positive punishment.
2. Give an example of negative reinforcement.
3. What is the difference between fixed and variable?
4. What is the difference between ratio and interval?
5. Give an example of a fixed ratio reinforcement schedule.
6. Give an example of a variable interval reinforcement
schedule.
COGNITION & OPERANT CONDITIONING
 Can
we learn without rewards?
 How do we know?
 Latent
learning
 Cognitive maps
24
MOTIVATION
 Intrinsic
Motivation: The
desire to perform a behavior
for its own sake.
 Extrinsic
Motivation: The
desire to perform a behavior
due to promised rewards or
threats of punishments.
25
OPERANT CONDITIONING IN OTHER FIELDS
OPERANT VS. CLASSICAL CONDITIONING
27
LEARNING TO COPE

Problem-focused coping


Alleviating stress directly – dealing with the problem
Emotion-focused coping

Alleviating stress by attending to emotional needs
LOCUS OF CONTROL
External Locus of Control

Perception that
outside forces control
your fate
“My teacher
sucks. I’m gonna
fail this class”
Internal Locus of Control

Perception that you
control your fate
“I gotta work
harder. I didn’t
study for that.”
“I CAN’T HELP IT”
Is this something that can be
learned?
 Martin Seligman

Dogs strapped in harness
 Given repeated shocks
 Could not avoid them

What did they learn?
 They are helpless

30
LEARNED HELPLESSNESS
Later, dogs were placed in a situation where they could
escape the shocks
 The dogs cowered and did not escape shocks
 What does this tell us?

31
LEARNED HELPLESSNESS

The hopelessness and passive resignation learned
when unable to avoid repeated aversive events