Transcript Chapter 6
Chapter 6:
Operant Conditioning:
Introduction
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Thorndike
• The first experimental studies of operant conditioning
were undertaken by Edwin L. Thorndike in the 1890s.
• He believed that the intellectual ability of animals could
be properly assessed only through systematic
investigation.
• His most famous experiments involved hungry cats that
were enclosed in a puzzle box, while a dish of food was
placed outside.
• To reach the food, the cat had to learn how to escape
from the box.
• The response that worked was strengthened, while the
responses that did not work were weakened.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Law of Effect
• Behaviors leading to a satisfying state of
affairs are strengthened or “stamped in.”
• Behaviors leading to an unsatisfying or
annoying state of affairs are weakened or
“stamped out.”
• The consequences of a behavior
determine whether that behavior will be
repeated.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Skinner
• Skinner believed there were two types of behaviors,
reflexive-type behaviors and operant behaviors.
• Skinner devised his own procedure for the study of
behavior, the “Skinner Box.”
• The rat is able to earn food pellets by pressing a response
lever or bar.
• The experimenter controls the contingencies within the
operant chamber, but the animal is free to respond at any
time.
• Rate of behavior is controlled by the conditions in the box.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Operant Conditioning
• a type of learning in which the future probability
of a behavior is affected by its consequences.
• Behaviors that are influenced by their
consequences are called operant behaviors.
• The response operates on the environment to
produce a consequence.
• Behaviors that lead to favorable outcomes are
more likely to be repeated than those that do not
lead to favorable outcomes.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Operant Conditioning Process
1. a discriminative stimulus that occurs
before the behavior and signals that a
certain consequence is now available
2. a response that produces a certain
consequence
3. the consequence that serves to either
increase or decrease the probability of
the preceding behavior
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Example of the Operant Process
1. a tone signals that a lever press will now
produce food
2. lever pressing produces a food pellet
3. the consequence of a food pellet
increases the rat’s tendency to again
press the lever
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Operant Behavior
• a class of emitted responses that result in
certain consequences.
• These consequences affect the future probability
or strength of those responses.
• They are not elicited by stimuli like in classical
conditioning.
• Rather operant behaviors are emitted by the
organism.
• They are more flexible and voluntary behaviors.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Operant Behavior Examples
Lever press Food pellet
The effect: The future probability of lever pressing
increases.
Tell a joke Person frowns
The effect: The future probability of telling a joke
decreases.
Operant behaviors are classes of responses.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Do operant behaviors occur
because we choose to emit them?
• Not necessarily.
• The behavior comes to be controlled by
the contingencies of reinforcement and
punishment that follow the behavior.
• There is sometimes only an illusion of
voluntariness.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Reinforcers
• Consequences of behavior that:
1. follow a behavior, and
2. as a result, the future probability of that
behavior increases.
• Example:
Lever press Food pellet
R
SR
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Punishers
• Consequences of behavior that:
1.follow a behavior, and
2.as a result, the future probability of that
behavior decreases.
• Example:
Tell a joke Person frowns
R
SP
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Operant Conditioning Perspective
• It is the behavior that has been reinforced or
punished.
• It is NOT the person or animal that has been
reinforced or punished.
• Only the behavior increases or decreases in
frequency.
• Consequences are defined entirely by their
effect on behavior.
• Reinforcers increase the frequency of behavior,
while punishers decrease the frequency of
behavior.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Reinforcement & Punishment
• the process or procedure by which a certain
consequence changes the strength of a
behavior.
• Example:
– reinforcement = the use of food to increase the
strength of lever pressing
– reinforcer = the food itself
– punishment = the process of frowning to encourage
Jonathan to stop telling jokes
– punisher = the frown itself
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Means of Weakening Behaviors
• Punishment
• Withdrawal of reinforcement – extinction
• Example:
– Suppose a child has learned to whine for
candy in the supermarket.
– When behaving that way no longer results in
candy, the child will cease whining.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Discriminative Stimuli
• a stimulus in the presence of which responses
are either reinforced or punished.
• Example:
Susan: Jonathan tells her a joke She laughs
SD
R
SR
Police car: You speed You receive a ticket
SD
R
SP
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Discriminative Stimulus
for Extinction
• The stimulus signals the nonavailability of a
previously available reinforcer.
• Example:
– Lever pressing is followed by food only when a tone is
sounding and NOT when a buzzer is sounding.
Tone: Lever press Food pellet
SD
R
SR
Buzzer: Lever press No food
SD
R
--Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Three-Term Contingency (ABC)
• an Antecedent event
• a Behavior
• a Consequence
Antecedent
Susan:
SD
Tone:
SD
Behavior
Tell her a joke
R
Lever press
R
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Consequence
She laughs
SR
Food pellet
SR
Easy Ways to Remember
• ABC
• Notice something, do something, get
something
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Operant & Classical Conditioning
• A stimulus can act as both a SD and a CS.
• Example:
Tone: Lever press Food
SD R
SR
Tone: Food Salivation
NS
US
UR
Tone Salivation
CS
CR
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Determining Type of Contingency
1. Does the consequence consist of
something being presented or
withdrawn?
2. Does the consequence serve to
strengthen or weaken the behavior?
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Negative vs. Positive
• Negative means that the behavior has
resulted in something being removed or
subtracted.
• Positive means that the behavior has
resulted in something being presented or
added.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Reinfocer vs. Punisher
• Reinforcer means the behavior has
strengthened as a result of the
consequence.
• Punisher means the behavior has
weakened as a result of the consequence.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Positive Reinforcement
• the presentation of a stimulus following a
response, which then leads to an increase in the
future strength of that response.
• Examples:
Smile at person The person smiles at you
R
SR
Study diligently for quiz Obtain an excellent mark
R
SR
Compliment partner Receive a kiss
R
SR
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Negative Reinforcement
• the removal of a stimulus following a response,
which then leads to an increase in the future
strength of that response.
• Examples:
Claim illness Avoid writing an exam
R
SR
Take aspirin Eliminate headache
R
SR
Turn on the heater Escape the cold
R
SR
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Heater Example
• Does the person turn on the heater
– to escape the cold (negative reinforcement) or
– to obtain warmth (positive reinforcement)?
• Either interpretation would be correct.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Escape Behavior
• results in the termination (stopping) of an
aversive stimulus.
• Examples:
– Opening an umbrella when it rains
– Turning on the heater when it is cold
– Taking an aspirin when you have a headache
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Avoidance Behavior
• occurs before the aversive stimulus is
presented and therefore prevents its
delivery.
• Examples:
– pretending to be ill to avoids having to write
an exam
– staying inside to avoid the rain
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Positive Punishment
• the presentation of a stimulus following a
response, which then leads to a decrease in the
future strength of that response.
• Examples:
Talk back to the boss Get reprimanded
R
SP
Swat at the wasp Get stung
R
SP
Meow constantly Get sprayed with water
R
SP
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Negative Punishment
• the removal of a stimulus following a response,
which then leads to a decrease in the future
strength of that response.
• Examples:
Stay out past curfew Lose car privileges
R
SP
Argue with boss Lose job
R
SP
Play with food Lose dessert
R
SP
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Punishment in Personal
Relationships
• A reduction in one person’s behavior as a
result of punishment can negatively
reinforce the behavior of the person who
implemented the punishment.
• Example:
GF ignores BF BF stops talking to women
R
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
SR
Punishment in Personal
Relationships, continued
• Punishment is often successful in
immediately getting a person to stop.
• That success then reinforces our tendency
to use punishment in the future.
• Punishing our partners more can create
major problems for our relationships in the
long run.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
The Key to a Great Relationship
• the ratio of positive (pleasant) interactions
to negative (aversive) interactions.
• The positives need to outweigh the
negatives by a ratio of about five to one
(Gottman, 1994).
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Immediate vs. Delayed
Reinforcement
• The more immediate the reinforcer, the
stronger its effect on the behavior.
• Example:
– Give a treat to a child who’s playing quietly
while she is still playing quietly to reinforce
quiet playing.
– The benefits of exercise and proper eating are
delayed and therefore weak.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Primary Reinforcers
• A primary reinforcer is an event that is
innately reinforcing.
• Examples:
– food, water, proper temperature, sexual
contact.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Secondary Reinforcers
• A secondary reinforcer is an event that is reinforcing
because it has been associated with some other
reinforcer.
• Examples:
– good marks, fine clothes, a nice car
• Conditioned stimuli that have been associated with
appetitive unconditioned stimuli (USs) can also function
as secondary reinforcers.
• Example:
– A metronome that has been associated with food can be a
secondary reinforcer for the operant response of lever pressing.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Generalized Reinforcer
• a type of secondary reinforcer that has been
associated with several other reinforcers.
• Example:
– Money is associated with food, clothing, furnishings,
entertainment, and even dates.
– Social attention is associated with food, play, and
comfort.
– A “token economy” is when tokens are given as
reinforcers. The tokens can be traded for other
rewards later.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Intrinsic Reinforcement
• reinforcement provided by the mere act of
performing the behavior.
• Examples:
– Rollerblading is invigorating.
– Attending parties is fun.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Extrinsic Reinforcement
• the reinforcement provided by some
consequence that is external to the
behavior
• Examples:
– You drive to get somewhere.
– You work for money.
– You date an attractive individual merely to
enhance your prestige.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
The distinction is not always clear.
• Is candy an intrinsic or extrinsic reinforcer?
• What happens if you are given an extrinsic
reinforcer for an activity that is already
intrinsically reinforcing?
• The research findings are mixed.
• A meta-analysis found that external rewards can
undermine intrinsic motivation when:
– the reward is expected,
– the reward is tangible, and
– the reward is given for simply performing the activity.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Natural Reinforcers
• reinforcers that are naturally provided for a
certain behavior.
• They are a typical consequence of the
behavior within that setting.
• Examples:
– Money is a natural consequence of selling
merchandise.
– Gold medals are a natural consequence of
hard training and a great performance.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Contrived Reinforcers
• reinforcers that have been deliberately
arranged to modify a behavior.
• They are not a typical consequence of the
behavior in that setting.
• Examples:
– Turning on the television is a contrived
reinforcer for studying.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Intrinsic/Extrinsic –
Natural/Contrived
• Intrinsic reinforcers are always natural
reinforcers.
• Extrinsic reinforcers can be either natural
or contrived.
• Some extrinsic reinforcers are part of the
typical contingencies in our environment,
while others have been artificially imposed
to modify a particular behavior.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
In Behavioral Therapy
• When contrived reinforcers are used, the
ultimate intention is to let the “natural
contingencies” eventually take over if at all
possible.
• Example:
– We initially use praise to increase the frequency with
which a child reads.
– Hopefully, the natural (and intrinsic) reinforcers
associated with reading will take over.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Which is more important?
• Natural contingencies tend to produce
more efficient behavior patterns than do
contrived contingencies.
• Example:
– Throwing a ball where you want it to go is
more important than throwing a ball and
receiving praise.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Shaping
• the gradual creation of new operant
behavior through reinforcement of
successive approximations to that
behavior.
• Example:
– Shaping a rat to press the lever
– Teaching a dog to catch a frisbee
– Teaching animals to perform tricks
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Our Shaping
• Most of our behaviors have, to some
extent, been learned or modified through
shaping.
• Example:
– Learning to eat with utensils
– Learning to dress ourselves
– Learning to write
– Learning to speak politely
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Summary
• Operant behaviors are controlled by their
consequences.
• Reinforcers strengthen a response.
• Punishers weaken a response.
• Positive means the presentation of
something.
• Negative means the removal of
something.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Summary, continued
• A discriminative stimulus signals when a
consequence is available.
• Reinforcers can be described as:
– immediate or delayed
– primary or secondary
– generalized
– intrinsic or extrinsic
– natural or contrived
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.
Summary, continued
• Shaping is the creation of novel behavior
through the reinforcement of gradual
approximations to that behavior.
• Effective shaping is often carried out with
the use of a secondary reinforcer.
Introduction to Learning and Behavior, 3e
by Russell A. Powell, Diane G. Symbaluk, and P. Lynne Honey
Copyright © 2009 Wadsworth Publishing, a division of Cengage Learning. All rights reserved.