Tori Collins - USD Biology

Download Report

Transcript Tori Collins - USD Biology

Wallis, JD
Helen Wills Neuroscience Institute
UC, Berkeley

Role of prefrontal cortex (PFC) in rewardguided choice behavior

2 Questions:
 Does PFC encode reward or behavioral sequelae of
reward?
 Is encoding specific to reward outcome or
reflective of abstract value signal?

Orbitofrontal cortex (OFC) is a key region in choice
behavior
 Has functions in
emotions and reward
• Thought to regulate planning
behavior associated with
sensitivity to reward and
punishment
Paul Wicks’ brain

Damage to the OFC leaves cognitive abilities
intact, but impairs ability to make decisions
 The cautionary tale of Elliot

OFC neurons encode expected rewards
 Neurons show response to delivery of juice
rewards predicted by a visual stimulus
 Useful for decision making

Reward encoding neurons are also found in
the dorsolateral PFC (DLPFC)
 Neurons showed a difference in firing rate
depending on large/small expected reward

However, are these neurons encoding the
value of the reward or a behavioral correlate
of the reward (tensed muscles, heightened
accuracy)?


Comparison of reward encoding in the DLPFC
and the OFC
2 monkeys choose between pictures
associated with small/large fruit juice rewards




Each picture associated with delivery of a
certain amount of juice
Subjects learn to maximize reward
Reward-picture contingencies reversed after
27 out of 30 successes
Most important neuronal activity after 2nd
picture appears

Activity was recorded simultaneously from
multiple electrodes in DLPFC and OFC
 167 DLPFC neurons
 134 OFC neurons





Spike density histograms from 2
representative OFC neurons
Activity recorded during predictive cue
One neuron showed higher firing rate when
the monkey expected 4 drops of juice
Another encoded the reward in parametric
fashion
Firing rate not affected by saccade
Figure 2



DLPFC neurons show responses related to
both reward and saccade
One neuron discriminated between different
amounts of juice only during right saccade
Another encoded reward in parametric
fashion (positive?) with a greater increase
during left saccade
Figure 3


2-way ANOVA on mean firing rate with factors
of Reward and Saccade
OFC:
 28% significant main effect of Reward
 19% significant interaction with Saccade

DLPFC
 13% significant main effect of Reward
 43% Reward-Saccade interaction

Some neurons in both areas had similar
properties:
 27% parametric increase with reward size
 15% parametric decrease with reward size
 59% encode specific reward

Sliding receiver operating characteristic (ROC)
analysis of the selectivity time-course
revealed differences in encoding of reward
between OFC and DLPFC
 Probability that an independent observer could
predict reward based on firing rate

Starting from 500ms prior to 2nd picture an
ROC curve was generated from 10ms
increments

Latency at which selectivity appeared was
computed as the point at which the curve
exceeded 0.6
 No difference between OFC (mean 426ms) and
DLPFC (mean 467ms)
▪ (t-test = 1, d.f. = 110, P > 0.1)

Selectivity rose more rapidly and peaked
earlier in OFC
 80 ms earlier on average

Short latency indicates OFC is encoding
reward’s value rather than correlated
behavioral/cognitive processes

Neurons sensitive to expected reward are
found in both the OFC and DLPFC

OFC neurons encoded only reward while
DLPFC neurons encoded reward and saccade

OFC neurons encoded reward earlier than
DLPFC

Therefore OFC is first prefrontal region
receiving reward information
 From basolateral amygdala
▪ Encodes immediate reward –Winstanley, CA et al
 From gustatory and olfactory cortices
Baxter, MG and Murray, EA The amygdala and reward

DLPFC is where reward value converges with
subjects actions: reward choice



OFC response indicates to the motor system
which action leads to largest reward
However, decision making needs to be more
complex
Reward value is determined by
 (Reward – Cost) x P of success
 To what extent does OFC encode these variables?

OFC may integrate all variables relevant to
decision making to derive an abstract value
signal
 Neuronal currency

Another study tested whether PFC neurons
were capable to responding to multiple
parameters

Monkeys were trained to choose between
pictures associated with particular rewards

Recordings from OFC, MPFC, and lateral PFC
taken simultaneously

3 variables were manipulated:
 Probability: Some pictures predict fixed amount of
juice on only a certain proportion of trials
 Reward: Some pictures were associated with
varying amounts of juice
 Effort: Monkey had to earn fixed amount of juice
by pressing a lever multiple times

1/3 of PFC neurons responded parametrically
to just 1 parameter
 Found in all 3 areas

Some neurons responded to a combination of
parameters
 Progressive increase from LPFC to OFC to MPFC
 16% -> 27% -> 48%

OFC and MPFC combine multiple variables in
order to make a decision by deriving abstract
value signals
 Too difficult to make direct comparisons between
all possible choices
 Each choice can be valued against a common
reference scale (currency)
 Example: how many bananas is your car worth?

Abstract value signals allow flexibility and
novelty
 Simplifies the task of the motor system
 Allows instantaneous choice

Patients with OFC and MPFC damage show
unusual patterns of decision making
 A>B,
B > C but A < C


Damage to the OFC impairs decision making
while leaving other cognitive abilities intact
OFC is implicated in reward info processing
 Must differentiate between reward and behavioral
sequelae of reward

Short latency of neuronal reward-related
responses indicates encoding of reward’s
value in OFC



In contrast, DLPFC encodes reward info in
relation to behavioral responses
PFC neurons also encode other variables
related to decision making, including
probability of success and effort required
OFC and MPFC neurons are responsible for
integrating these variables to derive an
abstract value signal