Value representations in the rodent orbitofrontal cortex drive learning, not choice

  1. Kevin J Miller  Is a corresponding author
  2. Matthew M Botvinick  Is a corresponding author
  3. Carlos D Brody  Is a corresponding author
  1. DeepMind, United Kingdom
  2. Princeton University, United States

Abstract

Humans and animals make predictions about the rewards they expect to receive in different situations. In formal models of behavior, these predictions are known as value representations, and they play two very different roles. Firstly, they drive choice: the expected values of available options are compared to one another, and the best option is selected. Secondly, they support learning: expected values are compared to rewards actually received, and future expectations are updated accordingly. Whether these different functions are mediated by different neural representations remains an open question. Here we employ a recently-developed multi-step task for rats that computationally separates learning from choosing. We investigate the role of value representations in the rodent orbitofrontal cortex, a key structure for value-based cognition. Electrophysiological recordings and optogenetic perturbations indicate that these representations do not directly drive choice. Instead, they signal expected reward information to a learning process elsewhere in the brain that updates choice mechanisms.

Data availability

Data collected for the purpose of this paper will be posted on Figshare upon acceptance. Software used to analyze the data will be made available as a Github release. Software used for training rats and design files for constructing behavioral rigs are available on the Brody lab website.

The following data sets were generated

Article and author information

Author details

  1. Kevin J Miller

    DeepMind, London, United Kingdom
    For correspondence
    kevinjmiller@deepmind.com
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-3465-2512
  2. Matthew M Botvinick

    DeepMind, London, United Kingdom
    For correspondence
    botvinick@deepmind.com
    Competing interests
    The authors declare that no competing interests exist.
  3. Carlos D Brody

    Princeton Neuroscience Institute, Princeton University, Princeton, United States
    For correspondence
    brody@princeton.edu
    Competing interests
    The authors declare that no competing interests exist.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-4201-561X

Funding

National Institutes of Health (T-32 MH065214)

  • Kevin J Miller
  • Matthew M Botvinick
  • Carlos D Brody

Princeton University (Harold W Dodds Fellowship)

  • Kevin J Miller

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Ethics

Animal experimentation: All experimental procedures were performed in strict accordance with the recommendations in the Guide for the Care and Use of Laboratory Animals of the National Institutes of Health., and were approved by the Princeton University Institutional Animal Care and Use Committee (protocol #1853)

Copyright

© 2022, Miller et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 4,774
    views
  • 922
    downloads
  • 34
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Kevin J Miller
  2. Matthew M Botvinick
  3. Carlos D Brody
(2022)
Value representations in the rodent orbitofrontal cortex drive learning, not choice
eLife 11:e64575.
https://doi.org/10.7554/eLife.64575

Share this article

https://doi.org/10.7554/eLife.64575

Further reading

    1. Neuroscience
    Jakob Rupert, Dragomir Milovanovic
    Insight

    By influencing calcium homeostasis, local protein synthesis and the endoplasmic reticulum, a small protein called Rab10 emerges as a crucial cytoplasmic regulator of neuropeptide secretion.

    1. Neuroscience
    Yi-Yun Ho, Qiuwei Yang ... Melissa R Warden
    Research Article

    The infralimbic cortex (IL) is essential for flexible behavioral responses to threatening environmental events. Reactive behaviors such as freezing or flight are adaptive in some contexts, but in others a strategic avoidance behavior may be more advantageous. IL has been implicated in avoidance, but the contribution of distinct IL neural subtypes with differing molecular identities and wiring patterns is poorly understood. Here, we study IL parvalbumin (PV) interneurons in mice as they engage in active avoidance behavior, a behavior in which mice must suppress freezing in order to move to safety. We find that activity in inhibitory PV neurons increases during movement to avoid the shock in this behavioral paradigm, and that PV activity during movement emerges after mice have experienced a single shock, prior to learning avoidance. PV neural activity does not change during movement toward cued rewards or during general locomotion in the open field, behavioral paradigms where freezing does not need to be suppressed to enable movement. Optogenetic suppression of PV neurons increases the duration of freezing and delays the onset of avoidance behavior, but does not affect movement toward rewards or general locomotion. These data provide evidence that IL PV neurons support strategic avoidance behavior by suppressing freezing.