Skip to main content

The successor representation in human reinforcement learning

Author(s): Momennejad, Ida; Russek, Evan; Cheong, Jin; Botvinick, Matthew; Daw, Nathaniel; et al

Download
To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1tn3f
Full metadata record
DC FieldValueLanguage
dc.contributor.authorMomennejad, Ida-
dc.contributor.authorRussek, Evan-
dc.contributor.authorCheong, Jin-
dc.contributor.authorBotvinick, Matthew-
dc.contributor.authorDaw, Nathaniel-
dc.contributor.authorGershman, Samuel-
dc.date.accessioned2020-02-19T21:59:10Z-
dc.date.available2020-02-19T21:59:10Z-
dc.date.issued2016-10-27en_US
dc.identifier.citationMomennejad, Ida, Russek, Evan, Cheong, Jin, Botvinick, Matthew, Daw, Nathaniel, Gershman, Samuel. (2016). The successor representation in human reinforcement learning. 10.1101/083824en_US
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/pr1tn3f-
dc.description.abstractTheories of reward learning in neuroscience have focused on two families of algorithms, thought to capture deliberative vs. habitual choice. Model-based algorithms compute the value of candidate actions from scratch, whereas model-free algorithms make choice more efficient but less flexible by storing pre-computed action values. We examine an intermediate algorithmic family, the successor representation (SR), which balances flexibility and efficiency by storing partially computed action values: predictions about future events. These pre-computation strategies differ in how they update their choices following changes in a task. SR's reliance on stored predictions about future states predicts a unique signature of insensitivity to changes in the task's sequence of events, but flexible adjustment following changes to rewards. We provide evidence for such differential sensitivity in two behavioral studies with humans. These results suggest that the SR is a computational substrate for semi-flexible choice in humans, introducing a subtler, more cognitive notion of habit.en_US
dc.format.extent680-692en_US
dc.language.isoen_USen_US
dc.relation.ispartofNature Human Behaviouren_US
dc.rightsAuthor's manuscripten_US
dc.titleThe successor representation in human reinforcement learningen_US
dc.typeJournal Articleen_US
dc.identifier.doidoi:10.1101/083824-
pu.type.symplectichttp://www.symplectic.co.uk/publications/atom-terms/1.0/journal-articleen_US

Files in This Item:
File Description SizeFormat 
083824.full.pdf5.25 MBAdobe PDFView/Download


Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.