The successor representation in human reinforcement learning

Momennejad, Ida; Russek, Evan; Cheong, Jin; Botvinick, Matthew; Daw, Nathaniel; Gershman, Samuel

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1tn3f

Full metadata record

DC Field	Value	Language
dc.contributor.author	Momennejad, Ida	-
dc.contributor.author	Russek, Evan	-
dc.contributor.author	Cheong, Jin	-
dc.contributor.author	Botvinick, Matthew	-
dc.contributor.author	Daw, Nathaniel	-
dc.contributor.author	Gershman, Samuel	-
dc.date.accessioned	2020-02-19T21:59:10Z	-
dc.date.available	2020-02-19T21:59:10Z	-
dc.date.issued	2016-10-27	en_US
dc.identifier.citation	Momennejad, Ida, Russek, Evan, Cheong, Jin, Botvinick, Matthew, Daw, Nathaniel, Gershman, Samuel. (2016). The successor representation in human reinforcement learning. 10.1101/083824	en_US
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/pr1tn3f	-
dc.description.abstract	Theories of reward learning in neuroscience have focused on two families of algorithms, thought to capture deliberative vs. habitual choice. Model-based algorithms compute the value of candidate actions from scratch, whereas model-free algorithms make choice more efficient but less flexible by storing pre-computed action values. We examine an intermediate algorithmic family, the successor representation (SR), which balances flexibility and efficiency by storing partially computed action values: predictions about future events. These pre-computation strategies differ in how they update their choices following changes in a task. SR's reliance on stored predictions about future states predicts a unique signature of insensitivity to changes in the task's sequence of events, but flexible adjustment following changes to rewards. We provide evidence for such differential sensitivity in two behavioral studies with humans. These results suggest that the SR is a computational substrate for semi-flexible choice in humans, introducing a subtler, more cognitive notion of habit.	en_US
dc.format.extent	680-692	en_US
dc.language.iso	en_US	en_US
dc.relation.ispartof	Nature Human Behaviour	en_US
dc.rights	Author's manuscript	en_US
dc.title	The successor representation in human reinforcement learning	en_US
dc.type	Journal Article	en_US
dc.identifier.doi	doi:10.1101/083824	-
pu.type.symplectic	http://www.symplectic.co.uk/publications/atom-terms/1.0/journal-article	en_US
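
The abstract's contrast between reward changes and changes to the sequence of events can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the manuscript: the six-state toy environment, the discount factor, the reward values, the helper names `make_T` and `successor_matrix`, and the closed form M = (I - gamma*T)^-1 for a fixed transition matrix T. Values are read out as V = M r, so a new reward vector takes effect immediately, while a cached M keeps predicting the old sequence of states.

```python
import numpy as np

def make_T(edges, n):
    """Build a deterministic transition matrix from (state, next_state) pairs."""
    T = np.zeros((n, n))
    for s, s_next in edges:
        T[s, s_next] = 1.0
    return T

def successor_matrix(T, gamma=0.9):
    """Closed-form SR for a fixed T: expected discounted future state occupancies."""
    n = T.shape[0]
    return np.linalg.inv(np.eye(n) - gamma * T)

# Hypothetical toy task: start states 0 and 1, middle states 2 and 3,
# terminal states 4 and 5 (terminal rows of T are all zero).
N = 6
T_old = make_T([(0, 2), (1, 3), (2, 4), (3, 5)], N)
r_old = np.array([0.0, 0.0, 0.0, 0.0, 10.0, 1.0])

M_cached = successor_matrix(T_old)      # stored predictions about future states
v_start = (M_cached @ r_old)[:2]        # values of the two start states

# Reward change: the terminal rewards swap. The cached SR is reused as-is
# and the start-state preference flips immediately.
r_new = np.array([0.0, 0.0, 0.0, 0.0, 1.0, 10.0])
v_reward_reval = (M_cached @ r_new)[:2]

# Transition change: the middle states swap their successors, rewards unchanged.
T_new = make_T([(0, 2), (1, 3), (2, 5), (3, 4)], N)
v_stale = (M_cached @ r_old)[:2]                    # cached SR: old preference persists
v_model_based = (successor_matrix(T_new) @ r_old)[:2]  # recomputed from the new model

print("initial:            ", v_start)
print("reward change (SR): ", v_reward_reval)
print("transition (SR):    ", v_stale)
print("transition (MB):    ", v_model_based)
```

The sketch freezes the whole matrix M after the transition change, which is a simplification; an SR learner that updates its predictions from new experience would correct them gradually rather than never.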

Files in This Item:

File	Description	Size	Format
083824.full.pdf		5.25 MB	Adobe PDF
