Offline replay supports planning in human reinforcement learning.

Momennejad, Ida; Otto, A Ross.; Daw, Nathaniel D.; Norman, Kenneth A.

Offline replay supports planning in human reinforcement learning.

Author(s): Momennejad, Ida; Otto, A Ross.; Daw, Nathaniel D.; Norman, Kenneth A.

Download

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1gf29

Full metadata record

DC Field	Value	Language
dc.contributor.author	Momennejad, Ida	-
dc.contributor.author	Otto, A Ross.	-
dc.contributor.author	Daw, Nathaniel D.	-
dc.contributor.author	Norman, Kenneth A.	-
dc.date.accessioned	2019-10-28T15:54:28Z	-
dc.date.available	2019-10-28T15:54:28Z	-
dc.date.issued	2018-12-14	en_US
dc.identifier.citation	Momennejad, Ida, Otto, A Ross, Daw, Nathaniel D, Norman, Kenneth A. (2018). Offline replay supports planning in human reinforcement learning.. eLife, 7 (10.7554/eLife.32548)	en_US
dc.identifier.issn	2050-084X	-
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/pr1gf29	-
dc.description.abstract	Making decisions in sequentially structured tasks requires integrating distally acquired information. The extensive computational cost of such integration challenges planning methods that integrate online, at decision time. Furthermore, it remains unclear whether ‘offline’ integration during replay supports planning, and if so which memories should be replayed. Inspired by machine learning, we propose that (a) offline replay of trajectories facilitates integrating representations that guide decisions, and (b) unsigned prediction errors (uncertainty) trigger such integrative replay. We designed a 2-step revaluation task for fMRI, whereby participants needed to integrate changes in rewards with past knowledge to optimally replan decisions. As predicted, we found that (a) multi-voxel pattern evidence for off-task replay predicts subsequent replanning; (b) neural sensitivity to uncertainty predicts subsequent replay and replanning; (c) off-task hippocampus and anterior cingulate activity increase when revaluation is required. These findings elucidate how the brain leverages offline mechanisms in planning and goal-directed behavior under uncertainty.	en_US
dc.language	eng	en_US
dc.language.iso	en_US	en_US
dc.relation.ispartof	eLife	en_US
dc.rights	Final published version. Article is made available in OAR by the publisher's permission or policy.	en_US
dc.title	Offline replay supports planning in human reinforcement learning.	en_US
dc.type	Journal Article	en_US
dc.identifier.doi	doi:10.7554/eLife.32548	-
dc.identifier.eissn	2050-084X	-
pu.type.symplectic	http://www.symplectic.co.uk/publications/atom-terms/1.0/journal-article	en_US

Files in This Item:

File	Description	Size	Format
elife-32548.pdf		2.43 MB	Adobe PDF	View/Download

Show Simple Item Record