Optimal Behavioral Hierarchy

Solway, Alec; Diuk, Carlos; Córdova, Natalia; Yee, Debbie; Barto, Andrew G.; Niv, Yael; Botvinick, Matthew M.

Optimal Behavioral Hierarchy

Author(s): Solway, Alec; Diuk, Carlos; Córdova, Natalia; Yee, Debbie; Barto, Andrew G.; et al

Download

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr14t8h

Full metadata record

DC Field	Value	Language
dc.contributor.author	Solway, Alec	-
dc.contributor.author	Diuk, Carlos	-
dc.contributor.author	Córdova, Natalia	-
dc.contributor.author	Yee, Debbie	-
dc.contributor.author	Barto, Andrew G.	-
dc.contributor.author	Niv, Yael	-
dc.contributor.author	Botvinick, Matthew M.	-
dc.date.accessioned	2019-10-28T15:54:51Z	-
dc.date.available	2019-10-28T15:54:51Z	-
dc.date.issued	2014-08-14	en_US
dc.identifier.citation	Solway, Alec, Diuk, Carlos, Córdova, Natalia, Yee, Debbie, Barto, Andrew G, Niv, Yael, Botvinick, Matthew M. (2014). Optimal Behavioral Hierarchy. PLoS Computational Biology, 10 (8), e1003779 - e1003779. doi:10.1371/journal.pcbi.1003779	en_US
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/pr14t8h	-
dc.description.abstract	In reinforcement learning, a decision maker searching for the most rewarding option is often faced with the question: what is the value of an option that has never been tried before? One way to frame this question is as an inductive problem: how can I generalize my previous experience with one set of options to a novel option? We show how hierarchical Bayesian inference can be used to solve this problem, and describe an equivalence between the Bayesian model and temporal difference learning algorithms that have been proposed as models of reinforcement learning in humans and animals. According to our view, the search for the best option is guided by abstract knowledge about the relationships between different options in an environment, resulting in greater search efficiency compared to traditional reinforcement learning algorithms previously applied to human cognition. In two behavioral experiments, we test several predictions of our model, providing evidence that humans learn and exploit structured inductive knowledge to make predictions about novel options. In light of this model, we suggest a new interpretation of dopaminergic responses to novelty.	en_US
dc.language.iso	en_US	en_US
dc.relation.ispartof	PLoS Computational Biology	en_US
dc.rights	Final published version. This is an open access article.	en_US
dc.title	Optimal Behavioral Hierarchy	en_US
dc.type	Journal Article	en_US
dc.identifier.doi	doi:10.1371/journal.pcbi.1003779	-
dc.date.eissued	2014-08-14	en_US
dc.identifier.eissn	1553-7358	-
pu.type.symplectic	http://www.symplectic.co.uk/publications/atom-terms/1.0/journal-article	en_US

Files in This Item:

File	Description	Size	Format
journal.pcbi.1003779.PDF		553.73 kB	Adobe PDF	View/Download

Show Simple Item Record