The Successor Representation and Temporal Context
Author(s): Gershman, Samuel J.; Moore, Christopher D.; Todd, Michael T.; Norman, Kenneth A.; Sederberg, Per B.
DownloadTo refer to this page use:
http://arks.princeton.edu/ark:/88435/pr1qf2q
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Gershman, Samuel J. | - |
dc.contributor.author | Moore, Christopher D. | - |
dc.contributor.author | Todd, Michael T. | - |
dc.contributor.author | Norman, Kenneth A. | - |
dc.contributor.author | Sederberg, Per B. | - |
dc.date.accessioned | 2019-10-28T15:53:46Z | - |
dc.date.available | 2019-10-28T15:53:46Z | - |
dc.date.issued | 2012-06 | en_US |
dc.identifier.citation | Gershman, Samuel J, Moore, Christopher D, Todd, Michael T, Norman, Kenneth A, Sederberg, Per B. (2012). The Successor Representation and Temporal Context. Neural Computation, 24 (6), 1553 - 1568. doi:10.1162/NECO_a_00282 | en_US |
dc.identifier.issn | 0899-7667 | - |
dc.identifier.uri | http://arks.princeton.edu/ark:/88435/pr1qf2q | - |
dc.description.abstract | The successor representation was introduced into reinforcement learning by Dayan (1993) as a means of facilitating generalization between states with similar successors. Although reinforcement learning in general has been used extensively as a model of psychological and neural processes, the psychological validity of the successor representation has yet to be explored. An interesting possibility is that the successor representation can be used not only for reinforcement learning but for episodic learning as well. Our main contribution is to show that a variant of the temporal context model (TCM; Howard & Kahana, 2002), an influential model of episodic memory, can be understood as directly estimating the successor representation using the temporal difference learning algorithm (Sutton & Barto, 1998). This insight leads to a generalization of TCM and new experimental predictions. In addition to casting a new normative light on TCM, this equivalence suggests a previously unexplored point of contact between different learning systems. | en_US |
dc.format.extent | 1553 - 1568 | en_US |
dc.language.iso | en_US | en_US |
dc.relation.ispartof | Neural Computation | en_US |
dc.rights | Final published version. Article is made available in OAR by the publisher's permission or policy. | en_US |
dc.title | The Successor Representation and Temporal Context | en_US |
dc.type | Journal Article | en_US |
dc.identifier.doi | doi:10.1162/NECO_a_00282 | - |
dc.identifier.eissn | 1530-888X | - |
pu.type.symplectic | http://www.symplectic.co.uk/publications/atom-terms/1.0/journal-article | en_US |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
neco_a_00282.pdf | 235.61 kB | Adobe PDF | View/Download |
Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.