Finite-time analysis for the knowledge-gradient policy
Author(s): Wang, Y; Powell, William B
DownloadTo refer to this page use:
http://arks.princeton.edu/ark:/88435/pr10c4j
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Wang, Y | - |
dc.contributor.author | Powell, William B | - |
dc.date.accessioned | 2021-10-11T14:17:49Z | - |
dc.date.available | 2021-10-11T14:17:49Z | - |
dc.date.issued | 2018-01-01 | en_US |
dc.identifier.citation | Wang, Y, Powell, WB. (2018). Finite-time analysis for the knowledge-gradient policy. SIAM Journal on Control and Optimization, 56 (2), 1105 - 1129. doi:10.1137/16M1073388 | en_US |
dc.identifier.issn | 0363-0129 | - |
dc.identifier.uri | http://arks.princeton.edu/ark:/88435/pr10c4j | - |
dc.description.abstract | © 2018 Society for Industrial and Applied Mathematics. We consider sequential decision problems in which we adaptively choose one of finitely many alternatives and observe a stochastic reward. We offer a new perspective on interpreting Bayesian ranking and selection problems as adaptive stochastic multiset maximization problems and derive the first finite-time bound of the knowledge-gradient policy for adaptive submodular objective functions. In addition, we introduce the concept of prior-optimality and provide another insight into the performance of the knowledge-gradient policy based on the submodular assumption on the value of information. We demonstrate submodularity for the two-alternative case and provide other conditions for more general problems, bringing out the issue and importance of submodularity in learning problems. Empirical experiments are conducted to further illustrate the finite-time behavior of the knowledge-gradient policy. | en_US |
dc.format.extent | 1105 - 1129 | en_US |
dc.language.iso | en_US | en_US |
dc.relation.ispartof | SIAM Journal on Control and Optimization | en_US |
dc.rights | Author's manuscript | en_US |
dc.title | Finite-time analysis for the knowledge-gradient policy | en_US |
dc.type | Journal Article | en_US |
dc.identifier.doi | doi:10.1137/16M1073388 | - |
pu.type.symplectic | http://www.symplectic.co.uk/publications/atom-terms/1.0/journal-article | en_US |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Finite-time analysis for the knowledge-gradient policy.pdf | 402.74 kB | Adobe PDF | View/Download |
Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.