Skip to main content

Finite-time analysis for the knowledge-gradient policy

Author(s): Wang, Y; Powell, William B

Download
To refer to this page use: http://arks.princeton.edu/ark:/88435/pr10c4j
Full metadata record
DC FieldValueLanguage
dc.contributor.authorWang, Y-
dc.contributor.authorPowell, William B-
dc.date.accessioned2021-10-11T14:17:49Z-
dc.date.available2021-10-11T14:17:49Z-
dc.date.issued2018-01-01en_US
dc.identifier.citationWang, Y, Powell, WB. (2018). Finite-time analysis for the knowledge-gradient policy. SIAM Journal on Control and Optimization, 56 (2), 1105 - 1129. doi:10.1137/16M1073388en_US
dc.identifier.issn0363-0129-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/pr10c4j-
dc.description.abstract© 2018 Society for Industrial and Applied Mathematics. We consider sequential decision problems in which we adaptively choose one of finitely many alternatives and observe a stochastic reward. We offer a new perspective on interpreting Bayesian ranking and selection problems as adaptive stochastic multiset maximization problems and derive the first finite-time bound of the knowledge-gradient policy for adaptive submodular objective functions. In addition, we introduce the concept of prior-optimality and provide another insight into the performance of the knowledge-gradient policy based on the submodular assumption on the value of information. We demonstrate submodularity for the two-alternative case and provide other conditions for more general problems, bringing out the issue and importance of submodularity in learning problems. Empirical experiments are conducted to further illustrate the finite-time behavior of the knowledge-gradient policy.en_US
dc.format.extent1105 - 1129en_US
dc.language.isoen_USen_US
dc.relation.ispartofSIAM Journal on Control and Optimizationen_US
dc.rightsAuthor's manuscripten_US
dc.titleFinite-time analysis for the knowledge-gradient policyen_US
dc.typeJournal Articleen_US
dc.identifier.doidoi:10.1137/16M1073388-
pu.type.symplectichttp://www.symplectic.co.uk/publications/atom-terms/1.0/journal-articleen_US

Files in This Item:
File Description SizeFormat 
Finite-time analysis for the knowledge-gradient policy.pdf402.74 kBAdobe PDFView/Download


Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.