
A lasso-based sparse knowledge gradient policy for sequential optimal learning

Author(s): Li, Yan; Liu, Han; Powell, Warren B.

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1wv3d
Full metadata record
DC Field | Value | Language
dc.contributor.author | Li, Yan | -
dc.contributor.author | Liu, Han | -
dc.contributor.author | Powell, Warren B. | -
dc.date.accessioned | 2020-03-30T19:08:22Z | -
dc.date.available | 2020-03-30T19:08:22Z | -
dc.date.issued | 2016-01-01 | en_US
dc.identifier.citation | Li, Y., Liu, H., & Powell, W. B. (2016). A lasso-based sparse knowledge gradient policy for sequential optimal learning. Proceedings of the 19th International Conference on Artificial Intelligence and Statistics (AISTATS 2016), 417-425. Retrieved from http://proceedings.mlr.press/v51/li16a.html | en_US
dc.identifier.uri | http://proceedings.mlr.press/v51/li16a.html | -
dc.identifier.uri | http://arks.princeton.edu/ark:/88435/pr1wv3d | -
dc.description.abstract | Copyright 2016 by the authors. We propose a sequential learning policy for noisy discrete global optimization and ranking and selection (R&S) problems with high-dimensional sparse belief functions, where there are hundreds or even thousands of features but only a small portion of them carry explanatory power. Our problem setting, motivated by the experimental sciences, arises when we must choose which time-consuming and expensive experiment to run next. We derive a sparse knowledge gradient (SpKG) decision policy based on ℓ1-penalized (Lasso) regression to identify the sparsity pattern before the budget is exhausted. This policy is a novel hybrid of Bayesian R&S with a frequentist learning approach. Theoretically, we provide an error bound on the posterior mean estimate, which is shown to achieve the minimax optimal rate √(s log p / n). Controlled experiments on both synthetic data and a real application, automatically designing experiments to identify the structure of an RNA molecule, show that the algorithm efficiently learns the correct set of nonzero parameters. It also outperforms several other learning policies. (An illustrative sketch of such a Lasso-guided learning loop follows the metadata record below.) | en_US
dc.format.extent | 417-425 | en_US
dc.language.iso | en_US | en_US
dc.relation.ispartof | Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, AISTATS 2016 | en_US
dc.rights | Final published version. This is an open access article. | en_US
dc.title | A lasso-based sparse knowledge gradient policy for sequential optimal learning | en_US
dc.type | Journal Article | en_US
pu.type.symplectic | http://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceeding | en_US
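
The abstract above outlines the overall idea: fit an ℓ1-penalized (Lasso) regression to the measurements collected so far to estimate the sparsity pattern, then use the resulting belief to decide which expensive experiment to run next. The Python code below is a minimal, illustrative sketch of that style of sequential loop, not the paper's SpKG policy: it assumes a linear belief model with sparse coefficients, uses scikit-learn's Lasso, and substitutes a crude mean-plus-uncertainty score for the knowledge gradient computation. The candidate set X, the run_experiment stub, and all parameter values are hypothetical.

# Illustrative sketch only, not the SpKG algorithm from the paper.
# Assumes a linear, sparse belief model y = x'theta + noise; the candidate set X,
# run_experiment(), the Lasso penalty, and the scoring rule are hypothetical choices.
import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(0)
p, n_candidates, budget = 200, 500, 60          # features, alternatives, measurement budget

theta_true = np.zeros(p)                        # sparse ground truth: only 5 nonzero features
theta_true[rng.choice(p, size=5, replace=False)] = rng.normal(0.0, 2.0, size=5)
X = rng.normal(size=(n_candidates, p))          # candidate experiments (alternatives)

def run_experiment(x, noise_sd=0.5):
    """Hypothetical stand-in for running one noisy, expensive experiment."""
    return x @ theta_true + rng.normal(0.0, noise_sd)

# Warm start with a few random measurements so the Lasso fit is defined.
X_obs, y_obs = [], []
for idx in rng.choice(n_candidates, size=10, replace=False):
    X_obs.append(X[idx]); y_obs.append(run_experiment(X[idx]))

for _ in range(budget):
    lasso = Lasso(alpha=0.1, max_iter=5000).fit(np.array(X_obs), np.array(y_obs))
    support = np.flatnonzero(lasso.coef_)       # estimated sparsity pattern
    if support.size == 0:
        idx = int(rng.integers(n_candidates))   # nothing identified yet: explore at random
    else:
        # Crude value-of-information proxy on the estimated support: predicted mean
        # plus a shrinking exploration bonus (a stand-in for the knowledge gradient).
        mu = X[:, support] @ lasso.coef_[support]
        bonus = np.linalg.norm(X[:, support], axis=1) / np.sqrt(len(X_obs))
        idx = int(np.argmax(mu + bonus))
    X_obs.append(X[idx]); y_obs.append(run_experiment(X[idx]))

final = Lasso(alpha=0.1, max_iter=5000).fit(np.array(X_obs), np.array(y_obs))
print("estimated support:", np.flatnonzero(final.coef_))
print("recommended alternative:", int(np.argmax(X @ final.coef_)))

In the paper itself the next measurement is chosen by maximizing the knowledge gradient computed from a Bayesian posterior restricted to the Lasso-estimated support, which yields the √(s log p / n) error bound quoted in the abstract; the score used here is only a placeholder for illustration.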

Files in This Item:
File | Description | Size | Format
LassoSparseKnowledgeGradientPolicyLearning.pdf |  | 1.9 MB | Adobe PDF


Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.