Boosting for Online Convex Optimization

Hazan, Elad; Singh, Karan

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr11v5bd4z

Full metadata record

DC Field	Value	Language
dc.contributor.author	Hazan, Elad	-
dc.contributor.author	Singh, Karan	-
dc.date.accessioned	2023-12-28T16:06:04Z	-
dc.date.available	2023-12-28T16:06:04Z	-
dc.date.issued	2021	en_US
dc.identifier.citation	Hazan, Elad and Singh, Karan. "Boosting for Online Convex Optimization." Proceedings of the 38th International Conference on Machine Learning 139 (2021): 4140-4149.	en_US
dc.identifier.issn	2640-3498	-
dc.identifier.uri	https://proceedings.mlr.press/	-
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/pr11v5bd4z	-
dc.description.abstract	We consider the decision-making framework of online convex optimization with a very large number of experts. This setting is ubiquitous in contextual and reinforcement learning problems, where the size of the policy class renders enumeration and search within the policy class infeasible. Instead, we consider generalizing the methodology of online boosting. We define a weak learning algorithm as a mechanism that guarantees multiplicatively approximate regret against a base class of experts. In this access model, we give an efficient boosting algorithm that guarantees near-optimal regret against the convex hull of the base class. We consider both full and partial (a.k.a. bandit) information feedback models. We also give an analogous efficient boosting algorithm for the i.i.d. statistical setting. Our results simultaneously generalize online boosting and gradient boosting guarantees to the contextual learning model, online convex optimization, and bandit linear optimization settings.	en_US
dc.format.extent	4140 - 4149	en_US
dc.language.iso	en_US	en_US
dc.relation.ispartof	Proceedings of the 38th International Conference on Machine Learning	en_US
dc.rights	Final published version. This is an open access article.	en_US
dc.title	Boosting for Online Convex Optimization	en_US
dc.type	Conference Article	en_US
pu.type.symplectic	http://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceeding	en_US

Files in This Item:

File	Description	Size	Format
BoostingOnlineConvexOptimization.pdf		302.73 kB	Adobe PDF