Skip to main content

Boosting for Online Convex Optimization

Author(s): Hazan, Elad; Singh, Karan

Download
To refer to this page use: http://arks.princeton.edu/ark:/88435/pr11v5bd4z
Full metadata record
DC FieldValueLanguage
dc.contributor.authorHazan, Elad-
dc.contributor.authorSingh, Karan-
dc.date.accessioned2023-12-28T16:06:04Z-
dc.date.available2023-12-28T16:06:04Z-
dc.date.issued2021en_US
dc.identifier.citationHazan, Elad and Singh, Karan. "Boosting for Online Convex Optimization." Proceedings of the 38th International Conference on Machine Learning 139 (2021): 4140-4149.en_US
dc.identifier.issn2640-3498-
dc.identifier.urihttps://proceedings.mlr.press/-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/pr11v5bd4z-
dc.description.abstractWe consider the decision-making framework of online convex optimization with a very large number of experts. This setting is ubiquitous in contextual and reinforcement learning problems, where the size of the policy class renders enumeration and search within the policy class infeasible. Instead, we consider generalizing the methodology of online boosting. We define a weak learning algorithm as a mechanism that guarantees multiplicatively approximate regret against a base class of experts. In this access model, we give an efficient boosting algorithm that guarantees near-optimal regret against the convex hull of the base class. We consider both full and partial (a.k.a. bandit) information feedback models. We also give an analogous efficient boosting algorithm for the i.i.d. statistical setting. Our results simultaneously generalize online boosting and gradient boosting guarantees to contextual learning model, online convex optimization and bandit linear optimization settings.en_US
dc.format.extent4140 - 4149en_US
dc.language.isoen_USen_US
dc.relation.ispartofProceedings of the 38th International Conference on Machine Learningen_US
dc.rightsFinal published version. This is an open access article.en_US
dc.titleBoosting for Online Convex Optimizationen_US
dc.typeConference Articleen_US
pu.type.symplectichttp://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceedingen_US

Files in This Item:
File Description SizeFormat 
BoostingOnlineConvexOptimization.pdf302.73 kBAdobe PDFView/Download


Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.