
Label optimal regret bounds for online local learning

Author(s): Awasthi, Pranjal; Charikar, Moses S; Lai, Kevin A; Risteski, Andrej

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr17k0k
Full metadata record
DC Field: Value

dc.contributor.author: Awasthi, Pranjal
dc.contributor.author: Charikar, Moses S
dc.contributor.author: Lai, Kevin A
dc.contributor.author: Risteski, Andrej
dc.date.accessioned: 2021-10-08T19:44:42Z
dc.date.available: 2021-10-08T19:44:42Z
dc.date.issued: 2015
dc.identifier.citation: Awasthi, Pranjal, Moses Charikar, Kevin A. Lai, and Andrej Risteski. "Label optimal regret bounds for online local learning." In Conference on Learning Theory (2015): pp. 150-166.
dc.identifier.issn: 2640-3498
dc.identifier.uri: http://proceedings.mlr.press/v40/Awasthi15a.html
dc.identifier.uri: http://arks.princeton.edu/ark:/88435/pr17k0k
dc.description.abstract: We resolve an open question from Christiano (2014b), posed at COLT’14, regarding the optimal dependency of the regret achievable for online local learning on the size of the label set. In this framework, the algorithm is shown a pair of items at each step, chosen from a set of n items. The learner then predicts a label for each item from a label set of size L and receives a real-valued payoff. This is a natural framework which captures many interesting scenarios such as online gambling and online max cut. Christiano (2014a) designed an efficient online learning algorithm for this problem achieving a regret of O(\sqrt{n L^3 T}), where T is the number of rounds. Information theoretically, one can achieve a regret of O(\sqrt{n T \log L}). One of the main open questions left in this framework concerns closing the above gap. In this work, we provide a complete answer to the question above via two main results. First, we show, via a tighter analysis, that the semi-definite programming based algorithm of Christiano (2014a) in fact achieves a regret of O(\sqrt{n L T}). Second, we show a matching computational lower bound. Namely, we show that a polynomial time algorithm for online local learning with lower regret would imply a polynomial time algorithm for the planted clique problem, which is widely believed to be hard. We prove a similar hardness result under a related conjecture concerning planted dense subgraphs that we put forth. Unlike planted clique, the planted dense subgraph problem does not have any known quasi-polynomial time algorithms. Computational lower bounds for online learning are relatively rare, and we hope that the ideas developed in this work will lead to lower bounds for other online learning scenarios as well.
dc.format.extent: 150 - 166
dc.language.iso: en_US
dc.relation.ispartof: Proceedings of The 28th Conference on Learning Theory
dc.relation.ispartofseries: Proceedings of Machine Learning Research
dc.rights: Final published version. Article is made available in OAR by the publisher's permission or policy.
dc.title: Label optimal regret bounds for online local learning
dc.type: Conference Article
pu.type.symplectic: http://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceeding
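
As a rough illustration of the interaction protocol described in the abstract above, the following Python sketch simulates the online local learning loop: at each of T rounds the learner is shown a pair of items out of n, commits to a label for each from a label set of size L, and then observes a real-valued payoff. The function names (online_local_learning, get_pair, get_payoff) and the uniform-random labeling strategy are illustrative assumptions, not part of the paper; in particular, this is not the semi-definite programming based algorithm of Christiano (2014a) analyzed in the article.

import random

def online_local_learning(n, L, T, get_pair, get_payoff):
    """Schematic interaction loop for the online local learning framework.

    n: number of items; L: size of the label set; T: number of rounds.
    get_pair(t) -> (u, v): the pair of items (indices in range(n)) shown at round t.
    get_payoff(t, label_u, label_v) -> float: real-valued payoff revealed after
    the learner commits to labels for the two items.
    """
    total_payoff = 0.0
    for t in range(T):
        u, v = get_pair(t)              # adversary reveals a pair of items
        label_u = random.randrange(L)   # placeholder strategy: pick labels
        label_v = random.randrange(L)   # uniformly at random
        total_payoff += get_payoff(t, label_u, label_v)
    return total_payoff

Regret in this framework is the gap between the payoff of the best fixed labeling of all n items in hindsight and the total payoff collected by a loop such as the one above; the article shows that an efficient algorithm can keep this gap at O(\sqrt{n L T}), and that a polynomial time algorithm with substantially lower regret would yield a polynomial time algorithm for planted clique.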

Files in This Item:
File: LabelOptimalRegretBoundsOnlineLocalLearning.pdf (322.76 kB, Adobe PDF)


Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.