Towards Minimax Online Learning with Unknown Time Horizon

Luo, Haipeng; Schapire, Robert E

Towards Minimax Online Learning with Unknown Time Horizon

Author(s): Luo, Haipeng; Schapire, Robert E

Download

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1q837

Full metadata record

DC Field	Value	Language
dc.contributor.author	Luo, Haipeng	-
dc.contributor.author	Schapire, Robert E	-
dc.date.accessioned	2021-10-08T19:48:29Z	-
dc.date.available	2021-10-08T19:48:29Z	-
dc.date.issued	2014	en_US
dc.identifier.citation	Luo, Haipeng, Schapire, Robert E. (Towards Minimax Online Learning with Unknown Time Horizon	en_US
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/pr1q837	-
dc.description.abstract	We consider online learning when the time horizon is unknown. We apply a minimax analysis, beginning with the fixed horizon case, and then moving on to two unknown-horizon settings, one that assumes the horizon is chosen randomly according to some known distribution, and the other which allows the adversary full control over the horizon. For the random horizon setting with restricted losses, we derive a fully optimal minimax algorithm. And for the adversarial horizon setting, we prove a nontrivial lower bound which shows that the adversary obtains strictly more power than when the horizon is fixed and known. Based on the minimax solution of the random horizon setting, we then propose a new adaptive algorithm which "pretends" that the horizon is drawn from a distribution from a special family, but no matter how the actual horizon is chosen, the worst-case regret is of the optimal rate. Furthermore, our algorithm can be combined and applied in many ways, for instance, to online convex optimization, follow the perturbed leader, exponential weights algorithm and first order bounds. Experiments show that our algorithm outperforms many other existing algorithms in an online linear optimization setting.	en_US
dc.language.iso	en_US	en_US
dc.relation.ispartof	31st International Conference on Machine Learning	en_US
dc.rights	Author's manuscript	en_US
dc.title	Towards Minimax Online Learning with Unknown Time Horizon	en_US
dc.type	Conference Article	en_US
pu.type.symplectic	http://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceeding	en_US

Files in This Item:

File	Description	Size	Format
TowardsMinimaxOnlineLearningUnknownTimeHorizon.pdf		487.93 kB	Adobe PDF	View/Download
1307.8187v2.pdf		326.67 kB	Adobe PDF	View/Download

Show Simple Item Record