Logarithmic Regret for Online Control

Agarwal, Naman; Hazan, Elad; Singh, Karan

Logarithmic Regret for Online Control

Author(s): Agarwal, Naman; Hazan, Elad; Singh, Karan

Download

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1g55h

Full metadata record

DC Field	Value	Language
dc.contributor.author	Agarwal, Naman	-
dc.contributor.author	Hazan, Elad	-
dc.contributor.author	Singh, Karan	-
dc.date.accessioned	2021-10-08T19:49:28Z	-
dc.date.available	2021-10-08T19:49:28Z	-
dc.date.issued	2019	en_US
dc.identifier.citation	Agarwal, Naman, Elad Hazan, and Karan Singh. "Logarithmic Regret for Online Control." Advances in Neural Information Processing Systems 32 (2019).	en_US
dc.identifier.issn	1049-5258	-
dc.identifier.uri	https://papers.nips.cc/paper/2019/file/78719f11fa2df9917de3110133506521-Paper.pdf	-
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/pr1g55h	-
dc.description.abstract	We study optimal regret bounds for control in linear dynamical systems under adversarially changing strongly convex cost functions, given the knowledge of transition dynamics. This includes several well studied and influential frameworks such as the Kalman filter and the linear quadratic regulator. State of the art methods achieve regret which scales as T^0.5, where T is the time horizon. We show that the optimal regret in this fundamental setting can be significantly smaller, scaling as polylog(T). This regret bound is achieved by two different efficient iterative methods, online gradient descent and online natural gradient.	en_US
dc.language.iso	en_US	en_US
dc.relation.ispartof	Advances in Neural Information Processing Systems	en_US
dc.rights	Final published version. Article is made available in OAR by the publisher's permission or policy.	en_US
dc.title	Logarithmic Regret for Online Control	en_US
dc.type	Conference Article	en_US
pu.type.symplectic	http://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceeding	en_US

Files in This Item:

File	Description	Size	Format
LogRegretOnlineControl.pdf		333.17 kB	Adobe PDF	View/Download

Show Simple Item Record