Non-Stochastic Control with Bandit Feedback

Gradu, Paula; Hallman, John; Hazan, Elad

Non-Stochastic Control with Bandit Feedback

Author(s): Gradu, Paula; Hallman, John; Hazan, Elad

Download

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1wg37

Full metadata record

DC Field	Value	Language
dc.contributor.author	Gradu, Paula	-
dc.contributor.author	Hallman, John	-
dc.contributor.author	Hazan, Elad	-
dc.date.accessioned	2021-10-08T19:50:50Z	-
dc.date.available	2021-10-08T19:50:50Z	-
dc.date.issued	2020	en_US
dc.identifier.citation	Gradu, Paula, John Hallman, and Elad Hazan. "Non-Stochastic Control with Bandit Feedback." Advances in Neural Information Processing Systems 33 (2020).	en_US
dc.identifier.issn	1049-5258	-
dc.identifier.uri	https://proceedings.neurips.cc/paper/2020/file/7a1d9028a78f418cb8f01909a348d9b2-Paper.pdf	-
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/pr1wg37	-
dc.description.abstract	We study the problem of controlling a linear dynamical system with adversarial perturbations where the only feedback available to the controller is the scalar loss, and the loss function itself is unknown. For this problem, with either a known or unknown system, we give an efficient sublinear regret algorithm. The main algorithmic difficulty is the dependence of the loss on past controls. To overcome this issue, we propose an efficient algorithm for the general setting of bandit convex optimization for loss functions with memory, which may be of independent interest.	en_US
dc.language.iso	en_US	en_US
dc.relation.ispartof	Advances in Neural Information Processing Systems	en_US
dc.rights	Final published version. Article is made available in OAR by the publisher's permission or policy.	en_US
dc.title	Non-Stochastic Control with Bandit Feedback	en_US
dc.type	Conference Article	en_US
pu.type.symplectic	http://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceeding	en_US

Files in This Item:

File	Description	Size	Format
NonstochasticControl.pdf		2.44 MB	Adobe PDF	View/Download

Show Simple Item Record