A Regret Minimization Approach to Iterative Learning Control
Author(s): Agarwal, Naman; Hazan, Elad; Majumdar, Anirudha; Singh, Karan
DownloadTo refer to this page use:
http://arks.princeton.edu/ark:/88435/pr18c9r426
Abstract: | We consider the setting of iterative learning control, or model-based policy learning in the presence of uncertain, time-varying dynamics. In this setting, we propose a new performance metric, planning regret, which replaces the standard stochastic uncertainty assumptions with worst case regret. Based on recent advances in non-stochastic control, we design a new iterative algorithm for minimizing planning regret that is more robust to model mismatch and uncertainty. We provide theoretical and empirical evidence that the proposed algorithm outperforms existing methods on several benchmarks. |
Publication Date: | 2021 |
Citation: | Agarwal, Naman, Hazan, Elad, Majumdar, Anirudha and Singh, Karan. "A Regret Minimization Approach to Iterative Learning Control." Proceedings of the 38th International Conference on Machine Learning 139 (2021): 100-109. |
ISSN: | 2640-3498 |
Pages: | 100 - 109 |
Type of Material: | Conference Article |
Series/Report no.: | Proceedings of Machine Learning Research; |
Journal/Proceeding Title: | Proceedings of the 38th International Conference on Machine Learning |
Version: | Final published version. This is an open access article. |
Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.