Efficient optimization of loops and limits with randomized telescoping sums

Beatson, Alex; Adams, Ryan P

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr10g01

Full metadata record

DC Field	Value	Language
dc.contributor.author	Beatson, Alex	-
dc.contributor.author	Adams, Ryan P	-
dc.date.accessioned	2021-10-08T19:45:42Z	-
dc.date.available	2021-10-08T19:45:42Z	-
dc.date.issued	2019	en_US
dc.identifier.citation	Beatson, Alex, and Ryan P. Adams. "Efficient optimization of loops and limits with randomized telescoping sums." Proceedings of the 36th International Conference on Machine Learning 97 (2019), pp. 534-543.	en_US
dc.identifier.issn	2640-3498	-
dc.identifier.uri	http://proceedings.mlr.press/v97/beatson19a.html	-
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/pr10g01	-
dc.description.abstract	We consider optimization problems in which the objective requires an inner loop with many steps or is the limit of a sequence of increasingly costly approximations. Meta-learning, training recurrent neural networks, and optimization of the solutions to differential equations are all examples of optimization problems with this character. In such problems, it can be expensive to compute the objective function value and its gradient, but truncating the loop or using less accurate approximations can induce biases that damage the overall solution. We propose randomized telescope (RT) gradient estimators, which represent the objective as the sum of a telescoping series and sample linear combinations of terms to provide cheap unbiased gradient estimates. We identify conditions under which RT estimators achieve optimization convergence rates independent of the length of the loop or the required accuracy of the approximation. We also derive a method for tuning RT estimators online to maximize a lower bound on the expected decrease in loss per unit of computation. We evaluate our adaptive RT estimators on a range of applications including meta-optimization of learning rates, variational inference of ODE parameters, and training an LSTM to model long sequences.	en_US
dc.format.extent	534 - 543	en_US
dc.language.iso	en_US	en_US
dc.relation.ispartof	Proceedings of the 36th International Conference on Machine Learning	en_US
dc.rights	Final published version. Article is made available in OAR by the publisher's permission or policy.	en_US
dc.title	Efficient optimization of loops and limits with randomized telescoping sums	en_US
dc.type	Conference Article	en_US
pu.type.symplectic	http://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceeding	en_US
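
The abstract describes writing the objective as a telescoping sum and sampling its terms to get cheap unbiased estimates. The following is a minimal sketch of that idea for a single-sample estimator of a scalar value, not the authors' implementation: the sequence `approximations`, the sampling distribution `q`, and all function names are hypothetical and chosen only for illustration. It assumes L_0 = 0, so that L_H = sum over n of (L_n - L_{n-1}); sampling one index N with probability q(N) and returning (L_N - L_{N-1}) / q(N) then has expectation L_H. By linearity, the same reweighting would apply to gradients of the terms.

```python
import random


def approximations(n):
    """Hypothetical sequence of increasingly accurate approximations L_n.

    Here L_n is the n-th partial sum of 1/2 + 1/4 + ..., so L_0 = 0.
    """
    return sum(0.5 ** k for k in range(1, n + 1))


def single_sample_rt(L, q, rng):
    """Single-sample estimate of L(H), where H = len(q).

    q[n - 1] is the (assumed, strictly positive) probability of index n.
    """
    horizon = len(q)
    n = rng.choices(range(1, horizon + 1), weights=q, k=1)[0]
    delta = L(n) - L(n - 1)  # n-th term of the telescoping sum
    return delta / q[n - 1]  # reweight so the expectation equals L(H)


def expected_value(L, q):
    """Exact expectation of the estimator, summed over all indices."""
    return sum(
        q[n - 1] * ((L(n) - L(n - 1)) / q[n - 1])
        for n in range(1, len(q) + 1)
    )


if __name__ == "__main__":
    q = [0.5, 0.25, 0.125, 0.125]  # hypothetical sampling distribution
    rng = random.Random(0)
    print("one draw:   ", single_sample_rt(approximations, q, rng))
    print("expectation:", expected_value(approximations, q))
    print("target L_H: ", approximations(len(q)))
```

The choice of `q` trades variance against cost: putting more mass on small indices makes each draw cheaper to evaluate but inflates the weight on rare, expensive terms. The abstract reports that the authors tune this trade-off online; this sketch fixes `q` by hand.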

Files in This Item:

File	Description	Size	Format
OptimizeLoopsLimitsRandomTelescopingSums.pdf		1.99 MB	Adobe PDF