Variance reduction for faster non-convex optimization

Allen-Zhu, Z; Hazan, Elad

Variance reduction for faster non-convex optimization

Author(s): Allen-Zhu, Z; Hazan, Elad

Download

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1cd4q

Full metadata record

DC Field	Value	Language
dc.contributor.author	Allen-Zhu, Z	-
dc.contributor.author	Hazan, Elad	-
dc.date.accessioned	2018-07-20T15:06:20Z	-
dc.date.available	2018-07-20T15:06:20Z	-
dc.date.issued	2016	en_US
dc.identifier.citation	Allen-Zhu, Z, Hazan, E. (2016). Variance reduction for faster non-convex optimization. 2 (1093 - 1101	en_US
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/pr1cd4q	-
dc.description.abstract	We consider the fundamental problem in nonconvex optimization of efficiently reaching a stationary point. In contrast to the convex case, in the long history of this basic problem, the only known theoretical results on first-order nonconvex optimization remain to be full gradient descent that converges in 0(1/∈) iterations for smooth objectives, and stochastic gradient descent that converges in 0(1/∈2) iterations for objectives that are sum of smooth functions. We provide the first improvement in this line of research. Our result is based on the variance reduction trick recently introduced to convex optimization, as well as a brand new analysis of variance reduction that is suitable for non-convex optimization. For objectives that are sum of smooth functions, our first-order minibatch stochastic method converges with an 0(1/∈) rate, and is faster than full gradient descent by Ω(n1/3). We demonstrate the effectiveness of our methods on empirical risk minimizations with non-convex loss functions and training neural nets.	en_US
dc.format.extent	1093 - 1101	en_US
dc.language.iso	en_US	en_US
dc.relation.ispartof	33rd International Conference on Machine Learning, ICML 2016	en_US
dc.rights	Author's manuscript	en_US
dc.title	Variance reduction for faster non-convex optimization	en_US
dc.type	Conference Article	en_US
dc.date.eissued	2016	en_US
pu.type.symplectic	http://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceeding	en_US

Files in This Item:

File	Description	Size	Format
Variance reduction for faster non convex optimization.pdf		2.89 MB	Adobe PDF	View/Download

Show Simple Item Record