Skip to main content

Hyperparameter optimization: a spectral approach

Author(s): Hazan, Elad; Klivans, Adam; Yuan, Yang

Download
To refer to this page use: http://arks.princeton.edu/ark:/88435/pr19k08
Full metadata record
DC FieldValueLanguage
dc.contributor.authorHazan, Elad-
dc.contributor.authorKlivans, Adam-
dc.contributor.authorYuan, Yang-
dc.date.accessioned2021-10-08T19:49:17Z-
dc.date.available2021-10-08T19:49:17Z-
dc.date.issued2018en_US
dc.identifier.citationHazan, Elad, Adam Klivans, and Yang Yuan. "Hyperparameter optimization: a spectral approach." In International Conference on Learning Representations (2018).en_US
dc.identifier.urihttps://arxiv.org/pdf/1706.00764.pdf-
dc.identifier.urihttps://openreview.net/forum?id=H1zriGeCZ-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/pr19k08-
dc.description.abstractWe give a simple, fast algorithm for hyperparameter optimization inspired by techniques from the analysis of Boolean functions. We focus on the high-dimensional regime where the canonical example is training a neural network with a large number of hyperparameters. The algorithm — an iterative application of compressed sensing techniques for orthogonal polynomials — requires only uniform sampling of the hyperparameters and is thus easily parallelizable. Experiments for training deep neural networks on Cifar-10 show that compared to state-of-the-art tools (e.g., Hyperband and Spearmint), our algorithm finds significantly improved solutions, in some cases better than what is attainable by hand-tuning. In terms of overall running time (i.e., time required to sample various settings of hyperparameters plus additional computation time), we are at least an order of magnitude faster than Hyperband and Bayesian Optimization. We also outperform Random Search 8×. Our method is inspired by provably-efficient algorithms for learning decision trees using the discrete Fourier transform. We obtain improved sample-complexty bounds for learning decision trees while matching state-of-the-art bounds on running time (polynomial and quasipolynomial, respectively).en_US
dc.language.isoen_USen_US
dc.relation.ispartofInternational Conference on Learning Representationsen_US
dc.rightsAuthor's manuscripten_US
dc.titleHyperparameter optimization: a spectral approachen_US
dc.typeConference Articleen_US
pu.type.symplectichttp://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceedingen_US

Files in This Item:
File Description SizeFormat 
HyperparameterOptSpectral.pdf542.57 kBAdobe PDFView/Download


Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.