Over-parameterized Adversarial Training: An Analysis Overcoming the Curse of Dimensionality
Author(s): Zhang, Yi; Plevrakis, Orestis; Du, Simon S.; Li, Xingguo; Song, Zhao; Arora, Sanjeev
To refer to this page use:
http://arks.princeton.edu/ark:/88435/pr1nz7s
Abstract: | Adversarial training is a popular method to give neural nets robustness against adversarial perturbations. In practice, adversarial training leads to low robust training loss. However, a rigorous explanation for why this happens under natural conditions is still missing. Recently, a convergence theory of standard (non-adversarial) supervised training was developed by various groups for very overparameterized nets. It is unclear how to extend these results to adversarial training because of the min-max objective. Recently, a first step in this direction was made by Gao et al. using tools from online learning, but they require the width of the net to be exponential in the input dimension d, as well as an unnatural activation function. Our work proves convergence to low robust training loss for polynomial width instead of exponential, under natural assumptions and with ReLU activations. A key element of our proof is showing that ReLU networks near initialization can approximate the step function, which may be of independent interest. | (The min-max objective and the step-function construction are sketched below the record fields.)
Publication Date: | 2020 |
Citation: | Zhang, Yi, Orestis Plevrakis, Simon S. Du, Xingguo Li, Zhao Song, and Sanjeev Arora. "Over-parameterized Adversarial Training: An Analysis Overcoming the Curse of Dimensionality." Advances in Neural Information Processing Systems 33 (2020). |
ISSN: | 1049-5258 |
Type of Material: | Conference Article |
Journal/Proceeding Title: | Advances in Neural Information Processing Systems |
Version: | Final published version. Article is made available in OAR by the publisher's permission or policy. |
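For context on the min-max objective the abstract refers to: adversarial training minimizes the worst-case loss over norm-bounded perturbations. In generic notation (illustrative, not taken from the paper), with network f(·; W), per-example loss ℓ, training set {(x_i, y_i)}_{i=1}^{n}, and perturbation radius ε, the objective is

    \min_{W} \; \frac{1}{n} \sum_{i=1}^{n} \max_{\|\delta_i\| \le \epsilon} \ell\big( f(x_i + \delta_i; W),\, y_i \big)

The step-function approximation mentioned at the end of the abstract can be illustrated by the classical two-ReLU construction (a standard fact, not the paper's near-initialization argument): for a > 0,

    \mathrm{ReLU}\big(a x + \tfrac{1}{2}\big) - \mathrm{ReLU}\big(a x - \tfrac{1}{2}\big) \;\longrightarrow\; \mathbf{1}\{x > 0\} \quad \text{pointwise for } x \neq 0 \text{ as } a \to \infty.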
Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.