Publication Date: 10/11/2021
Event: ICCV 2021
Reference: pp. 7496-7505, 2021
Authors: Yao Li, NEC Laboratories America, Inc., University of North Carolina, Chapel Hill; Martin Renqiang Min, NEC Laboratories America, Inc.; Thomas Lee, University of California, Davis; Wenchao Yu, NEC Laboratories America, Inc.; Erik Kruus, NEC Laboratories America, Inc.; Wei Wang, University of California, Los Angeles; Cho-Jui Hsieh, University of California, Los Angeles
Abstract: Recent studies have demonstrated the vulnerability of deep neural networks to adversarial examples. Inspired by the observation that adversarial examples often lie outside the natural image data manifold and that the intrinsic dimension of image data is much smaller than its pixel space dimension, we propose to embed high-dimensional input images into a low-dimensional space and apply regularization on the embedding space to push the adversarial examples back to the manifold. The proposed framework is called Embedding Regularized Classifier (ER-Classifier), which improves the adversarial robustness of the classifier through embedding regularization. Besides improving classification accuracy against adversarial examples, the framework can be combined with detection methods to detect adversarial examples. Experimental results on several benchmark datasets show that our proposed framework achieves good performance against strong adversarial attack methods.
Publication Link: https://ieeexplore.ieee.org/document/9710826