Skip to main content

CODA: High dimensional Copula Discriminant Analysis

Author(s): Han, Fang; Zhao, Tuo; Liu, Han

Download
To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1rr5q
Abstract: We propose a high dimensional classification method, named the Copula Discriminant Analysis (CODA). The CODA generalizes the normal-based linear discriminant analysis to the larger Gaussian Copula models (or the nonparanormal) as proposed by Liu et al. (2009). To simultaneously achieve estimation efficiency and robustness, the nonparametric rank-based methods including the Spearman's rho and Kendall's tau are exploited in estimating the covariance matrix. In high dimensional settings, we prove that the sparsity pattern of the discriminant features can be consistently recovered with the parametric rate, and the expected misclassification error is consistent to the Bayes risk. Our theory is backed up by careful numerical experiments, which show that the extra flexibility gained by the CODA method incurs little efficiency loss even when the data are truly Gaussian. These results suggest that the CODA method can be an alternative choice besides the normal-based high dimensional linear discriminant analysis.
Publication Date: 2013
Citation: Han, Fang, Tuo Zhao, and Han Liu. "CODA: High dimensional copula discriminant analysis." Journal of Machine Learning Research 14, no. Feb (2013): 629-671. Retrieved from http://www.jmlr.org/papers/v14/han13a.html
ISSN: 1532-4435
EISSN: 1533-7928
Pages: 629 - 671
Type of Material: Journal Article
Journal/Proceeding Title: Journal of Machine Learning Research
Version: Final published version. This is an open access article.



Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.