Skip to main content

Robust Sparse Principal Component Regression under the High Dimensional Elliptical Model

Author(s): Han, F; Liu, H

Download
To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1w28r
Full metadata record
DC FieldValueLanguage
dc.contributor.authorHan, F-
dc.contributor.authorLiu, H-
dc.date.accessioned2021-10-11T14:17:02Z-
dc.date.available2021-10-11T14:17:02Z-
dc.date.issued2013en_US
dc.identifier.citationHan, Fang, and Han Liu. "Robust sparse principal component regression under the high dimensional elliptical model." In Advances in Neural Information Processing Systems, pp. 1941-1949. 2013.en_US
dc.identifier.issn1049-5258-
dc.identifier.urihttp://papers.nips.cc/paper/4869-robust-sparse-principal-component-regression-under-the-high-dimensional-elliptical-model-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/pr1w28r-
dc.description.abstractIn this paper we focus on the principal component regression and its application to high dimension non-Gaussian data. The major contributions are in two folds. First, in low dimensions and under a double asymptotic framework where both the dimension d and sample size n can increase, by borrowing the strength from recent development in minimax optimal principal component estimation, we first time sharply characterize the potential advantage of classical principal component regression over least square estimation under the Gaussian model. Secondly, we propose and analyze a new robust sparse principal component regression on high dimensional elliptically distributed data. The elliptical distribution is a semiparametric generalization of the Gaussian, including many well known distributions such as multivariate Gaussian, rank-deficient Gaussian, t, Cauchy, and logistic. It allows the random vector to be heavy tailed and have tail dependence. These extra flexibilities make it very suitable for modeling finance and biomedical imaging data. Under the elliptical model, we prove that our method can estimate the regression coefficients in the optimal parametric rate and therefore is a good alternative to the Gaussian based methods. Experiments on synthetic and real world data are conducted to illustrate the empirical usefulness of the proposed method.en_US
dc.format.extent1941 - 1949en_US
dc.language.isoen_USen_US
dc.relation.ispartofAdvances in Neural Information Processing Systemsen_US
dc.rightsFinal published version. Article is made available in OAR by the publisher's permission or policy.en_US
dc.titleRobust Sparse Principal Component Regression under the High Dimensional Elliptical Modelen_US
dc.typeConference Articleen_US
pu.type.symplectichttp://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceedingen_US

Files in This Item:
File Description SizeFormat 
RegressionHighDEllipticalModel.pdf354.04 kBAdobe PDFView/Download


Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.