Skip to main content

Embracing the Blessing of Dimensionality in Factor Models.

Author(s): Li, Quefeng; Cheng, Guang; Fan, Jianqing; Wang, Yuyan

Download
To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1b005
Full metadata record
DC FieldValueLanguage
dc.contributor.authorLi, Quefeng-
dc.contributor.authorCheng, Guang-
dc.contributor.authorFan, Jianqing-
dc.contributor.authorWang, Yuyan-
dc.date.accessioned2021-10-11T14:17:40Z-
dc.date.available2021-10-11T14:17:40Z-
dc.date.issued2018-01en_US
dc.identifier.citationLi, Quefeng, Cheng, Guang, Fan, Jianqing, Wang, Yuyan. (2018). Embracing the Blessing of Dimensionality in Factor Models.. Journal of the American Statistical Association, 113 (521), 380 - 389. doi:10.1080/01621459.2016.1256815en_US
dc.identifier.issn0162-1459-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/pr1b005-
dc.description.abstractFactor modeling is an essential tool for exploring intrinsic dependence structures among high-dimensional random variables. Much progress has been made for estimating the covariance matrix from a high-dimensional factor model. However, the blessing of dimensionality has not yet been fully embraced in the literature: much of the available data are often ignored in constructing covariance matrix estimates. If our goal is to accurately estimate a covariance matrix of a set of targeted variables, shall we employ additional data, which are beyond the variables of interest, in the estimation? In this article, we provide sufficient conditions for an affirmative answer, and further quantify its gain in terms of Fisher information and convergence rate. In fact, even an oracle-like result (as if all the factors were known) can be achieved when a sufficiently large number of variables is used. The idea of using data as much as possible brings computational challenges. A divide-and-conquer algorithm is thus proposed to alleviate the computational burden, and also shown not to sacrifice any statistical accuracy in comparison with a pooled analysis. Simulation studies further confirm our advocacy for the use of full data, and demonstrate the effectiveness of the above algorithm. Our proposal is applied to a microarray data example that shows empirical benefits of using more data. Supplementary materials for this article are available online.en_US
dc.format.extent380 - 389en_US
dc.languageengen_US
dc.language.isoen_USen_US
dc.relation.ispartofJournal of the American Statistical Associationen_US
dc.rightsAuthor's manuscripten_US
dc.titleEmbracing the Blessing of Dimensionality in Factor Models.en_US
dc.typeJournal Articleen_US
dc.identifier.doidoi:10.1080/01621459.2016.1256815-
dc.identifier.eissn1537-274X-
pu.type.symplectichttp://www.symplectic.co.uk/publications/atom-terms/1.0/journal-articleen_US

Files in This Item:
File Description SizeFormat 
Embracing the Blessing of Dimensionality in Factor Models.pdf477.96 kBAdobe PDFView/Download


Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.