Skip to main content

Subspace estimation from unbalanced and incomplete data matrices: ℓ2,∞ statistical guarantees

Author(s): Cai, Changxiao; Li, Gen; Chi, Yuejie; Poor, H Vincent; Chen, Yuxin

Download
To refer to this page use: http://arks.princeton.edu/ark:/88435/pr10k26b6m
Full metadata record
DC FieldValueLanguage
dc.contributor.authorCai, Changxiao-
dc.contributor.authorLi, Gen-
dc.contributor.authorChi, Yuejie-
dc.contributor.authorPoor, H Vincent-
dc.contributor.authorChen, Yuxin-
dc.date.accessioned2024-02-04T02:02:02Z-
dc.date.available2024-02-04T02:02:02Z-
dc.date.issued2021-04en_US
dc.identifier.citationCai, Changxiao, Li, Gen, Chi, Yuejie, Poor, H Vincent, Chen, Yuxin. (2021). Subspace estimation from unbalanced and incomplete data matrices: ℓ2,∞ statistical guarantees. The Annals of Statistics, 49 (2), 10.1214/20-aos1986en_US
dc.identifier.issn0090-5364-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/pr10k26b6m-
dc.description.abstractThis paper is concerned with estimating the column space of an unknown low-rank matrix A⋆ ∈ Rd1×d2, given noisy and partial observations of its entries. There is no shortage of scenarios where the observations—while being too noisy to support faithful recovery of the entire matrix—still convey sufficient information to enable reliable estimation of the column space of interest. This is particularly evident and crucial for the highly unbalanced case where the column dimension d2 far exceeds the row dimension d1, which is the focal point of the current paper. We investigate an efficient spectral method, which operates upon the sample Gram matrix with diagonal deletion. While this algorithmic idea has been studied before, we establish new statistical guarantees for this method in terms of both ℓ2 and ℓ2,∞ estimation accuracy, which improve upon prior results if d2 is substantially larger than d1. To illustrate the effectiveness of our findings, we derive matching minimax lower bounds with respect to the noise levels, and develop consequences of our general theory for three applications of practical importance: (1) tensor completion from noisy data, (2) covariance estimation/principal component analysis with missing data and (3) community recovery in bipartite graphs. Our theory leads to improved performance guarantees for all three cases.en_US
dc.format.extent944-967en_US
dc.language.isoen_USen_US
dc.relation.ispartofThe Annals of Statisticsen_US
dc.rightsAuthor's manuscripten_US
dc.titleSubspace estimation from unbalanced and incomplete data matrices: ℓ2,∞ statistical guaranteesen_US
dc.typeJournal Articleen_US
dc.identifier.doidoi:10.1214/20-aos1986-
pu.type.symplectichttp://www.symplectic.co.uk/publications/atom-terms/1.0/journal-articleen_US

Files in This Item:
File Description SizeFormat 
1910.04267.pdf1.91 MBAdobe PDFView/Download


Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.