Bayesian group factor analysis with structured sparsity

Zhao, Shiwen; Gao, Chuan; Mukherjee, Sayan; Engelhardt, Barbara E

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr12c0k

Full metadata record

DC Field	Value	Language
dc.contributor.author	Zhao, Shiwen	-
dc.contributor.author	Gao, Chuan	-
dc.contributor.author	Mukherjee, Sayan	-
dc.contributor.author	Engelhardt, Barbara E	-
dc.date.accessioned	2021-10-08T19:48:47Z	-
dc.date.available	2021-10-08T19:48:47Z	-
dc.date.issued	2016	en_US
dc.identifier.citation	Zhao, Shiwen, Chuan Gao, Sayan Mukherjee, and Barbara E. Engelhardt. "Bayesian group factor analysis with structured sparsity." The Journal of Machine Learning Research 17, no. 196 (2016): pp. 1-47.	en_US
dc.identifier.issn	1532-4435	-
dc.identifier.uri	https://www.jmlr.org/papers/volume17/14-472/14-472.pdf	-
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/pr12c0k	-
dc.description.abstract	Latent factor models are the canonical statistical tool for exploratory analyses of low-dimensional linear structure for a matrix of p features across n samples. We develop a structured Bayesian group factor analysis model that extends the factor model to multiple coupled observation matrices; in the case of two observations, this reduces to a Bayesian model of canonical correlation analysis. Here, we carefully define a structured Bayesian prior that encourages both element-wise and column-wise shrinkage and leads to desirable behavior on high-dimensional data. In particular, our model puts a structured prior on the joint factor loading matrix, regularizing at three levels, which enables element-wise sparsity and unsupervised recovery of latent factors corresponding to structured variance across arbitrary subsets of the observations. In addition, our structured prior allows for both dense and sparse latent factors so that covariation among either all features or only a subset of features can be recovered. We use fast parameter-expanded expectation-maximization for parameter estimation in this model. We validate our method on simulated data with substantial structure. We show results of our method applied to three high-dimensional data sets, comparing results against a number of state-of-the-art approaches. These results illustrate useful properties of our model, including i) recovering sparse signal in the presence of dense effects; ii) the ability to scale naturally to large numbers of observations; iii) flexible observation- and factor-specific regularization to recover factors with a wide variety of sparsity levels and percentage of variance explained; and iv) tractable inference that scales to modern genomic and text data sizes.	en_US
dc.format.extent	1 - 47	en_US
dc.language.iso	en_US	en_US
dc.relation.ispartof	The Journal of Machine Learning Research	en_US
dc.rights	Final published version. Article is made available in OAR by the publisher's permission or policy.	en_US
dc.title	Bayesian group factor analysis with structured sparsity	en_US
dc.type	Journal Article	en_US
dc.identifier.eissn	1533-7928	-
pu.type.symplectic	http://www.symplectic.co.uk/publications/atom-terms/1.0/journal-article	en_US
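
Illustrative sketch (not part of the record): the abstract describes several coupled observation matrices that share latent factors through a joint loading matrix, with each factor active in an arbitrary subset of the observations. The Python snippet below simulates data with that generic structure. It is a minimal sketch under stated assumptions, not the authors' model or code: the function name, the `activity` mask, and the Gaussian loadings and noise are all hypothetical choices made for illustration, and the paper's structured three-level prior and its parameter-expanded EM estimation are not implemented here.

```python
import numpy as np


def simulate_group_factor_data(feature_dims, n_samples, activity,
                               noise_sd=0.1, seed=0):
    """Simulate coupled observation matrices that share latent factors.

    Hypothetical helper for illustration only.
    feature_dims: number of features p_m in each observation matrix.
    activity: (m, k) boolean mask; activity[i][j] is True when factor j
              loads on observation matrix i.
    """
    rng = np.random.default_rng(seed)
    activity = np.asarray(activity, dtype=bool)
    n_views, n_factors = activity.shape
    if n_views != len(feature_dims):
        raise ValueError("activity needs one row per observation matrix")

    # Latent factors shared by every observation matrix: k x n.
    factors = rng.standard_normal((n_factors, n_samples))

    loadings, observations = [], []
    for p, active in zip(feature_dims, activity):
        # One block of the joint loading matrix (p x k); columns of
        # factors that are inactive for this block are set to zero.
        block = rng.standard_normal((p, n_factors)) * active
        noise = noise_sd * rng.standard_normal((p, n_samples))
        loadings.append(block)
        observations.append(block @ factors + noise)
    return observations, loadings, factors


# Two observation matrices: factor 0 is shared by both, factor 1 is
# specific to the first matrix, factor 2 is specific to the second.
mask = [[True, True, False],
        [True, False, True]]
Y, Lambda, X = simulate_group_factor_data([5, 4], n_samples=20, activity=mask)
```

With two observation matrices, as in this usage example, the shared-plus-specific factor layout corresponds to the canonical correlation analysis setting that the abstract mentions.
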

Files in This Item:

File	Description	Size	Format
BayesianGroupFactorAnalysis.pdf		2.37 MB	Adobe PDF