Skip to main content

A loop-counting method for covariate-corrected low-rank biclustering of gene-expression and genome-wide association study data

Author(s): Rangan, Aaditya V; McGrouther, Caroline C; Kelsoe, John; Schork, Nicholas; Stahl, Eli; et al

Download
To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1mz61
Full metadata record
DC FieldValueLanguage
dc.contributor.authorRangan, Aaditya V-
dc.contributor.authorMcGrouther, Caroline C-
dc.contributor.authorKelsoe, John-
dc.contributor.authorSchork, Nicholas-
dc.contributor.authorStahl, Eli-
dc.contributor.authorZhu, Qian-
dc.contributor.authorKrishnan, Arjun-
dc.contributor.authorYao, Vicky-
dc.contributor.authorTroyanskaya, Olga-
dc.contributor.authorBilaloglu, Seda-
dc.contributor.authorRaghavan, Preeti-
dc.contributor.authorBergen, Sarah-
dc.contributor.authorJureus, Anders-
dc.contributor.authorLanden, Mikael-
dc.contributor.authorBipolar Disorders Working Group of the Psychiatric Genomics Consortium-
dc.date.accessioned2021-10-08T19:47:39Z-
dc.date.available2021-10-08T19:47:39Z-
dc.date.issued2018-05-14en_US
dc.identifier.citationRangan, Aaditya V., Caroline C. McGrouther, John Kelsoe, Nicholas Schork, Eli Stahl, Qian Zhu, Arjun Krishnan, Vicky Yao, Olga Troyanskaya, Seda Bilaloglu, Preeti Raghavan, Sarah Bergen, Anders Jureus, Mikael Landen, Bipolar Disorders Working Group of the Psychiatric Genomics Consortium. "A loop-counting method for covariate-corrected low-rank biclustering of gene-expression and genome-wide association study data." PLoS Computational Biology 14, no. 5 (2018): pp. e1006105. doi:10.1371/journal.pcbi.1006105en_US
dc.identifier.issn1553-734X-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/pr1mz61-
dc.description.abstractA common goal in data-analysis is to sift through a large data-matrix and detect any significant submatrices (i.e., biclusters) that have a low numerical rank. We present a simple algorithm for tackling this biclustering problem. Our algorithm accumulates information about 2-by-2 submatrices (i.e., ‘loops’) within the data-matrix, and focuses on rows and columns of the data-matrix that participate in an abundance of low-rank loops. We demonstrate, through analysis and numerical-experiments, that this loop-counting method performs well in a variety of scenarios, outperforming simple spectral methods in many situations of interest. Another important feature of our method is that it can easily be modified to account for aspects of experimental design which commonly arise in practice. For example, our algorithm can be modified to correct for controls, categorical- and continuous-covariates, as well as sparsity within the data. We demonstrate these practical features with two examples; the first drawn from gene-expression analysis and the second drawn from a much larger genome-wide-association-study (GWAS).en_US
dc.format.extente1006105en_US
dc.languageengen_US
dc.language.isoen_USen_US
dc.relation.ispartofPLoS Computational Biologyen_US
dc.rightsFinal published version. This is an open access article.en_US
dc.titleA loop-counting method for covariate-corrected low-rank biclustering of gene-expression and genome-wide association study dataen_US
dc.typeJournal Articleen_US
dc.identifier.doi10.1371/journal.pcbi.1006105-
dc.identifier.eissn1553-7358-
pu.type.symplectichttp://www.symplectic.co.uk/publications/atom-terms/1.0/journal-articleen_US

Files in This Item:
File Description SizeFormat 
LoopCountingGeneExpression.pdf35.82 MBAdobe PDFView/Download


Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.