Skip to main content

netNMF-sc: leveraging gene-gene interactions for imputation and dimensionality reduction in single-cell expression analysis.

Author(s): Elyanow, Rebecca; Dumitrascu, Bianca; Engelhardt, Barbara E; Raphael, Benjamin J

Download
To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1wr7x
Full metadata record
DC FieldValueLanguage
dc.contributor.authorElyanow, Rebecca-
dc.contributor.authorDumitrascu, Bianca-
dc.contributor.authorEngelhardt, Barbara E-
dc.contributor.authorRaphael, Benjamin J-
dc.date.accessioned2021-10-08T19:47:07Z-
dc.date.available2021-10-08T19:47:07Z-
dc.date.issued2020-02en_US
dc.identifier.citationElyanow, Rebecca, Dumitrascu, Bianca, Engelhardt, Barbara E, Raphael, Benjamin J. (2020). netNMF-sc: leveraging gene-gene interactions for imputation and dimensionality reduction in single-cell expression analysis.. Genome research, 30 (2), 195 - 204. doi:10.1101/gr.251603.119en_US
dc.identifier.issn1088-9051-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/pr1wr7x-
dc.description.abstractSingle-cell RNA-sequencing (scRNA-seq) enables high-throughput measurement of RNA expression in single cells. However, because of technical limitations, scRNA-seq data often contain zero counts for many transcripts in individual cells. These zero counts, or dropout events, complicate the analysis of scRNA-seq data using standard methods developed for bulk RNA-seq data. Current scRNA-seq analysis methods typically overcome dropout by combining information across cells in a lower-dimensional space, leveraging the observation that cells generally occupy a small number of RNA expression states. We introduce netNMF-sc, an algorithm for scRNA-seq analysis that leverages information across both cells and genes. netNMF-sc learns a low-dimensional representation of scRNA-seq transcript counts using network-regularized non-negative matrix factorization. The network regularization takes advantage of prior knowledge of gene-gene interactions, encouraging pairs of genes with known interactions to be nearby each other in the low-dimensional representation. The resulting matrix factorization imputes gene abundance for both zero and nonzero counts and can be used to cluster cells into meaningful subpopulations. We show that netNMF-sc outperforms existing methods at clustering cells and estimating gene-gene covariance using both simulated and real scRNA-seq data, with increasing advantages at higher dropout rates (e.g., >60%). We also show that the results from netNMF-sc are robust to variation in the input network, with more representative networks leading to greater performance gains.en_US
dc.format.extent195 - 204en_US
dc.languageengen_US
dc.language.isoen_USen_US
dc.relation.ispartofGenome researchen_US
dc.rightsAuthor's manuscripten_US
dc.titlenetNMF-sc: leveraging gene-gene interactions for imputation and dimensionality reduction in single-cell expression analysis.en_US
dc.typeJournal Articleen_US
dc.identifier.doidoi:10.1101/gr.251603.119-
dc.identifier.eissn1549-5469-
pu.type.symplectichttp://www.symplectic.co.uk/publications/atom-terms/1.0/journal-articleen_US

Files in This Item:
File Description SizeFormat 
GeneInteractionsDimensionReductionSingleCellExpressionAnalysis.pdf3 MBAdobe PDFView/Download


Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.