Context aware group nearest shrunken centroids in large-scale genomic studies

Author(s): Yang, Juemin; Han, Fang; Irizarry, Rafael A.; Liu, Han

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1h80x
Full metadata record
DC Field | Value | Language
dc.contributor.author | Yang, Juemin | -
dc.contributor.author | Han, Fang | -
dc.contributor.author | Irizarry, Rafael A. | -
dc.contributor.author | Liu, Han | -
dc.date.accessioned | 2020-04-07T18:41:37Z | -
dc.date.accessioned | 2020-04-09T17:40:41Z | -
dc.date.available | 2020-04-07T18:41:37Z | -
dc.date.available | 2020-04-09T17:40:41Z | -
dc.date.issued | 2014 | en_US
dc.identifier.citation | Yang, Juemin, Fang Han, Rafael Irizarry, and Han Liu. "Context aware group nearest shrunken centroids in large-scale genomic studies." In Proceedings of the Seventeenth International Conference on Artificial Intelligence and Statistics, (2014): pp. 1051-1059. | en_US
dc.identifier.issn | 2640-3498 | -
dc.identifier.uri | http://proceedings.mlr.press/v33/yang14b.html | -
dc.identifier.uri | http://arks.princeton.edu/ark:/88435/pr1h80x | -
dc.description.abstract | Recent genomic studies have identified genes related to specific phenotypes. In addition to marginal association analysis for individual genes, analyzing gene pathways (functionally related sets of genes) may yield additional valuable insights. We have devised an approach to phenotype classification from gene expression profiling. Our method, named “group Nearest Shrunken Centroids (gNSC)”, is an enhancement of Nearest Shrunken Centroids (NSC), a popular and scalable method for analyzing big data. While fully utilizing the variable structure of gene pathways, gNSC has computational speed comparable to that of NSC when the group size is small. Compared with NSC, gNSC improves the power of classification by utilizing the gene pathway information. In practice, we investigate the performance of gNSC on one of the largest microarray datasets aggregated from the internet. We show the effectiveness of our method by comparing the misclassification rate of gNSC with that of NSC. Additionally, we present a novel application of NSC/gNSC to context analysis of the association between pathways and certain medical words. Some of the newest biological findings are rediscovered. | en_US
dc.format.extent | 1051 - 1059 | en_US
dc.language.iso | en_US | en_US
dc.relation.ispartof | Proceedings of the Seventeenth International Conference on Artificial Intelligence and Statistics | en_US
dc.relation.ispartofseries | Proceedings of Machine Learning Research; | -
dc.relation.replaces | http://arks.princeton.edu/ark:/88435/pr18n5r | -
dc.relation.replaces | 88435/pr18n5r | -
dc.rights | Final published version. This is an open access article. | en_US
dc.title | Context aware group nearest shrunken centroids in large-scale genomic studies | en_US
dc.type | Conference Article | en_US
pu.type.symplectic | http://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceeding | en_US

Files in This Item:
File | Description | Size | Format
ContextAwareNearCentroidsGenomStudies.pdf | | 689.43 kB | Adobe PDF


Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.