Gene set bagging for estimating the probability a statistically significant result will replicate
Author(s): Jaffe, Andrew E; Storey, John D; Ji, Hongkai; Leek, Jeffrey T
DownloadTo refer to this page use:
http://arks.princeton.edu/ark:/88435/pr1s756j6w
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Jaffe, Andrew E | - |
dc.contributor.author | Storey, John D | - |
dc.contributor.author | Ji, Hongkai | - |
dc.contributor.author | Leek, Jeffrey T | - |
dc.date.accessioned | 2022-01-25T14:51:22Z | - |
dc.date.available | 2022-01-25T14:51:22Z | - |
dc.date.issued | 2013-12-12 | en_US |
dc.identifier.citation | Jaffe, Andrew E, Storey, John D, Ji, Hongkai, Leek, Jeffrey T. (2013). Gene set bagging for estimating the probability a statistically significant result will replicate. BMC Bioinformatics, 14 (1), 360 - 360. doi:10.1186/1471-2105-14-360 | en_US |
dc.identifier.issn | 1471-2105 | - |
dc.identifier.uri | http://arks.princeton.edu/ark:/88435/pr1s756j6w | - |
dc.description.abstract | Significance analysis plays a major role in identifying and ranking genes, transcription factor binding sites, DNA methylation regions, and other high-throughput features associated with illness. We propose a new approach, called gene set bagging, for measuring the probability that a gene set replicates in future studies. Gene set bagging involves resampling the original high-throughput data, performing gene-set analysis on the resampled data, and confirming that biological categories replicate in the bagged samples. Using both simulated and publicly-available genomics data, we demonstrate that significant categories in a gene set enrichment analysis may be unstable when subjected to resampling. We show our method estimates the replication probability (R), the probability that a gene set will replicate as a significant result in future studies, and show in simulations that this method reflects replication better than each set’s p-value. Our results suggest that gene lists based on p-values are not necessarily stable, and therefore additional steps like gene set bagging may improve biological inference on gene sets. | en_US |
dc.format.extent | 360 - 360 | en_US |
dc.language.iso | en_US | en_US |
dc.relation.ispartof | BMC Bioinformatics | en_US |
dc.rights | Final published version. This is an open access article. | en_US |
dc.title | Gene set bagging for estimating the probability a statistically significant result will replicate | en_US |
dc.type | Journal Article | en_US |
dc.identifier.doi | doi:10.1186/1471-2105-14-360 | - |
pu.type.symplectic | http://www.symplectic.co.uk/publications/atom-terms/1.0/journal-article | en_US |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Gene_set_bagging_statistically_significant_result_replicate.pdf | 677.88 kB | Adobe PDF | View/Download |
Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.