Skip to main content

A Statistical Framework to Identify Deviation from Time Linearity in Epigenetic Aging

Author(s): Snir, Sagi; vonHoldt, Bridgett M.; Pellegrini, Matteo

Download
To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1rx46
Full metadata record
DC FieldValueLanguage
dc.contributor.authorSnir, Sagi-
dc.contributor.authorvonHoldt, Bridgett M.-
dc.contributor.authorPellegrini, Matteo-
dc.date.accessioned2019-05-06T18:08:53Z-
dc.date.available2019-05-06T18:08:53Z-
dc.date.issued2016-11-11en_US
dc.identifier.citationSnir, Sagi, vonHoldt, Bridgett M, Pellegrini, Matteo. (2016). A Statistical Framework to Identify Deviation from Time Linearity in Epigenetic Aging. PLOS Computational Biology, 12 (11), e1005183 - e1005183. doi:10.1371/journal.pcbi.1005183en_US
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/pr1rx46-
dc.description.abstractIn multiple studies DNA methylation has proven to be an accurate biomarker of age. To develop these biomarkers, the methylation of multiple CpG sites is typically linearly combined to predict chronological age. By contrast, in this study we apply the Universal PaceMaker (UPM) model to investigate changes in DNA methylation during aging. The UPM was initially developed to study rate acceleration/deceleration in sequence evolution. Rather than identifying which linear combinations of sites predicts age, the UPM models the rates of change of multiple CpG sites, as well as their starting methylation levels, and estimates the age of each individual to optimize the model fit. We refer to the estimated age as the “epigenetic age”, which is in contrast to the known chronological age of each individual. We construct a statistical framework and devise an algorithm to determine whether a genomic pacemaker is in effect (i.e rates of change vary with age). The decision is made by comparing two competing likelihood based models, the molecular clock (MC) and UPM. For the molecular clock model, we use the known chronological age of each individual and fit the methylation rates at multiple sites, and express the problem as a linear least squares and solve it in polynomial time. For the UPM case, the search space is larger as we are fitting both the epigenetic age of each individual as well as the rates for each site, yet we succeed to reduce the problem to the space of individuals and polynomial in the more significant space—the methylated sites. We first tested our algorithm on simulated data to elucidate the factors affecting the identification of the pacemaker model. We find that, provided with enough data, our algorithm is capable of identifying a pacemaker even when a weak signal is present in the data. Based on these results, we applied our method to DNA methylation data from human blood from individuals of various ages. Although the improvement in variance across sites between the UPM and MC was small, the results suggest that the existence of a pacemaker is highly significant. The PaceMaker results also suggest a decay in the rate of change in DNA methylation with age.en_US
dc.format.extente1005183 - e1005183en_US
dc.language.isoen_USen_US
dc.relation.ispartofPLOS Computational Biologyen_US
dc.rightsFinal published version. This is an open access article.en_US
dc.titleA Statistical Framework to Identify Deviation from Time Linearity in Epigenetic Agingen_US
dc.typeJournal Articleen_US
dc.identifier.doidoi:10.1371/journal.pcbi.1005183-
dc.date.eissued2016-11-11en_US
dc.identifier.eissn1553-7358-
pu.type.symplectichttp://www.symplectic.co.uk/publications/atom-terms/1.0/journal-articleen_US

Files in This Item:
File Description SizeFormat 
a statistical framework.PDF1.8 MBAdobe PDFView/Download


Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.