Skip to main content

Deep learning sequence-based ab initio prediction of variant effects on expression and disease risk

Author(s): Zhou, Jian; Theesfeld, Chandra L; Yao, Kevin; Chen, Kathleen M; Wong, Aaron K; et al

Download
To refer to this page use: http://arks.princeton.edu/ark:/88435/pr12k18
Full metadata record
DC FieldValueLanguage
dc.contributor.authorZhou, Jian-
dc.contributor.authorTheesfeld, Chandra L-
dc.contributor.authorYao, Kevin-
dc.contributor.authorChen, Kathleen M-
dc.contributor.authorWong, Aaron K-
dc.contributor.authorTroyanskaya, Olga G-
dc.date.accessioned2021-10-08T19:48:32Z-
dc.date.available2021-10-08T19:48:32Z-
dc.date.issued2018en_US
dc.identifier.citationZhou, Jian, Chandra L. Theesfeld, Kevin Yao, Kathleen M. Chen, Aaron K. Wong, and Olga G. Troyanskaya. "Deep learning sequence-based ab initio prediction of variant effects on expression and disease risk." Nature Genetics 50, no. 8 (2018): 1171-1179. doi:10.1038/s41588-018-0160-6en_US
dc.identifier.issn1061-4036-
dc.identifier.urihttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC6094955/pdf/nihms965313.pdf-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/pr12k18-
dc.description.abstractKey challenges for human genetics, precision medicine and evolutionary biology include deciphering the regulatory code of gene expression and understanding the transcriptional effects of genome variation. However, this is extremely difficult because of the enormous scale of the noncoding mutation space. We developed a deep learning–based framework, ExPecto, that can accurately predict, ab initio from a DNA sequence, the tissue-specific transcriptional effects of mutations, including those that are rare or that have not been observed. We prioritized causal variants within disease- or trait-associated loci from all publicly available genome-wide association studies and experimentally validated predictions for four immune-related diseases. By exploiting the scalability of ExPecto, we characterized the regulatory mutation space for human RNA polymerase II–transcribed genes by in silico saturation mutagenesis and profiled > 140 million promoter-proximal mutations. This enables probing of evolutionary constraints on gene expression and ab initio prediction of mutation disease effects, making ExPecto an end-to-end computational framework for the in silico prediction of expression and disease risk.en_US
dc.format.extent1171 - 1179en_US
dc.languageengen_US
dc.language.isoen_USen_US
dc.relation.ispartofNature Geneticsen_US
dc.rightsAuthor's manuscripten_US
dc.titleDeep learning sequence-based ab initio prediction of variant effects on expression and disease risken_US
dc.typeJournal Articleen_US
dc.identifier.doi10.1038/s41588-018-0160-6-
dc.identifier.eissn1546-1718-
pu.type.symplectichttp://www.symplectic.co.uk/publications/atom-terms/1.0/journal-articleen_US

Files in This Item:
File Description SizeFormat 
DeepLearningSequenceAbInitioPrediction.pdf908.06 kBAdobe PDFView/Download


Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.