Domain prediction with probabilistic directional context.
Author(s): Ochoa, Alejandro; Singh, Mona
DownloadTo refer to this page use:
http://arks.princeton.edu/ark:/88435/pr17v7h
Abstract: | <h4>Motivation</h4>Protein domain prediction is one of the most powerful approaches for sequence-based function prediction. Although domain instances are typically predicted independently of each other, newer approaches have demonstrated improved performance by rewarding domain pairs that frequently co-occur within sequences. However, most of these approaches have ignored the order in which domains preferentially co-occur and have also not modeled domain co-occurrence probabilistically.<h4>Results</h4>We introduce a probabilistic approach for domain prediction that models 'directional' domain context. Our method is the first to score all domain pairs within a sequence while taking their order into account, even for non-sequential domains. We show that our approach extends a previous Markov model-based approach to additionally score all pairwise terms, and that it can be interpreted within the context of Markov random fields. We formulate our underlying combinatorial optimization problem as an integer linear program, and demonstrate that it can be solved quickly in practice. Finally, we perform extensive evaluation of domain context methods and demonstrate that incorporating context increases the number of domain predictions by ∼15%, with our approach dPUC2 (Domain Prediction Using Context) outperforming all competing approaches.<h4>Availability and implementation</h4>dPUC2 is available at http://github.com/alexviiia/dpuc2.<h4>Contact</h4>mona@cs.princeton.edu.<h4>Supplementary information</h4>Supplementary data are available at Bioinformatics online. |
Publication Date: | Aug-2017 |
Citation: | Ochoa, Alejandro, Singh, Mona. (2017). Domain prediction with probabilistic directional context.. Bioinformatics (Oxford, England), 33 (16), 2471 - 2478. doi:10.1093/bioinformatics/btx221 |
DOI: | doi:10.1093/bioinformatics/btx221 |
ISSN: | 1367-4803 |
EISSN: | 1367-4811 |
Pages: | 2471 - 2478 |
Language: | eng |
Type of Material: | Journal Article |
Journal/Proceeding Title: | Bioinformatics (Oxford, England) |
Version: | Final published version. This is an open access article. |
Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.