Skip to main content

A cost-aggregating integer linear program for motif finding

Author(s): Kingsford, C; Zaslavsky, E; Singh, Mona

Download
To refer to this page use: http://arks.princeton.edu/ark:/88435/pr14t06
Full metadata record
DC FieldValueLanguage
dc.contributor.authorKingsford, C-
dc.contributor.authorZaslavsky, E-
dc.contributor.authorSingh, Mona-
dc.date.accessioned2018-07-20T15:06:34Z-
dc.date.available2018-07-20T15:06:34Z-
dc.date.issued2011-04-05en_US
dc.identifier.citationKingsford, C, Zaslavsky, E, Singh, M. (2011). A cost-aggregating integer linear program for motif finding. Journal of Discrete Algorithms, 9 (326 - 334. doi:10.1016/j.jda.2011.04.001en_US
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/pr14t06-
dc.description.abstractIn the motif finding problem one seeks a set of mutually similar substrings within a collection of biological sequences. This is an important and widely-studied problem, as such shared motifs in DNA often correspond to regulatory elements. We study a combinatorial framework where the goal is to find substrings of a given length such that the sum of their pairwise distances is minimized. We describe a novel integer linear program for the problem, which uses the fact that distances between substrings come from a limited set of possibilities allowing for aggregate consideration of sequence position pairs with the same distances. We show how to tighten its linear programming relaxation by adding an exponential set of constraints and give an efficient separation algorithm that can find violated constraints, thereby showing that the tightened linear program can still be solved in polynomial time. We apply our approach to find optimal solutions for the motif finding problem and show that it is effective in practice in uncovering known transcription factor binding sites.en_US
dc.format.extent326 - 334en_US
dc.language.isoen_USen_US
dc.relation.ispartofJournal of Discrete Algorithmsen_US
dc.rightsAuthor's manuscripten_US
dc.titleA cost-aggregating integer linear program for motif findingen_US
dc.typeJournal Articleen_US
dc.identifier.doidoi:10.1016/j.jda.2011.04.001-
dc.date.eissued2011-04-05en_US
pu.type.symplectichttp://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceedingen_US

Files in This Item:
File Description SizeFormat 
A cost-aggregating integer linear program for motif finding.pdf791.4 kBAdobe PDFView/Download


Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.