A knowledge gradient policy for sequencing experiments to identify the structure of RNA molecules using a sparse additive belief model

Li, Y; Reyes, KG; Vazquez-Anderson, J; Wang, Y; Contreras, LM; Powell, William B

A knowledge gradient policy for sequencing experiments to identify the structure of RNA molecules using a sparse additive belief model

Author(s): Li, Y; Reyes, KG; Vazquez-Anderson, J; Wang, Y; Contreras, LM; et al

Download

To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1vp35

Full metadata record

DC Field	Value	Language
dc.contributor.author	Li, Y	-
dc.contributor.author	Reyes, KG	-
dc.contributor.author	Vazquez-Anderson, J	-
dc.contributor.author	Wang, Y	-
dc.contributor.author	Contreras, LM	-
dc.contributor.author	Powell, William B	-
dc.date.accessioned	2021-10-11T14:17:49Z	-
dc.date.available	2021-10-11T14:17:49Z	-
dc.date.issued	2018-01-01	en_US
dc.identifier.citation	Li, Y, Reyes, KG, Vazquez-Anderson, J, Wang, Y, Contreras, LM, Powell, WB. (2018). A knowledge gradient policy for sequencing experiments to identify the structure of RNA molecules using a sparse additive belief model. INFORMS Journal on Computing, 30 (4), 750 - 767. doi:10.1287/ijoc.2017.0803	en_US
dc.identifier.issn	1091-9856	-
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/pr1vp35	-
dc.description.abstract	Copyright: © 2018 INFORMS We present a sparse knowledge gradient (SpKG) algorithm for adaptively selecting the targeted regions within a large RNA molecule to identify which regions are most amenable to interactions with other molecules. Experimentally, such regions can be inferred from fluorescence measurements obtained by binding a complementary probe with fluorescence markers to the targeted regions. We perform a regularized, sparse linear model with a log link function where the marginal contribution to the thermodynamic cycle of each nucleotide is purely additive. The SpKG algorithm uniquely combines the Bayesian ranking and selection problem with the frequentist l 1 regularized regression approach Lasso. We use this algorithm to identify the sparsity pattern of the linear model as well as sequentially decide the best regions to test before exhausting an experimental budget. We also develop two new algorithms: batch SpKG and batch SpKG-LM. The first algorithm generates more suggestions sequentially to run parallel experiments. The second one dynamically adds new alternatives, in the form of types of probes, which are created by inserting, deleting, or mutating nucleotides within existing probes. In simulation, we demonstrate these algorithms on the Tetrahymena Group I intron (a midsize RNA molecule), showing that they efficiently learn the correct sparsity pattern, identify the most accessible region, and outperform several other policies.	en_US
dc.format.extent	750 - 767	en_US
dc.language.iso	en_US	en_US
dc.relation.ispartof	INFORMS Journal on Computing	en_US
dc.rights	Author's manuscript	en_US
dc.title	A knowledge gradient policy for sequencing experiments to identify the structure of RNA molecules using a sparse additive belief model	en_US
dc.type	Journal Article	en_US
dc.identifier.doi	doi:10.1287/ijoc.2017.0803	-
dc.identifier.eissn	1526-5528	-
pu.type.symplectic	http://www.symplectic.co.uk/publications/atom-terms/1.0/journal-article	en_US

Files in This Item:

File	Description	Size	Format
A knowledge gradient policy for sequencing experiments to identify the structure of RNA molecules using a sparse additive belief model.pdf		2.32 MB	Adobe PDF	View/Download

Show Simple Item Record