Build, Compute, Critique, Repeat: Data Analysis with Latent Variable Models
Author(s): Blei, David M
DownloadTo refer to this page use:
http://arks.princeton.edu/ark:/88435/pr12b6p
Abstract: | We survey latent variable models for solving data-analysis problems. A latent variable model is a probabilistic model that encodes hidden patterns in the data. We uncover these patterns from their conditional distribution and use them to summarize data and form predictions. Latent variable models are important in many fields, including computational biology, natural language processing, and social network analysis. Our perspective is that models are developed iteratively: We build a model, use it to analyze data, assess how it succeeds and fails, revise it, and repeat. We describe how new research has transformed these essential activities. First, we describe probabilistic graphical models, a language for formulating latent variable models. Second, we describe mean field variational inference, a generic algorithm for approximating conditional distributions. Third, we describe how to use our analyses to solve problems: exploring the data, forming predictions, and pointing us in the direction of improved models. |
Publication Date: | 3-Jan-2014 |
Citation: | Blei, David M. (2014). Build, Compute, Critique, Repeat: Data Analysis with Latent Variable Models. Annual Review of Statistics and Its Application, 1 (1), 203 - 232. doi:10.1146/annurev-statistics-022513-115657 |
DOI: | doi:10.1146/annurev-statistics-022513-115657 |
ISSN: | 2326-8298 |
EISSN: | 2326-831X |
Pages: | 203 - 232 |
Type of Material: | Journal Article |
Journal/Proceeding Title: | Annual Review of Statistics and Its Application |
Version: | Author's manuscript |
Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.