Skip to main content

Online variational inference for the hierarchical Dirichlet process

Author(s): Wang, C; Paisley, J; Blei, DM

Download
To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1fc0c
Full metadata record
DC FieldValueLanguage
dc.contributor.authorWang, C-
dc.contributor.authorPaisley, J-
dc.contributor.authorBlei, DM-
dc.date.accessioned2020-04-01T13:21:27Z-
dc.date.accessioned2021-10-08T19:44:15Z-
dc.date.available2020-04-01T13:21:27Z-
dc.date.available2021-10-08T19:44:15Z-
dc.date.issued2011en_US
dc.identifier.citationWang, C., Paisley, J. & Blei, D.. (2011). Online Variational Inference for the Hierarchical Dirichlet Process. Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, in PMLR 15:752-760en_US
dc.identifier.issn2640-3498-
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/pr1fc0c-
dc.description.abstractThe hierarchical Dirichlet process (HDP) is a Bayesian nonparametric model that can be used to model mixed-membership data with a potentially infinite number of components. It has been applied widely in probabilistic topic modeling, where the data are documents and the components are distributions of terms that reflect recurring patterns (or "topics") in the collection. Given a document collection, posterior inference is used to determine the number of topics needed and to characterize their distributions. One limitation of HDP analysis is that existing posterior inference algorithms require multiple passes through all the data-these algorithms are intractable for very large scale applications. We propose an online variational inference algorithm for the HDP, an algorithm that is easily applicable to massive and streaming data. Our algorithm is significantly faster than traditional inference algorithms for the HDP, and lets us analyze much larger data sets. We illustrate the approach on two large collections of text, showing improved performance over online LDA, the finite counterpart to the HDP topic model. Copyright 2011 by the authors.en_US
dc.format.extent752 - 760en_US
dc.language.isoen_USen_US
dc.relation.ispartofProceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, in PMLRen_US
dc.relation.replaceshttp://arks.princeton.edu/ark:/88435/pr19v24-
dc.relation.replaces88435/pr19v24-
dc.rightsFinal published version. This is an open access article.en_US
dc.titleOnline variational inference for the hierarchical Dirichlet processen_US
dc.typeConference Articleen_US
pu.type.symplectichttp://www.symplectic.co.uk/publications/atom-terms/1.0/conference-proceedingen_US

Files in This Item:
File Description SizeFormat 
Blei - Online Variational Inference for the Hierarchical.pdf1.3 MBAdobe PDFView/Download


Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.