Skip to main content

Nonparametric Bayes Pachinko Allocation

Author(s): Li, Wei; Blei, David M; McCallum, Andrew

Download
To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1fn5g
Full metadata record
DC FieldValueLanguage
dc.contributor.authorLi, Wei-
dc.contributor.authorBlei, David M-
dc.contributor.authorMcCallum, Andrew-
dc.date.accessioned2020-04-01T13:21:26Z-
dc.date.available2020-04-01T13:21:26Z-
dc.date.issued2007en_US
dc.identifier.citationWei Li, David Blei, and Andrew McCallum. 2007. Nonparametric Bayes pachinko allocation. In Proceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligence (UAI'07), Ron Parr and Linda van der Gaag (Eds.). AUAI Press, Arlington, Virginia, United States, 243-250.en_US
dc.identifier.urihttp://arks.princeton.edu/ark:/88435/pr1fn5g-
dc.description.abstractRecent advances in topic models have explored complicated structured distributions to represent topic correlation. For example, the pachinko allocation model (PAM) captures arbitrary, nested, and possibly sparse correlations between topics using a directed acyclic graph (DAG). While PAM provides more flexibility and greater expressive power than previous models like latent Dirichlet allocation (LDA), it is also more difficult to determine the appropriate topic structure for a specific dataset. In this paper, we propose a nonparametric Bayesian prior for PAM based on a variant of the hierarchical Dirichlet process (HDP). Although the HDP can capture topic correlations defined by nested data structure, it does not automatically discover such correlations from unstructured data. By assuming an HDP-based prior for PAM, we are able to learn both the number of topics and how the topics are correlated. We evaluate our model on synthetic and real-world text datasets, and show that nonparametric PAM achieves performance matching the best of PAM without manually tuning the number of topics.en_US
dc.format.extent243 - 250en_US
dc.language.isoen_USen_US
dc.relation.ispartofProceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligence (UAI'07)en_US
dc.rightsAuthor's manuscripten_US
dc.titleNonparametric Bayes Pachinko Allocationen_US
dc.typeConference Articleen_US
pu.type.symplectichttp://www.symplectic.co.uk/publications/atom-terms/1.0/journal-articleen_US

Files in This Item:
File Description SizeFormat 
Blei - Nonparametric Bayes Pachinko Allocation.pdf537.86 kBAdobe PDFView/Download


Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.