Skip to main content

Nested Hierarchical Dirichlet Processes

Author(s): Paisley, John; Wang, Chong; Blei, David M; Jordan, Michael I

Download
To refer to this page use: http://arks.princeton.edu/ark:/88435/pr1gn4d
Abstract: We develop a nested hierarchical Dirichlet process (nHDP) for hierarchical topic modeling. The nHDP generalizes the nested Chinese restaurant process (nCRP) to allow each word to follow its own path to a topic node according to a per-document distribution over the paths on a shared tree. This alleviates the rigid, single-path formulation assumed by the nCRP, allowing documents to easily express complex thematic borrowings. We derive a stochastic variational inference algorithm for the model, which enables efficient inference for massive collections of text documents. We demonstrate our algorithm on 1.8 million documents from The New York Times and 2.7 million documents from Wikipedia.
Publication Date: 18-Apr-2015
Citation: Paisley, J., Wang, C., Blei, D. M., & Jordan, M. I. (2014). Nested hierarchical Dirichlet processes. IEEE Transactions on Pattern Analysis and Machine Intelligence, 37(2), 256-270.
DOI: doi:10.1109/TPAMI.2014.2318728
ISSN: 0162-8828
EISSN: 2160-9292
Pages: 256 - 270
Type of Material: Journal Article
Journal/Proceeding Title: IEEE Transactions on Pattern Analysis and Machine Intelligence
Version: Author's manuscript



Items in OAR@Princeton are protected by copyright, with all rights reserved, unless otherwise indicated.