Title: The Nested Indian Buffet Process :for Flexible Topic Modeling
Authors: Chien, Jen-Tzung
Chang, Ying-Lan
電機工程學系
Department of Electrical and Computer Engineering
Keywords: Bayesian learning;structural learning;topic model;Indian buffet process
Issue Date: 1-Jan-2014
Abstract: This paper presents a flexible topic model based on the nested Indian buffet process (nIBP). The flexibility is achieved by relaxing three constraints: (1) number of topics is fixed, (2) topics are independent, and (3) topic hierarchy for a document is limited by a single tree path. Bayesian nonparametric learning is conducted to build a tree model where the number of topics and the topic hierarchies are automatically learnt from the given data. In particular, we propose the nIBP to construct the topic mixture model for representation of heterogeneous documents where the mixture components are flexibly selected from tree nodes or dishes that a document or customer chooses in Indian buffet process. The selection is performed in a nested and hierarchical manner. The experiments on document representation show the benefits of using the proposed nIBP.
URI: http://hdl.handle.net/11536/146421
ISSN: 2308-457X
Journal: 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4
Begin Page: 1434
End Page: 1437
Appears in Collections:Conferences Paper