Title: | The Nested Indian Buffet Process for Flexible Topic Modeling |
Authors: | Chien, Jen-Tzung; Chang, Ying-Lan; Department of Electrical and Computer Engineering |
Keywords: | Bayesian learning; structural learning; topic model; Indian buffet process |
Issue Date: | 1-Jan-2014 |
Abstract: | This paper presents a flexible topic model based on the nested Indian buffet process (nIBP). The flexibility is achieved by relaxing three constraints: (1) the number of topics is fixed, (2) topics are independent, and (3) the topic hierarchy for a document is limited to a single tree path. Bayesian nonparametric learning is conducted to build a tree model in which the number of topics and the topic hierarchies are automatically learnt from the given data. In particular, we propose the nIBP to construct a topic mixture model for representing heterogeneous documents, where the mixture components are flexibly selected from the tree nodes, or dishes, that a document, or customer, chooses in the Indian buffet process. The selection is performed in a nested and hierarchical manner. Experiments on document representation show the benefits of using the proposed nIBP. |
URI: | http://hdl.handle.net/11536/146421 |
ISSN: | 2308-457X |
Journal: | 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4 |
Start page: | 1434 |
End page: | 1437 |
Appears in Collections: | Conferences Paper |
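
The abstract describes documents (customers) selecting topics (dishes) through an Indian buffet process. As a rough illustration only, the following is a minimal sketch of the standard one-level IBP generative process, not the authors' nested model or their inference procedure. The function names `sample_poisson` and `sample_ibp`, the concentration parameter `alpha`, and the `seed` argument are assumptions introduced here for illustration.

```python
import math
import random


def sample_poisson(rate, rng):
    """Draw from Poisson(rate) using Knuth's multiplication method."""
    threshold = math.exp(-rate)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def sample_ibp(num_customers, alpha, seed=0):
    """Return (choices, dish_counts) for a standard one-level IBP.

    choices[i] is the set of dish indices taken by customer i + 1;
    dish_counts[k] is how many customers took dish k.
    This is a textbook IBP sketch, not the nested variant of the paper.
    """
    rng = random.Random(seed)
    dish_counts = []
    choices = []
    for i in range(1, num_customers + 1):
        chosen = set()
        # Existing dish k is taken with probability m_k / i,
        # where m_k counts earlier customers who took it.
        for k, m_k in enumerate(dish_counts):
            if rng.random() < m_k / i:
                chosen.add(k)
        # The customer also tries Poisson(alpha / i) brand-new dishes.
        for _ in range(sample_poisson(alpha / i, rng)):
            dish_counts.append(0)
            chosen.add(len(dish_counts) - 1)
        for k in chosen:
            dish_counts[k] += 1
        choices.append(chosen)
    return choices, dish_counts


if __name__ == "__main__":
    choices, dish_counts = sample_ibp(num_customers=5, alpha=2.0, seed=1)
    for i, dishes in enumerate(choices, start=1):
        print(f"document {i}: topics {sorted(dishes)}")
    print("number of topics discovered:", len(dish_counts))
```

Each customer may hold several dishes at once, which is what lets a document mix multiple topics while the total number of dishes grows with the data. A nested version would, under the reading of the abstract assumed here, repeat this selection among the children of every selected tree node, so that a document can follow more than one path through the topic tree.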