標題: The Nested Indian Buffet Process :for Flexible Topic Modeling
作者: Chien, Jen-Tzung
Chang, Ying-Lan
電機工程學系
Department of Electrical and Computer Engineering
關鍵字: Bayesian learning;structural learning;topic model;Indian buffet process
公開日期: 1-Jan-2014
摘要: This paper presents a flexible topic model based on the nested Indian buffet process (nIBP). The flexibility is achieved by relaxing three constraints: (1) number of topics is fixed, (2) topics are independent, and (3) topic hierarchy for a document is limited by a single tree path. Bayesian nonparametric learning is conducted to build a tree model where the number of topics and the topic hierarchies are automatically learnt from the given data. In particular, we propose the nIBP to construct the topic mixture model for representation of heterogeneous documents where the mixture components are flexibly selected from tree nodes or dishes that a document or customer chooses in Indian buffet process. The selection is performed in a nested and hierarchical manner. The experiments on document representation show the benefits of using the proposed nIBP.
URI: http://hdl.handle.net/11536/146421
ISSN: 2308-457X
期刊: 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4
起始頁: 1434
結束頁: 1437
Appears in Collections:Conferences Paper