Title: | HIERARCHICAL THEME AND TOPIC MODEL FOR SUMMARIZATION |
Authors: | Chien, Jen-Tzung; Chang, Ying-Lan; Undergraduate Honors Program of Electrical Engineering and Computer Science |
Keywords: | Topic model; structural learning; Bayesian nonparametrics; document summarization |
Issue Date: | 1-Jan-2013 |
Abstract: | This paper presents a hierarchical summarization model that extracts representative sentences from a set of documents. We select thematic sentences and identify topical words using a hierarchical theme and topic model (H2TM). The latent themes and topics are inferred from the document collection. A tree stick-breaking process is proposed to draw the theme proportions that represent sentences. Structural learning is performed without fixing the number of themes or topics. The H2TM is fine-grained and flexible enough to represent words and sentences from heterogeneous documents. Thematic sentences are effectively extracted for document summarization. In the experiments, the proposed H2TM outperforms the compared methods in terms of precision, recall and F-measure. |
URI: | http://hdl.handle.net/11536/124936 |
ISBN: | 978-1-4799-1180-6 |
ISSN: | 2161-0363 |
Journal: | 2013 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP) |
Appears in Collections: | Conference Papers |