| 標題: | Chinese text summarization using a trainable summarizer and latent semantic analysis |
| 作者: | Yeh, JY Ke, HR Yang, WP 資訊工程學系 圖書館 Department of Computer Science Library |
| 公開日期: | 2002 |
| 摘要: | In this paper, two novel approaches are proposed to extract important sentences from a document to create its summary. The first is a corpus-based approach using feature analysis. It brings up three new ideas: 1) to employ ranked position to emphasize the significance of sentence position, 2) to reshape word unit to achieve higher accuracy of keyword importance, and 3) to train a score function by the genetic algorithm for obtaining a suitable combination of feature weights. The second approach combines the ideas of latent semantic analysis and text relationship maps to interpret conceptual structures of a document. Both approaches are applied to Chinese text summarization. The two approaches were evaluated by using a data corpus composed of 100 articles about politics from New Taiwan Weekly, and when the compression ratio was 30%, average recalls of 52.0% and 45.6% were achieved respectively. |
| URI: | http://hdl.handle.net/11536/29101 |
| ISBN: | 3-540-00261-8 |
| ISSN: | 0302-9743 |
| 期刊: | DIGITAL LIBRARIES: PEOPLE, KNOWLEDGE, AND TECHNOLOGY, PROCEEDINGS |
| Volume: | 2555 |
| 起始頁: | 76 |
| 結束頁: | 87 |
| 顯示於類別: | 會議論文 |

