完整後設資料紀錄
DC 欄位 | 值 | 語言 |
---|---|---|
dc.contributor.author | Yeh, JY | en_US |
dc.contributor.author | Ke, HR | en_US |
dc.contributor.author | Yang, WP | en_US |
dc.date.accessioned | 2014-12-08T15:42:57Z | - |
dc.date.available | 2014-12-08T15:42:57Z | - |
dc.date.issued | 2002 | en_US |
dc.identifier.isbn | 3-540-00261-8 | en_US |
dc.identifier.issn | 0302-9743 | en_US |
dc.identifier.uri | http://hdl.handle.net/11536/29101 | - |
dc.description.abstract | In this paper, two novel approaches are proposed to extract important sentences from a document to create its summary. The first is a corpus-based approach using feature analysis. It brings up three new ideas: 1) to employ ranked position to emphasize the significance of sentence position, 2) to reshape word unit to achieve higher accuracy of keyword importance, and 3) to train a score function by the genetic algorithm for obtaining a suitable combination of feature weights. The second approach combines the ideas of latent semantic analysis and text relationship maps to interpret conceptual structures of a document. Both approaches are applied to Chinese text summarization. The two approaches were evaluated by using a data corpus composed of 100 articles about politics from New Taiwan Weekly, and when the compression ratio was 30%, average recalls of 52.0% and 45.6% were achieved respectively. | en_US |
dc.language.iso | en_US | en_US |
dc.title | Chinese text summarization using a trainable summarizer and latent semantic analysis | en_US |
dc.type | Article; Proceedings Paper | en_US |
dc.identifier.journal | DIGITAL LIBRARIES: PEOPLE, KNOWLEDGE, AND TECHNOLOGY, PROCEEDINGS | en_US |
dc.citation.volume | 2555 | en_US |
dc.citation.spage | 76 | en_US |
dc.citation.epage | 87 | en_US |
dc.contributor.department | 資訊工程學系 | zh_TW |
dc.contributor.department | 圖書館 | zh_TW |
dc.contributor.department | Department of Computer Science | en_US |
dc.contributor.department | Library | en_US |
dc.identifier.wosnumber | WOS:000181472700008 | - |
顯示於類別: | 會議論文 |