完整後設資料紀錄
DC 欄位語言
dc.contributor.authorYeh, JYen_US
dc.contributor.authorKe, HRen_US
dc.contributor.authorYang, WPen_US
dc.date.accessioned2014-12-08T15:42:57Z-
dc.date.available2014-12-08T15:42:57Z-
dc.date.issued2002en_US
dc.identifier.isbn3-540-00261-8en_US
dc.identifier.issn0302-9743en_US
dc.identifier.urihttp://hdl.handle.net/11536/29101-
dc.description.abstractIn this paper, two novel approaches are proposed to extract important sentences from a document to create its summary. The first is a corpus-based approach using feature analysis. It brings up three new ideas: 1) to employ ranked position to emphasize the significance of sentence position, 2) to reshape word unit to achieve higher accuracy of keyword importance, and 3) to train a score function by the genetic algorithm for obtaining a suitable combination of feature weights. The second approach combines the ideas of latent semantic analysis and text relationship maps to interpret conceptual structures of a document. Both approaches are applied to Chinese text summarization. The two approaches were evaluated by using a data corpus composed of 100 articles about politics from New Taiwan Weekly, and when the compression ratio was 30%, average recalls of 52.0% and 45.6% were achieved respectively.en_US
dc.language.isoen_USen_US
dc.titleChinese text summarization using a trainable summarizer and latent semantic analysisen_US
dc.typeArticle; Proceedings Paperen_US
dc.identifier.journalDIGITAL LIBRARIES: PEOPLE, KNOWLEDGE, AND TECHNOLOGY, PROCEEDINGSen_US
dc.citation.volume2555en_US
dc.citation.spage76en_US
dc.citation.epage87en_US
dc.contributor.department資訊工程學系zh_TW
dc.contributor.department圖書館zh_TW
dc.contributor.departmentDepartment of Computer Scienceen_US
dc.contributor.departmentLibraryen_US
dc.identifier.wosnumberWOS:000181472700008-
顯示於類別:會議論文