標題: | Subtopic segmentation for small corpus using a novel fuzzy model |
作者: | Chang, Tao-Hsing Lee, Chia-Hoang 資訊工程學系 Department of Computer Science |
關鍵字: | fuzzy modeling;fuzzy semantics;semantic similarity measurement;small corpus;topic segmentation |
公開日期: | 1-八月-2007 |
摘要: | Subtopic segmentation is a critical task in numerous applications, including information retrieval, automatic summarization, essay scoring, and others. Although several approaches have been developed, many are ineffective for specific domains with a small corpus because of the fuzziness of the semantics of words and sentences in the corpus. This paper explores the problem of subtopic segmentation by proposing a fuzzy model for the semantics of both words and sentences. The model has three characteristics. First, it can deal with the uncertainty in the semantics of words and sentences. Secondly, it can measure the fuzzy similarity between the fuzzy semantics of sentences. Thirdly, it can develop a fuzzy algorithm for segmenting a text into several subtopic segments. The experiments, especially for a short text with a small corpus in a specific domain, indicate that the method can efficiently increase the accuracy of subtopic segmentation over previous methods. |
URI: | http://dx.doi.org/10.1109/TFUZZ.2006.889911 http://hdl.handle.net/11536/10530 |
ISSN: | 1063-6706 |
DOI: | 10.1109/TFUZZ.2006.889911 |
期刊: | IEEE TRANSACTIONS ON FUZZY SYSTEMS |
Volume: | 15 |
Issue: | 4 |
起始頁: | 699 |
結束頁: | 709 |
顯示於類別: | 期刊論文 |