以語料為基礎的中文專有名詞分類之研究

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.author	葉政輝	en_US
dc.contributor.author	Cheng-Hui Yeh	en_US
dc.contributor.author	梁婷	en_US
dc.contributor.author	Tyne Liang	en_US
dc.date.accessioned	2014-12-12T02:30:23Z	-
dc.date.available	2014-12-12T02:30:23Z	-
dc.date.issued	2002	en_US
dc.identifier.uri	http://140.113.39.130/cdrfb3/record/nctu/#NT910394002	en_US
dc.identifier.uri	http://hdl.handle.net/11536/70174	-
dc.description.abstract	專有名詞的分類在自然語言處理中屬於重要的一環，尤其是針對文件處理以及語意的了解上。正確的專有名詞識別在文件搜尋中不僅可以扮演索引詞彙，在語意上也可以藉此了解人物、事件、地點與時間等關係。本論文中，我們使用了中文字元機率模型，利用人名常見字元來解決中文人名分類的問題。此外，藉由相鄰共現雙詞彙模型以及前後詞類兩模型，將專有名詞前後常見詞彙與詞類標記整合使用來識別與分類中文人名與組織名稱。經過訓練後，在測試上中文人名可以達到89%的正確率與99%的召回率，而組織名稱上也有89%的正確率與84%的召回率。	zh_TW
dc.description.abstract	Named-entity identification plays an important role in natural language processing, especially in document processing and message understanding. Named-entity can be a keyword on web or full-text retrieval. We can understand relationships among persons, events, locations, date or time in documents via correct named-entity identification. In this thesis, we use probabilities of characters used in common Chinese person names to retrieve Chinese person name. Furthermore, we propose co-occurring-neighbor word model and part-of-speech model to combine key terms and tagging information prior/posterior to named-entities. After training, we have 89% precision and 99% recall rate on Chinese person name classification experiments, 89% precision and 84% recall rate on organization classification experiments.	en_US
dc.language.iso	zh_TW	en_US
dc.subject	中文名詞	zh_TW
dc.subject	分類	zh_TW
dc.subject	Named-Entity	en_US
dc.subject	Classification	en_US
dc.title	以語料為基礎的中文專有名詞分類之研究	zh_TW
dc.title	A Corpus-Based Chinese Named-Entity Classification	en_US
dc.type	Thesis	en_US
dc.contributor.department	資訊科學與工程研究所	zh_TW
顯示於類別：	畢業論文