完整後設資料紀錄
DC 欄位語言
dc.contributor.authorLin, Ten_US
dc.contributor.authorWu, JCen_US
dc.contributor.authorChang, JSen_US
dc.date.accessioned2014-12-08T15:39:48Z-
dc.date.available2014-12-08T15:39:48Z-
dc.date.issued2004en_US
dc.identifier.isbn3-540-23300-8en_US
dc.identifier.issn0302-9743en_US
dc.identifier.urihttp://hdl.handle.net/11536/27197-
dc.description.abstractNamed-entities in free text represent a challenge to text analysis in Machine Translation and Cross Language Information Retrieval. These phrases are often transliterated into another language with a different sound inventory and writing system. Named-entities found in free text are often not listed in bilingual dictionaries. Although it is possible to identify and translate named-entities on the fly without a list of proper names and transliterations, an extensive list of existing transliterations certainly will ensure high precision rate. We use a seed list of proper names and transliterations to train a Machine Transliteration Model. With the model it is possible to extract proper names and their transliterations in monolingual or parallel corpora with high precision and recall rates.en_US
dc.language.isoen_USen_US
dc.titleExtraction of name and transliteration in monolingual and parallel corporaen_US
dc.typeArticle; Proceedings Paperen_US
dc.identifier.journalMACHINE TRANSLATION: FROM REAL USERS TO RESEARCH, PROCEEDINGSen_US
dc.citation.volume3265en_US
dc.citation.spage177en_US
dc.citation.epage186en_US
dc.contributor.department電信工程研究所zh_TW
dc.contributor.departmentInstitute of Communications Engineeringen_US
dc.identifier.wosnumberWOS:000224611600020-
顯示於類別:會議論文