標題: Anchor text mining for translation of Web queries: A transitive translation approach
作者: Lu, WH
Chien, LF
Lee, HJ
資訊工程學系
Department of Computer Science
關鍵字: algorithms;experimentation;performance;multilingual translation;anchor text mining;cross-language information retrieval;cross-language Web search;competitive linking algorithm
公開日期: 1-四月-2004
摘要: To discover translation knowledge in diverse data resources on the Web, this article proposes an effective approach to finding translation equivalents of query terms and constructing multilingual lexicons through the mining of Web anchor texts and link structures. Although Web anchor texts are wide-scoped hypertext resources, not every particular pair of languages contains sufficient anchor texts for effective extraction of translations for Web queries. For more generalized applications, the approach is designed based on a transitive translation model. The translation equivalents of a query term can be extracted via its translation in an intermediate language. To reduce interference from translation errors, the approach further integrates a competitive linking algorithm into the process of determining the most probable translation. A series of experiments has been conducted, including performance tests on term translation extraction, cross-language information retrieval, and translation suggestions for practical Web search services, respectively. The obtained experimental results have shown that the proposed approach is effective in extracting translations of unknown queries, is easy to combine with the probabilistic retrieval model to improve the cross-language retrieval performance, and is very useful when the considered language pairs lack a sufficient number of anchor texts. Based on the approach, an experimental system called LiveTrans has been developed for English-Chinese cross-language Web search.
URI: http://dx.doi.org/10.1145/984321.984324
http://hdl.handle.net/11536/26904
ISSN: 1046-8188
DOI: 10.1145/984321.984324
期刊: ACM TRANSACTIONS ON INFORMATION SYSTEMS
Volume: 22
Issue: 2
起始頁: 242
結束頁: 269
顯示於類別:期刊論文


文件中的檔案:

  1. 000220853800003.pdf

若為 zip 檔案,請下載檔案解壓縮後,用瀏覽器開啟資料夾中的 index.html 瀏覽全文。