標題: SimSearcher: A local similarity search engine for biological sequence databases
作者: Tsai, TH
Lee, SY
資訊工程學系
Department of Computer Science
公開日期: 2003
摘要: In this paper an efficient local similarity search engine is developed exploiting some techniques of data mining. First of all, all frequent patterns in the database are retrieved and recorded in a one-time preprocessing process. Then a query sequence is checked for whether any pattern from the preprocessing stage is matched to the query. Two regions coming from the query and a database sequence that both match to a pattern form a possible seed for the local similarity. Finally, we extend and score each such seed region pair to see whether there really exists a local similarity with a score high enough for reporting. For computational efficiency, a novel clustering approach is proposed and is integrated into the proposed system, which is based on the local similarity search engine - DELPHI system proposed by IBM. Extensive experiments are demonstrated to show the performance of our system.
URI: http://hdl.handle.net/11536/18620
ISBN: 0-7695-2031-6
期刊: IEEE FIFTH INTERNATIOANL SYMPOSIUM ON MULTIMEDIA SOFTWARE ENGINEERING, PROCEEDINGS
起始頁: 305
結束頁: 312
顯示於類別:會議論文