Title: | SimSearcher: A local similarity search engine for biological sequence databases |
Authors: | Tsai, TH; Lee, SY; Department of Computer Science |
Issue Date: | 2003 |
Abstract: | In this paper, an efficient local similarity search engine is developed by exploiting data mining techniques. First, all frequent patterns in the database are retrieved and recorded in a one-time preprocessing step. Then a query sequence is checked against the patterns found during preprocessing. A query region and a database sequence region that both match the same pattern form a possible seed for a local similarity. Finally, each such seed region pair is extended and scored to determine whether a local similarity exists with a score high enough to report. For computational efficiency, a novel clustering approach is proposed and integrated into the system, which is based on DELPHI, the local similarity search engine proposed by IBM. Extensive experiments demonstrate the performance of our system. |
URI: | http://hdl.handle.net/11536/18620 |
ISBN: | 0-7695-2031-6 |
Journal: | IEEE FIFTH INTERNATIONAL SYMPOSIUM ON MULTIMEDIA SOFTWARE ENGINEERING, PROCEEDINGS |
Start Page: | 305 |
End Page: | 312 |
Appears in Collections: | Conferences Paper |
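
Illustrative sketch: the three stages described in the abstract (one-time pattern preprocessing, seed detection against a query, and extend-and-score) can be outlined roughly as below. This is not the SimSearcher or DELPHI implementation. It assumes fixed-length substrings as a stand-in for mined frequent patterns, a simple match/mismatch score with an ungapped X-drop extension, and it omits the clustering step entirely. All function names and parameters (`index_patterns`, `find_seeds`, `extend_seed`, `search`, `k`, `min_support`, `x_drop`, `min_score`) are hypothetical.

```python
from collections import defaultdict


def index_patterns(database, k=3, min_support=2):
    """One-time preprocessing: keep length-k substrings seen >= min_support times.

    Assumption: fixed-length substrings stand in for mined frequent patterns.
    """
    occurrences = defaultdict(list)
    for seq_id, seq in enumerate(database):
        for pos in range(len(seq) - k + 1):
            occurrences[seq[pos:pos + k]].append((seq_id, pos))
    return {p: occ for p, occ in occurrences.items() if len(occ) >= min_support}


def find_seeds(query, patterns, k=3):
    """Yield (query_pos, seq_id, db_pos) wherever the query matches a pattern."""
    for qpos in range(len(query) - k + 1):
        for seq_id, dpos in patterns.get(query[qpos:qpos + k], ()):
            yield qpos, seq_id, dpos


def extend_seed(query, target, qpos, dpos, k=3, match=1, mismatch=-1, x_drop=2):
    """Ungapped extension in both directions; returns (score, q_start, q_end)."""

    def walk(qi, di, step):
        best = score = 0
        best_len = length = 0
        while 0 <= qi < len(query) and 0 <= di < len(target):
            score += match if query[qi] == target[di] else mismatch
            length += 1
            if score > best:
                best, best_len = score, length
            elif best - score > x_drop:
                break  # score fell too far below the best seen; stop extending
            qi += step
            di += step
        return best, best_len

    right, right_len = walk(qpos + k, dpos + k, 1)
    left, left_len = walk(qpos - 1, dpos - 1, -1)
    return k * match + left + right, qpos - left_len, qpos + k + right_len


def search(query, database, k=3, min_score=5):
    """Report (seq_id, q_start, q_end, score) for seeds scoring >= min_score."""
    patterns = index_patterns(database, k)
    hits = set()
    for qpos, seq_id, dpos in find_seeds(query, patterns, k):
        score, q_start, q_end = extend_seed(query, database[seq_id], qpos, dpos, k)
        if score >= min_score:
            hits.add((seq_id, q_start, q_end, score))
    return sorted(hits)
```

For example, `search("GGACGTACTT", ["ACGTACGTAA", "TTACGTACCC"])` would index the database once and then report query regions whose extended score reaches `min_score`. Indexing up front is what lets the per-query work be limited to seed lookup and extension.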