Full metadata record
DC FieldValueLanguage
dc.contributor.authorSuri, Meeren_US
dc.contributor.authorRini, Stefanoen_US
dc.date.accessioned2020-10-05T02:00:32Z-
dc.date.available2020-10-05T02:00:32Z-
dc.date.issued2019-01-01en_US
dc.identifier.isbn978-1-7281-4300-2en_US
dc.identifier.issn1058-6393en_US
dc.identifier.urihttp://hdl.handle.net/11536/155065-
dc.description.abstractIn the dictionary based string matching problem, the positions in which a short sequence of symbols, query, occurs in a longer sequence, source, have to be retrieved. To improve the query search efficiency, the source can be pre-processed to build an inverted index, that is a table storing positions of occurrences of certain substrings in the text. Given an inverted index, various algorithms can be utilized to retrieve the position of the query in the source. In the literature, motivated by text retrieval and web search applications, the small v/s small (SvS) algorithm is often considered for the case in which the substrings stored are k-gram codes, i.e. all possible substrings of length k. In [9] we have proposed a novel approach to the string matching problem by letting the strings generating the inverted index be a variable length code, i.e. posting code, to be optimally design for a given performance metric. Posting codes [9] generalize classic k-gram codes and enable new query retrieval algorithms with potentially lower complexity than the SvS algorithm. In this paper, we propose a new query retrieval algorithm, the forward-backward SvS (FB-SvS) algorithm which can be applied when the posting code is a complete fix-free code. The complexity of the FB-SvS algorithm for various fix-free codes is investigated and numerical experiments using the human DNA sequence as the source are presented.en_US
dc.language.isoen_USen_US
dc.subjectDictionary-based string matchingen_US
dc.subjectSmall vs small algorithmen_US
dc.subjectInverted indexen_US
dc.subjectComplete fix-free codesen_US
dc.titleComplete Fix-Free Codes for the Statistical Dictionary-Based String Matching Problemen_US
dc.typeProceedings Paperen_US
dc.identifier.journalCONFERENCE RECORD OF THE 2019 FIFTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERSen_US
dc.citation.spage1389en_US
dc.citation.epage1393en_US
dc.contributor.department電機工程學系zh_TW
dc.contributor.departmentDepartment of Electrical and Computer Engineeringen_US
dc.identifier.wosnumberWOS:000544249200266en_US
dc.citation.woscount0en_US
Appears in Collections:Conferences Paper