Copyright © 1996 Published by Elsevier Science B.V.
A time and space efficient data structure for string searching on large texts
Received 10 October 1995;
References and further reading may be available for this article. To view references and further reading you must purchase this article.
Abstract
Suffix tree and suffix array are data structures that allow fast search in a large static text. By using the suffix tree data structure we can find all k occurrences of a pattern w in a text of length n in time O(¦w¦ + k). The same problem can be solved by using the suffix array data structure in time O(¦w¦ + log(n) + k). Thus suffix trees perform better than suffix arrays with respect to the search time. On the other hand suffix trees require as much as four times more memory space than suffix arrays. We propose a new data structure, the augmented suffix array, that allows searching in O(¦w¦ + log log(n) + k) time and requires about the same memory space as the suffix array. Moreover, in case of very large texts, most of the new data structure and the text itself can be stored in secondary memory without compromising search operation efficiency. This is not the case for both suffix trees and suffix arrays.
Author Keywords: Data structures; String matching







E-mail Article
Add to my Quick Links

Cited By in Scopus (11)






