ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
advertisementadvertisement
Information Processing Letters
Volume 58, Issue 5, 10 June 1996, Pages 217-222
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Purchase PDF (574 K)

 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/0020-0190(96)00061-0    How to Cite or Link Using DOI (Opens New Window)
Copyright © 1996 Published by Elsevier Science B.V.

A time and space efficient data structure for string searching on large texts

Livio ColussiCorresponding Author Contact Information, E-mail The Corresponding Author and Alessia De Col

Dipartimento di Matematica Pura ed Applicata, Università di Padova, Via Belzoni 7, I-35131, Padova, Italy

Received 10 October 1995; 
revised 22 April 1996. 
Communicated by W.M. Turski 
Available online 16 February 1999.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

Suffix tree and suffix array are data structures that allow fast search in a large static text. By using the suffix tree data structure we can find all k occurrences of a pattern w in a text of length n in time Ow¦ + k). The same problem can be solved by using the suffix array data structure in time O(¦w¦ + log(n) + k). Thus suffix trees perform better than suffix arrays with respect to the search time. On the other hand suffix trees require as much as four times more memory space than suffix arrays. We propose a new data structure, the augmented suffix array, that allows searching in O(¦w¦ + log log(n) + k) time and requires about the same memory space as the suffix array. Moreover, in case of very large texts, most of the new data structure and the text itself can be stored in secondary memory without compromising search operation efficiency. This is not the case for both suffix trees and suffix arrays.

Author Keywords: Data structures; String matching

Article Outline

• References

 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.