ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
advertisementadvertisement
Data & Knowledge Engineering
Volume 46, Issue 1, July 2003, Pages 97-121
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Article
Purchase PDF (568 K)

 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/S0169-023X(02)00209-4    How to Cite or Link Using DOI (Opens New Window)
Copyright © 2003 Elsevier Science B.V. All rights reserved.

Incremental mining of sequential patterns in large databases

Florent MassegliaCorresponding Author Contact Information, E-mail The Corresponding Author, a, 1, Pascal PonceletE-mail The Corresponding Author, b and Maguelonne TeisseireE-mail The Corresponding Author, c

a INRIA Sophia Antipolis, 2004 route des lucioles, BP 93, Sophia Antipolis FR-06902, France b Laboratoire LGI2P, Ecole des Mines d’Alès, Site EERIE, Parc Scientifique Georges Besse 30035, Nîmes Cedex 1, France c LIRMM, 161 rue Ada 34392, Montpellier Cedex 5, France

Received 13 March 2002; 
accepted 21 August 2002. ;
Available online 10 January 2003.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

In this paper, we consider the problem of the incremental mining of sequential patterns when new transactions or new customers are added to an original database. We present a new algorithm for mining frequent sequences that uses information collected during an earlier mining process to cut down the cost of finding new sequential patterns in the updated database. Our test shows that the algorithm performs significantly faster than the naive approach of mining the whole updated database from scratch. The difference is so pronounced that this algorithm could also be useful for mining sequential patterns, since in many cases it is faster to apply our algorithm than to mine sequential patterns using a standard algorithm, by breaking down the database into an original database plus an increment.

Author Keywords: Sequential patterns; Incremental mining; Data mining

Article Outline

1. Introduction
2. Statement of the problem
2.1. Mining of sequential patterns
2.2. Incremental mining on discovered sequential patterns
2.3. Related work
2.3.1. SuffixTree and FASTUP approaches
2.3.2. ISM
3. IImage algorithm
3.1. An overview
3.1.1. First iteration
3.1.2. jth Iteration
3.2. The IImage algorithm
3.3. Optimization
4. Experiments
4.1. Datasets
4.2. Comparison of IImage with GSP
4.2.1. Naive vs. IImage algorithm
4.2.2. Performance in scaled-up databases
4.2.3. Varying the size of added transactions
4.2.4. Varying the number of added customers
4.3. IImage for mining sequential patterns
4.3.1. Candidate sets
4.3.2. Varying the size of updates
5. Conclusion
References
Vitae














 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.