ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
advertisementadvertisement
Data & Knowledge Engineering
Volume 37, Issue 3, June 2001, Pages 243-266
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Article
Purchase PDF (259 K)

 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/S0169-023X(01)00008-8    How to Cite or Link Using DOI (Opens New Window)
Copyright © 2001 Elsevier Science B.V. All rights reserved.

Mining patterns from graph traversals*1

Alexandros NanopoulosE-mail The Corresponding Author, a and Yannis ManolopoulosCorresponding Author Contact Information, E-mail The Corresponding Author, b, 1

a Data Engineering Lab, Department of Informatics, Aristotle University, Thessaloniki 54006, Greece b Department of Computer Science, University of Cyprus, Nicosia 1678, Cyprus

Received 3 August 2000;
revised 22 November 2000;
accepted 24 January 2001
Available online 25 May 2001.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

In data models that have graph representations, users navigate following the links of the graph structure. Conducting data mining on collected information about user accesses in such models, involves the determination of frequently occurring access sequences. In this paper, the problem of finding traversal patterns from such collections is examined. The determination of patterns is based on the graph structure of the model. For this purpose, three algorithms, one which is level-wise with respect to the lengths of the patterns and two which are not are presented. Additionally, we consider the fact that accesses within patterns may be interleaved with random accesses due to navigational purposes. The definition of the pattern type generalizes existing ones in order to take into account this fact. The performance of all algorithms and their sensitivity to several parameters is examined experimentally.

Author Keywords: Web log mining; Path traversal; Graph model

Article Outline

1. Introduction
2. Background and motivation
2.1. Definitions
2.2. Overview of web log mining methods
2.2.1. Standard association rules and sequential patterns
2.2.2. Maximal reference sequences
2.2.3. Composite association rules
2.3. Motivation
3. Mining access patterns from graph traversals
3.1. Subpath definition
3.2. Problem statement
3.3. Pruning with the support criterion
4. Level-wise determination of large paths
4.1. Data structures
4.2. Support counting
4.3. Candidate generation
5. Non-level-wise algorithms for large path determination
5.1. Phase-merging algorithm
5.2. Selective candidate extension
6. Performance results
6.1. Synthetic data generator
6.2. Results
7. Conclusions
Acknowledgements
References
Vitae









 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.