ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
Data & Knowledge Engineering
Volume 59, Issue 3, December 2006, Pages 627-651
Including: ER 2003 - Selection of papers presented at the 22nd International Conference on Conceptual Modeling, 22nd International Conference on Conceptual Modeling
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Article
Purchase PDF (455 K)

  E-mail Article   
  Add to my Quick Links   
Bookmark and share in 2collab (opens in new window)
Request permission to reuse this article
  Cited By in Scopus (0)
 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/j.datak.2005.10.002    How to Cite or Link Using DOI (Opens New Window)
Copyright © 2005 Elsevier B.V. All rights reserved.

XML structural delta mining: Issues and challenges

Qiankun Zhaoa, Ling Chena, Sourav S. Bhowmicka, Corresponding Author Contact Information, E-mail The Corresponding Author and Sanjay Madriab

aSchool of Computer Engineering, Nanyang Technological University, Singapore 639798, Singapore bDepartment of Computer Science, University of Missouri, Rolla, USA

Received 24 August 2005; 
accepted 12 October 2005. 
Available online 17 November 2005.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

Recently, there is an increasing research efforts in XML data mining. These research efforts largely assumed that XML documents are static. However, in reality, the documents are rarely static. In this paper, we propose a novel research problem called XML structural delta mining. The objective of XML structural delta mining is to discover knowledge by analyzing structural evolution pattern (also called structural delta) of history of XML documents. Unlike existing approaches, XML structural delta mining focuses on the dynamic and temporal features of XML data. Furthermore, the data source for this novel mining technique is a sequence of historical versions of an XML document rather than a set of snapshot XML documents. Such mining technique can be useful in many applications such as change detection for very large XML documents, efficient XML indexing, XML search engine, etc. Our aim in this paper is not to provide a specific solution to a particular mining problem. Rather, we present the vision of the mining framework and present the issues and challenges for three types of XML structural delta mining: identifying various interesting structures, discovering association rules from structural deltas, and structural change pattern-based classification.

Keywords: Versions of XML documents; Structural delta; Dynamic metrics; XML structural delta mining; Research issues; Applications

Article Outline

1. Introduction
1.1. Motivation
1.2. The framework
1.3. Paper organization
2. Related work
2.1. XML change detection
2.2. XML data mining
3. Preliminaries
3.1. Types of XML structural changes
3.2. XML structural delta
3.3. Dynamic metrics
3.4. XML structural delta mining
4. Discovering interesting structures
4.1. Frequently changing structure
4.2. Frozen structure
4.3. Periodic dynamic structure
4.4. Increasing and decreasing dynamic structures
4.5. Outlier structure
5. Discovering association rules
5.1. Positive association rule mining
5.2. Negative association rule mining
5.3. Specialized association rule mining
6. Structure change pattern-based classification
7. Research issues
7.1. Duration of real data collection
7.2. Determining schedules for structural delta generation
7.3. Efficient and scalable structural change detection
7.4. Scalable and efficient mining algorithms
7.5. Semantic-conscious algorithms
7.6. Unified mining framework
8. Applications
8.1. Applications of interesting structures
8.1.1. Efficient change detection for very large XML documents
8.1.2. Efficient XML indexing
8.1.3. Dynamic-conscious XML caching
8.1.4. Semantic meaning extraction
8.2. Applications of XML structural delta association
8.2.1. Structure-based document clustering
8.2.2. Semantic XML search engine
8.3. Applications of change pattern-based classification
9. Conclusions
References
Vitae





Data & Knowledge Engineering
Volume 59, Issue 3, December 2006, Pages 627-651
Including: ER 2003 - Selection of papers presented at the 22nd International Conference on Conceptual Modeling, 22nd International Conference on Conceptual Modeling
 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.