ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
Data & Knowledge Engineering
Volume 62, Issue 1, July 2007, Pages 118-137
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Article
Purchase PDF (576 K)

 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/j.datak.2006.07.008    How to Cite or Link Using DOI (Opens New Window)
Copyright © 2006 Elsevier B.V. All rights reserved.

Soft constraint based pattern mining

Stefano Bistarellia, E-mail The Corresponding Author, E-mail The Corresponding Author and Francesco Bonchib, Corresponding Author Contact Information, E-mail The Corresponding Author, E-mail The Corresponding Author

aDipartimento di Scienze, Università degli Studi “G. d’Annunzio”, Viale Pindaro, 87 – 65127 Pescara, Italy bPisa KDD Laboratory, ISTI – CNR, Area della Ricerca di Pisa, Via Giuseppe Moruzzi, 1 – 56124 Pisa, Italy

Received 30 March 2006; 
revised 21 June 2006; 
accepted 27 July 2006. 
Available online 28 August 2006.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

The paradigm of pattern discovery based on constraints was introduced with the aim of providing to the user a tool to drive the discovery process towards potentially interesting patterns, with the positive side effect of achieving a more efficient computation. So far the research on this paradigm has mainly focused on the latter aspect: the development of efficient algorithms for the evaluation of constraint-based mining queries. Due to the lack of research on methodological issues, the constraint-based pattern mining framework still suffers from many problems which limit its practical relevance. In this paper, we analyze such limitations and we show how they flow out from the same source: the fact that in the classical constraint-based mining, a constraint is a rigid boolean function which returns either true or false. Indeed, interestingness is not a dichotomy. Following this consideration, we introduce the new paradigm of pattern discovery based on Soft Constraints, where constraints are no longer rigid boolean functions.

Albeit based on a simple idea, our proposal has many merits: it provides a rigorous theoretical framework, which is very general (having the classical paradigm as a particular instance), and which overcomes all the major methodological drawbacks of the classical constraint-based paradigm, representing an important step further towards practical pattern discovery.

Keywords: Frequent pattern mining; Constraint-based mining; Soft constraints; Semiring-based constraints

Article Outline

1. Background and motivations
1.1. Paper contributions and organization
2. Introducing soft constraints
3. Instances of the semiring
3.1. Fuzzy semiring
3.2. Probabilistic semiring
3.3. Weighted semiring
4. Soft constraint based pattern mining
5. Implementing the framework
5.1. Mining View the MathML source (λ-interesting itemsets on the fuzzy semiring)
5.2. Mining View the MathML source (λ-interesting itemsets on the probabilistic semiring
5.3. Mining View the MathML source (λ-interesting itemsets on the weighted semiring)
5.4. Mining top-k itemsets
6. Related work
7. Conclusions and future work
References
Vitae












 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.