ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
advertisementadvertisement
Data & Knowledge Engineering
Volume 59, Issue 3, December 2006, Pages 603-626
Including: ER 2003 - Selection of papers presented at the 22nd International Conference on Conceptual Modeling, 22nd International Conference on Conceptual Modeling
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Article
Purchase PDF (332 K)

 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/j.datak.2005.10.004    How to Cite or Link Using DOI (Opens New Window)
Copyright © 2005 Elsevier B.V. All rights reserved.

Mining itemset utilities from transaction databases

Hong Yaoa, E-mail The Corresponding Author and Howard J. HamiltonCorresponding Author Contact Information, a, E-mail The Corresponding Author

aDepartment of Computer Science, University of Regina, 3737 Wascana Parkway, Regina, SK, Canada S4S 0A2

Received 13 October 2005; 
accepted 13 October 2005. 
Available online 18 November 2005.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

The rationale behind mining frequent itemsets is that only itemsets with high frequency are of interest to users. However, the practical usefulness of frequent itemsets is limited by the significance of the discovered itemsets. A frequent itemset only reflects the statistical correlation between items, and it does not reflect the semantic significance of the items. In this paper, we propose a utility based itemset mining approach to overcome this limitation. The proposed approach permits users to quantify their preferences concerning the usefulness of itemsets using utility values. The usefulness of an itemset is characterized as a utility constraint. That is, an itemset is interesting to the user only if it satisfies a given utility constraint. We show that the pruning strategies used in previous itemset mining approaches cannot be applied to utility constraints. In response, we identify several mathematical properties of utility constraints. Then, two novel pruning strategies are designed. Two algorithms for utility based itemset mining are developed by incorporating these pruning strategies. The algorithms are evaluated by applying them to synthetic and real world databases. Experimental results show that the proposed algorithms are effective on the databases tested.

Keywords: Utility mining; Data mining; Semantic significance; User preference; Itemset

Article Outline

1. Introduction
2. Related work
3. Definitions
4. Theoretical foundation
5. Pruning strategy
6. Algorithms for utility based itemset mining
6.1. The UMining algorithm
6.2. The UMining_H algorithm
7. Experimental results
7.1. Experimental summary
7.2. Algorithm comparison
7.3. Utility versus support
8. Conclusion
Acknowledgements
References
Vitae




Data & Knowledge Engineering
Volume 59, Issue 3, December 2006, Pages 603-626
Including: ER 2003 - Selection of papers presented at the 22nd International Conference on Conceptual Modeling, 22nd International Conference on Conceptual Modeling
 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.