ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
advertisementadvertisement
Discrete Applied Mathematics
Volume 144, Issues 1-2, 30 November 2004, Pages 79-102
Discrete Mathematics and Data Mining
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Article
Purchase PDF (360 K)

 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/j.dam.2003.08.013    How to Cite or Link Using DOI (Opens New Window)
Copyright © 2004 Elsevier B.V. All rights reserved.

Pareto-optimal patterns in logical analysis of datastar, open

Peter L. Hammera, E-mail The Corresponding Author, Alexander Koganb, E-mail The Corresponding Author, Bruno Simeonec, E-mail The Corresponding Author and Sándor Szedmáka, E-mail The Corresponding Author

aRUTCOR, Rutgers University, 640 Bartholomew Road, Piscataway, New Jersey 08854-8003, USA bAccounting and Information Systems, Rutgers Business School, Rutgers University, 180 University Ave., Newark, New Jersey 07102, USA cDipartimento di Statistica, Probabilità e Statistiche Applicate,University of Rome I “La Sapienza”,Piazzale Aldo Moro 5, 00185 - Roma, Italy

Received 5 August 2002; 
revised 10 March 2003; 
accepted 20 August 2003. 
Available online 8 August 2004.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

Patterns are the key building blocks in the logical analysis of data (LAD). It has been observed in empirical studies and practical applications that some patterns are more “suitable” than others for use in LAD. In this paper, we model various such suitability criteria as partial preorders defined on the set of patterns. We introduce three such preferences, and describe patterns which are Pareto-optimal with respect to any one of them, or to certain combinations of them. We develop polynomial time algorithms for recognizing Pareto-optimal patterns, as well as for transforming an arbitrary pattern to a better Pareto-optimal one with respect to any one of the considered criteria, or their combinations. We obtain analytical representations characterizing some of the sets of Pareto-optimal patterns, and investigate the computational complexity of generating all Pareto-optimal patterns. The empirical evaluation of the relative merits of various types of Pareto-optimality is carried out by comparing the classification accuracy of Pareto-optimal theories on several real life data sets. This evaluation indicates the advantages of “strong patterns”, i.e. those patterns which are Pareto-optimal with respect to the “evidential preference” introduced in this paper.

Keywords: Extremal patterns, Data mining, Machine learning, Classification accuracy, Boolean functions

Article Outline

1. Introduction
2. Notation and terminology
3. Preferences and Pareto-optimal patterns
4. Pareto-optimization of patterns and theories
4.1. Prime patterns
4.2. Spanned patterns
4.3. Strong patterns
4.4. Pareto-optimization of theories
5. Complexity of generation of strong spanned patterns
6. Analytical description of Pareto-optimal patterns
6.1. Description of patterns and prime patterns
6.2. Description of spanned patterns
6.3. Description of strong spanned patterns
7. Empirical evaluation
8. Conclusions
References




Discrete Applied Mathematics
Volume 144, Issues 1-2, 30 November 2004, Pages 79-102
Discrete Mathematics and Data Mining
 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.