ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
advertisementadvertisement
Information Sciences
Volume 165, Issues 1-2, 3 September 2004, Pages 73-90
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Article
Purchase PDF (247 K)

 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/j.ins.2003.09.018    How to Cite or Link Using DOI (Opens New Window)
Copyright © 2003 Elsevier Inc. All rights reserved.

Building multi-way decision trees with numerical attributes

Fernando BerzalCorresponding Author Contact Information, E-mail The Corresponding Author, Juan-Carlos CuberoE-mail The Corresponding Author, Nicolás MarínE-mail The Corresponding Author and Daniel SánchezE-mail The Corresponding Author

Department of Computer Science and Artificial Intelligence, University of Granada, 18071, Granada, Spain

Received 20 June 2002; 
Revised 20 June 2003; 
accepted 4 September 2003. 
Available online 4 November 2003.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

Decision trees are probably the most popular and commonly used classification model. They are recursively built following a top-down approach (from general concepts to particular examples) by repeated splits of the training dataset. When this dataset contains numerical attributes, binary splits are usually performed by choosing the threshold value which minimizes the impurity measure used as splitting criterion (e.g. C4.5 gain ratio criterion or CART Gini's index). In this paper we propose the use of multi-way splits for continuous attributes in order to reduce the tree complexity without decreasing classification accuracy. This can be done by intertwining a hierarchical clustering algorithm with the usual greedy decision tree learning.

Author Keywords: Supervised learning; Classification; Decision trees; Numerical attributes; Hierarchical clustering

Article Outline

1. Introduction
2. Splitting criteria
3. Binary splits for numerical attributes
4. Multi-way splits for numerical attributes
4.1. Classical value clustering
4.2. Discretization techniques
4.3. An alternative approach: taking context into account
4.3.1. An example
4.3.2. Measuring similarity
4.3.3. Interleaving the hierarchical clustering algorithm with the TDIDT evaluation process
5. Experimental results
6. Conclusions
References



Information Sciences
Volume 165, Issues 1-2, 3 September 2004, Pages 73-90
 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.