Copyright © 2003 Elsevier Inc. All rights reserved.
Building multi-way decision trees with numerical attributes
Received 20 June 2002;
References and further reading may be available for this article. To view references and further reading you must purchase this article.
Abstract
Decision trees are probably the most popular and commonly used classification model. They are recursively built following a top-down approach (from general concepts to particular examples) by repeated splits of the training dataset. When this dataset contains numerical attributes, binary splits are usually performed by choosing the threshold value which minimizes the impurity measure used as splitting criterion (e.g. C4.5 gain ratio criterion or CART Gini's index). In this paper we propose the use of multi-way splits for continuous attributes in order to reduce the tree complexity without decreasing classification accuracy. This can be done by intertwining a hierarchical clustering algorithm with the usual greedy decision tree learning.
Author Keywords: Supervised learning; Classification; Decision trees; Numerical attributes; Hierarchical clustering







E-mail Article
Add to my Quick Links

Cited By in Scopus (7)






B(x)=max{μA(x),μB(x)}.