Copyright © 2004 Elsevier B.V. All rights reserved.
On domain-partitioning induction criteria: worst-case bounds for the worst-case based
Received 26 June 2003;
References and further reading may be available for this article. To view references and further reading you must purchase this article.
Abstract
One of the most popular induction scheme for supervised learning is also one of the oldest. It builds a classifier in a top-down fashion, following the minimization of a so-called index criterion. While numerous papers have reported experiments on this scheme, little has been known on its theoretical aspect until recent works on decision trees and branching programs using a powerful classification tool: boosting.
In this paper, we look at this problem from a worst-case computational (rather than informational) standpoint. Our conclusions for the ranking of these indexes’ minimization follow almost exactly that of boosting (with matching upper and lowerbounds), and provide extensions to more classes of Boolean formulas such as decision lists, multilinear polynomials and symmetric functions. Our results also exhibit a strong worst-case for the induction scheme, as we build particularly hard samples for which the replacement of most index criteria, or the class of concept representation, even when producing the same ranking as boosting does for the indexes, makes no difference at all for the concept induced. This is clearly not a limit of previous analyses, but a consequence of the induction scheme.
Author Keywords: Classifier induction; Computational complexity; Index criteria






E-mail Article
Add to my Quick Links

Cited By in Scopus (1)








