Skip to main content

A Machine Learning Algorithm Based on Supervised Clustering and Classification

  • Conference paper
  • First Online:
Active Media Technology (AMT 2001)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2252))

Included in the following conference series:

Abstract

In this paper a novel data mining technique - Clustering and Classification Algorithm-Supervised (CCA-S)1 is introduced. CCA-S supports incremental learning and non-hierarchical clustering, and is scalable for processing large data sets. CCA-S incorporates the class information in making clustering decisions, and uses the resulting clusters to classify new data records. We apply and test CCA-S on several common data sets for classification problems. The testing results show that the classification performance of CCAS is comparable to the other classification algorithms such as decision trees, artificial neural networks and discriminant analysis.

A US and international patent on this algorithm has been filed, and is currently pending with ASU Case No. M1-015.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Cherkassky, V., Mulier, F.: Learning from Data. John Wiley & Sons, Inc. (1998)

    Google Scholar 

  2. Ester, M., Kriegel H.P., Sander, J., Wimmer, M., Xu, X.: Incremental Clustering for Mining in a data Warehousing Environment. Proceedings of 24th VLDB Conference. New York (1998)

    Google Scholar 

  3. Harsha, S.G., Choudhary, A.: Parallel Subspace Clustering for Very Large Data Sets. Techinical Report No. CPDC-TR-9906-010. Northwestern University (1999)

    Google Scholar 

  4. Jain, A. K., Dubes, R. C.: Algorithms for Clustering Data. Prentice Hall, New Jersey (1988)

    MATH  Google Scholar 

  5. Johnson, R. A., Wichern, D. W.: Applied multivariate Statistical Analysis. Prentice Hall, New Jersey (1998)

    Google Scholar 

  6. Lim, T., Loh, W., Shih, Y.: A Comparison of Prediction Accuracy, Complexity, and Training Time of Thirty-three Old and New Classification Algorithms. Machine Learning, Vol. 40. (2000) 203–228

    Article  MATH  Google Scholar 

  7. Mangasarian, O. L., Wolberg, W. H.: Cancer Diagnosis via Linear Programming. SIAM News, Vol. 23(5). (1990) 1–18.

    Google Scholar 

  8. Mitchell, T.: Machine Learning. WCB/McGraw-Hill. (1997)

    Google Scholar 

  9. Sheikholeslami, G., Chatterjee, S., Zhang, A.: WaveCluster: A Multi-Resolution Clustering Approach for Very Large Spatial Databases. Proceedings of 24th VLDB Conference. New York (1998)

    Google Scholar 

  10. Zhang, T.: Data Clustering For Very Large Datasets Plus Applications. Ph.D. Thesis, Dept. of Computer Science, University of Wisconsin-Madison (1997)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ye, N., Li, X. (2001). A Machine Learning Algorithm Based on Supervised Clustering and Classification. In: Liu, J., Yuen, P.C., Li, Ch., Ng, J., Ishida, T. (eds) Active Media Technology. AMT 2001. Lecture Notes in Computer Science, vol 2252. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45336-9_38

Download citation

  • DOI: https://doi.org/10.1007/3-540-45336-9_38

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-43035-3

  • Online ISBN: 978-3-540-45336-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics