Skip to main content

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 236))

Abstract

Clustering is an unsupervised classification method and plays essential role in applications in diverse fields. The evolutionary methods attracted attention and gained popularity among the data mining researchers for clustering due to their expedient implementation, parallel nature, ability to search global optima, and other advantages over conventional methods. However, conventional clustering methods, e.g., K-means, are computationally efficient and widely used local search methods. Therefore, many researchers paid attention to hybrid algorithms. However, most of the algorithms lag in proper balancing of exploration and exploitation of solutions in the search space. In this work, the authors propose a hybrid method DKGK. It uses DE to diversify candidate solutions in the search space. The obtained solutions are refined by K-means. Further, GA with heuristic crossover operator is applied for fast convergence of solutions and the obtained solutions are further refined by K-means. This is why proposed method is called DKGK. Performance of the proposed method is compared to that of Deferential Evolution (DE), genetic algorithm (GA), a hybrid of DE and K-means (DEKM), and a hybrid of GA and K-Means (GAKM) based on the sum of intra-cluster distances. The results obtained on three real and two synthetic datasets are very encouraging as the proposed method DKGK outperforms all the competing methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Ali, M.M.: Törn, A.: Population set-based global optimization algorithms: Some modifications and numerical studies. Comput. Oper. Res. 31(10), 1703–1725 (2004)

    Google Scholar 

  2. Chang, D., Zhang, X., Zheng, C., Zhang, D.: A robust dynamic niching genetic algorithm with niche migration for automatic clustering problem. Pattern Recogn. 43, 1346–1360 (2010)

    Article  MATH  Google Scholar 

  3. Chiou, J.-P., Wang, F.-S.: A hybrid method of di?erential evolution with application to optimal control problems of a bioprocess system, In: IEEE World Congress on Computational Intelligence, Proceedings of the IEEE International Conference on Evolutionary Computation, pp. 627–632. (1998)

    Google Scholar 

  4. Chuang, L.Y., Hsiao, C.J., Yang, C.H.: Chaotic particle swarm optimization for data clustering. Expert Syst. Appl. 38, 14555–14563 (2011)

    Article  Google Scholar 

  5. Cura, T.: A particle swarm optimization approach to clustering. Expert Syst. Appl. 39, 1582–1588 (2012)

    Article  Google Scholar 

  6. Das, S., Abraham, A., Konar, A.: Automatic clustering using an improved differential evolution algorithm. IEEE Trans. Syst. Man Cybern. Part A: Syst. Hum. 38(1), 218–237 (2008)

    Article  Google Scholar 

  7. Goldberg, D.E.: Genetic Algorithms-in Search, Optimization and Machine Learning. Addison-Wesley Publishing Company Inc., London (1989)

    MATH  Google Scholar 

  8. Handl, J., Knowles, J.: Improving the scalability of multiobjective clustering. In: Proceedings of the Congress on Evolutionary Computation, vol. 3, pp. 2372–2379 (2005)

    Google Scholar 

  9. He, H., Tan, Y.: A two-stage genetic algorithm for automatic clustering. Neurocomput. 81, 49–59 (2012)

    Article  Google Scholar 

  10. Jain, A.K., Dubes, R.C.: Algorithms for Clustering Data. Prentice-Hall, Engle-wood Cliffs, NJ (1988)

    MATH  Google Scholar 

  11. Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. 31(3), 264–323 (1999)

    Article  Google Scholar 

  12. Kwedlo, W., Iwanowicz, P.: Using genetic algorithm for selection of initial cluster centers for the K-Means method. In: Proceedings of \(10^{th}\) International Conference on Artificial Intelligence and Soft Computing. Part II, LNAI 6114, pp. 165–172, (2010)

    Google Scholar 

  13. Kwedlo, W.: A clustering method combining differential evolution with the K-means algorithm. Pattern Recogn. Lett. 32, 1613–1621 (2011)

    Article  Google Scholar 

  14. Laszlo, M., Mukharjee, S.: A genetic algorithm that exchanges neighboring centers for k-means clustering. Pattern Recogn. Lett. 28, 2359–2366 (2007)

    Article  Google Scholar 

  15. MacQueen, J.: Some methods for classification and analysis of multivariate observations. In: Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, Vol. 1, 281–297, (1967)

    Google Scholar 

  16. Murphy, P., Aha, D.: UCI repository of machine learning data bases. (1995). URL http://www.sgi.com/tech/mlc/db

  17. Peltokangas, R., Sorsa, A.: Real-coded genetic algorithms and nonlinear parameter identification. University of Oulu Control Engineering Laboratory Report, vol. 34, pp. 1–32 (2008)

    Google Scholar 

  18. Storn, R., Price, K.: Differential evolution—a simple and efficient heuristic for global optimization over continuous spaces. J. Global Optim. 11(4), 341–359 (1997)

    Article  MATH  MathSciNet  Google Scholar 

  19. Tian, Y., Liu, D., Qi, H.: K-harmonic means data clustering with differential evolution. In: Proceedings International Conference on Future BioMedical Information, Engineering, pp. 369–372, (2009)

    Google Scholar 

  20. Tvrd’ık, J., Křiv’y, I.: Differential evolution with competing strategies applied to partitional clustering. In: Proceedings Symposium on Swarm Intelligence and Differential Intelligence, LNCS 7269. pp. 136–144 (2012)

    Google Scholar 

  21. Velmurugan, T., Santhanam, T.: A survey of partition based clustering algorithms on data mining: an experimental approach. Int. Technol. J. 10, 478–484 (2011)

    Google Scholar 

  22. Xu, R., Wunsch II, D.: Survey of clustering algorithms. IEEE Trans. Neural Networks 16(3), 645–678 (2005)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jay Prakash .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer India

About this paper

Cite this paper

Prakash, J., Singh, P.K. (2014). An Effective Hybrid Method Based on DE, GA, and K-means for Data Clustering. In: Babu, B., et al. Proceedings of the Second International Conference on Soft Computing for Problem Solving (SocProS 2012), December 28-30, 2012. Advances in Intelligent Systems and Computing, vol 236. Springer, New Delhi. https://doi.org/10.1007/978-81-322-1602-5_155

Download citation

  • DOI: https://doi.org/10.1007/978-81-322-1602-5_155

  • Published:

  • Publisher Name: Springer, New Delhi

  • Print ISBN: 978-81-322-1601-8

  • Online ISBN: 978-81-322-1602-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics