Skip to main content
Log in

A subgraphs-density based overlapping community detection algorithm for large-scale complex networks

  • Regular Paper
  • Published:
Computing Aims and scope Submit manuscript

Abstract

Overlapping community detection with low computation is one of the fundamental issues and challenges in large-scale complex network analysis. Detecting a community in a network means discovering a cluster of network nodes so that the density of edges between them is high. The existing methods use entire structure information or subgraphs with a fixed size to detect dense communities. Therefore, they are not efficient and accurate for large-scale networks. In this paper, the authors introduce an overlapping community detection algorithm that gradually improves density estimation by expanding the size of subgraphs and gathering information during the search process for finding communities. It is an efficient algorithm with low computational complexity for complex networks with one hundred thousand to several millions of nodes. Experimental results on synthetic and real-world networks with several hundred to four million nodes validate the performance assessment of the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

Data Availability

The datasets analyses during the current study are available in the references [50].

References

  1. Perlasca P, Frasca M, Ba CT, Gliozzo J, Notaro M, Pennacchioni M, Valentini G, Mesiti M (2020) Multi-resolution visualization and analysis of biomolecular networks through hierarchical community detection and web-based graphical tools. PLoS ONE. https://doi.org/10.1371/journal.pone.0244241

    Article  Google Scholar 

  2. You X, Ma Y, Liu Z (2020) A three-stage algorithm on community detection in social networks. Knowl-Based Syst. https://doi.org/10.1016/j.knosys.2019.06.030

    Article  Google Scholar 

  3. Karatas A, Sahin S (2019) In: Proceedings of international congress on big data, deep learning and fighting cyber terrorism, IBIGDELFT 2018. https://doi.org/10.1109/IBIGDELFT.2018.8625349

  4. Varsha K, Patil KK (2020) In: Proceedings of the 5th international conference on inventive computation technologies, ICICT 2020. https://doi.org/10.1109/ICICT48043.2020.9112563

  5. Cao J, Bu Z, Wang Y, Yang H, Jiang J, Li HJ (2019) Detecting prosumer-community groups in smart grids from the multiagent. Perspective. https://doi.org/10.1109/TSMC.2019.2899366

    Article  Google Scholar 

  6. Allesina S, Pascual M (2008) Network structure, predator: prey modules, and stability in large food webs. Thyroid Res. https://doi.org/10.1007/s12080-007-0007-8

    Article  Google Scholar 

  7. Roy UK, Muhuri PK, Biswas SK (2021) NeSiFC: neighbors’ similarity-based fuzzy community detection using modified local random walk. IEEE Trans Cybern. https://doi.org/10.1109/TCYB.2021.3071542

    Article  Google Scholar 

  8. Shang R, Zhang W, Zhang J, Feng J, Jiao L (2022) Local community detection based on higher-order structure and edge information. Phys A. https://doi.org/10.1016/j.physa.2021.126513

    Article  Google Scholar 

  9. Van Lierde H, Chow TW, Chen G (2020) Scalable spectral clustering for overlapping community detection in large-scale networks. IEEE Trans Knowl Data Eng. https://doi.org/10.1109/TKDE.2019.2892096

    Article  Google Scholar 

  10. Cheng F, Wang C, Zhang X, Yang Y (2021) A local-neighborhood information based overlapping community detection algorithm for large-scale complex networks. IEEE/ACM Trans Netw 29(2):543–556. https://doi.org/10.1109/TNET.2020.3038756

    Article  Google Scholar 

  11. Aghaalizadeh S, Afshord ST, Bouyer A, Anari B (2021) A three-stage algorithm for local community detection based on the high node importance ranking in social networks. Phys A. https://doi.org/10.1016/j.physa.2020.125420

    Article  MathSciNet  MATH  Google Scholar 

  12. Ben El Kouni I, Karoui W, Romdhane LB (2020) Node importance based label propagation algorithm for overlapping community detection in networks. Expert Syst Appl. https://doi.org/10.1016/j.eswa.2019.113020

    Article  Google Scholar 

  13. Gao Y, Zhang H, Zhang Y (2019) Overlapping community detection based on conductance optimization in large-scale networks. Phys A. https://doi.org/10.1016/j.physa.2019.01.142

    Article  MathSciNet  MATH  Google Scholar 

  14. Wang X, Liu G, Li J, Nees JP (2017) Locating structural centers: a density-based clustering method for community detection. PLoS ONE. https://doi.org/10.1371/journal.pone.0169355

    Article  Google Scholar 

  15. Palla G, Derényi I, Farkas I, Vicsek T (2005) Uncovering the overlapping community structure of complex networks in nature and society. Nature. https://doi.org/10.1038/nature03607

    Article  Google Scholar 

  16. Raghavan UN, Albert R, Kumara S (2007) Near linear time algorithm to detect community structures in large-scale networks. Phys Rev E: Stat Nonlin Soft Matter Phys. https://doi.org/10.1103/PhysRevE.76.036106

    Article  Google Scholar 

  17. Clauset A (2005) Finding local community structure in networks. Phys Rev E: Stat Nonlin Soft Matter Phys. https://doi.org/10.1103/PhysRevE.72.026132

    Article  Google Scholar 

  18. Guo K, Huang X, Wu L, Chen Y (2021) Local community detection algorithm based on local modularity density. Appl Intell. https://doi.org/10.1007/s10489-020-02052-0

    Article  Google Scholar 

  19. Blondel VD, Guillaume JL, Lambiotte R, Lefebvre E (2008) Fast unfolding of communities in large networks. J Stat Mech: Theory Exp. https://doi.org/10.1088/1742-5468/2008/10/P10008

    Article  MATH  Google Scholar 

  20. Hao L, Liu XW (2019) A unified community detection algorithm in large-scale complex networks. Adv Complex Syst. https://doi.org/10.1142/S0219525919500048

    Article  MathSciNet  Google Scholar 

  21. Berahmand K, Bouyer A, Vasighi M (2018) Community detection in complex networks by detecting and expanding core nodes through extended local similarity of nodes. IEEE Trans Comput Soc Syst. https://doi.org/10.1109/TCSS.2018.2879494

    Article  Google Scholar 

  22. Berahmand K, Bouyer A (2018) LP-LPA: a link influence-based label propagation algorithm for discovering community structures in networks. Int J Mod Phys B. https://doi.org/10.1142/S0217979218500625

    Article  Google Scholar 

  23. Newman ME, Girvan M (2004) Finding and evaluating community structure in networks. Phys Rev E: Stat Nonlin Soft Matter Phys. https://doi.org/10.1103/PhysRevE.69.026113

    Article  Google Scholar 

  24. Souravlas S, Sifaleras A, Tsintogianni M, Katsavounis S (2021) A classification of community detection methods in social networks: a survey. Int J Gen Syst. https://doi.org/10.1080/03081079.2020.1863394

    Article  MathSciNet  Google Scholar 

  25. Samie ME, Hamzeh A (2017) Community detection in dynamic social networks: A local evolutionary approach. J Inf Sci. https://doi.org/10.1177/0165551516657717

    Article  Google Scholar 

  26. Liu X, Du Y, Jiang M, Zeng X (2020) Multiobjective particle swarm optimization based on network embedding for complex network community detection. IEEE Trans Comput Soc Syst. https://doi.org/10.1109/TCSS.2020.2964027

    Article  Google Scholar 

  27. Yang J, Leskovec J (2013) In: WSDM 2013 Proceedings of the 6th ACM international conference on web search and data mining. https://doi.org/10.1145/2433396.2433471

  28. Zeng X, Wang W, Chen C, Yen GG (2020) A consensus community-based particle swarm optimization for dynamic community detection. IEEE Trans Cybern. https://doi.org/10.1109/TCYB.2019.2938895

    Article  Google Scholar 

  29. Eiben AE, Smith JE (2015) Introduction to evolutionary computing genetic algorithms

  30. Zhan W, Deng L, Guan J, Niu J, Sun D (2020) Revealing dynamic communities in networks using genetic algorithm with merge and split operators. Physica A. https://doi.org/10.1016/j.physa.2020.124897

    Article  Google Scholar 

  31. Messaoudi I, Kamel N (2019) A multi-objective bat algorithm for community detection on dynamic social networks. Appl Intell. https://doi.org/10.1007/s10489-018-1386-9

    Article  Google Scholar 

  32. Orouskhani M, Shi D, Orouskhani Y (2021) Multi-objective evolutionary clustering with complex networks. Expert Syst Appl 165:113–916. https://doi.org/10.1016/j.eswa.2020.113916

    Article  Google Scholar 

  33. Traag VA, Waltman L, van Eck NJ (2019) From Louvain to Leiden: guaranteeing well-connected communities. Sci Rep. https://doi.org/10.1038/s41598-019-41695-z

    Article  Google Scholar 

  34. Li H, Zhang R, Zhao Z, Liu X (2021) Lpa-mni: an improved label propagation algorithm based on modularity and node importance for community detection. Entropy. https://doi.org/10.3390/e23050497

    Article  MathSciNet  Google Scholar 

  35. Shang R, Zhang W, Jiao L, Stolkin R, Xue Y (2017) A community integration strategy based on an improved modularity density increment for large-scale networks. Physica A. https://doi.org/10.1016/j.physa.2016.11.066

    Article  Google Scholar 

  36. Asmi K, Lotfi D, Abarda A (2021) The greedy coupled-seeds expansion method for the overlapping community detection in social networks. Computing. https://doi.org/10.1007/s00607-021-00948-4

    Article  Google Scholar 

  37. Whang JJ, Gleich DF, Dhillon IS (2016) Overlapping community detection using neighborhood-inflated seed expansion. IEEE Trans Knowl Data Eng. https://doi.org/10.1109/TKDE.2016.2518687

    Article  Google Scholar 

  38. Zhang X, Wang C, Su Y, Pan L, Zhang HF (2017) A fast overlapping community detection algorithm based on weak cliques for large-scale networks. IEEE Trans Comput Soc Syst 4(4):218–230. https://doi.org/10.1109/TCSS.2017.2749282

    Article  Google Scholar 

  39. Kumpula JM, Kivelä M, Kaski K, Saramäki J (2008) Sequential algorithm for fast clique percolation. Phys Rev E. https://doi.org/10.1103/physreve.78.026109

    Article  Google Scholar 

  40. Zhang Z, Cui P, Zhu W (2022) Deep learning on graphs: a survey. IEEE Trans Knowl Data Eng. https://doi.org/10.1109/TKDE.2020.2981333

    Article  Google Scholar 

  41. Su X, Xue S, Liu F, Wu J, Yang J, Zhou C, Hu W, Paris C, Nepal S, Jin D, Sheng QZ, Yu PS (2022) A comprehensive survey on community detection with deep learning. IEEE Trans Neural Netw Learn Syst. https://doi.org/10.1109/TNNLS.2021.3137396

    Article  Google Scholar 

  42. Bhatia V, Rani R (2018) DFuzzy: a deep learning-based fuzzy clustering model for large graphs. Knowl Inf Syst. https://doi.org/10.1007/s10115-018-1156-3

    Article  Google Scholar 

  43. D. Easley, J. Kleinberg (2010) Networks, crowds, and markets. https://doi.org/10.1017/cbo9780511761942

  44. Makris C, Pispirigos G (2021) Stacked community prediction: a distributed stacking-based community extraction methodology for large scale social networks. Big Data Cognit Comput. https://doi.org/10.3390/bdcc5010014

    Article  Google Scholar 

  45. Magelinski T, Bartulovic M, Carley KM (2021) Measuring node contribution to community structure with modularity vitality. IEEE Trans Netw Sci Eng. https://doi.org/10.1109/TNSE.2020.3049068

    Article  MathSciNet  Google Scholar 

  46. Pan X, Xu G, Wang B, Zhang T (2019) A novel community detection algorithm based on local similarity of clustering coefficient in social networks. IEEE Access. https://doi.org/10.1109/ACCESS.2019.2937580

    Article  Google Scholar 

  47. Li Z, Zhang S, Wang RS, Zhang XS, Chen L (2008) Quantitative function for community detection. Phys Rev E: Stat Nonlin Soft Matter Phys 77(3):036109. https://doi.org/10.1103/PhysRevE.77.036109

    Article  Google Scholar 

  48. Kumar A, Barman D, Sarkar R, Chowdhury N (2020) Overlapping community detection using multiobjective genetic algorithm. IEEE Trans Comput Soc Syst. https://doi.org/10.1109/TCSS.2020.2989295

    Article  Google Scholar 

  49. Lancichinetti A, Fortunato S, Radicchi F (2008) Benchmark graphs for testing community detection algorithms. Phys Rev E: Stat Nonlin Soft Matter Phys. https://doi.org/10.1103/PhysRevE.78.046110

    Article  Google Scholar 

  50. Leskovec J, Sosič R (2016) SNAP: a general-purpose network analysis and graph-mining library. ACM Trans Intell Syst Technol. https://doi.org/10.1145/2898361

    Article  Google Scholar 

  51. Danon L, Díaz-Guilera A, Duch J, Arenas A (2005) Comparing community structure identification. J Stat Mech: Theory Exp. https://doi.org/10.1088/1742-5468/2005/09/P09008

    Article  Google Scholar 

  52. Lancichinetti A, Fortunato S, Kertész J (2009) Detecting the overlapping and hierarchical community structure in complex networks. New J Phys. https://doi.org/10.1088/1367-2630/11/3/033015

    Article  Google Scholar 

  53. Rhouma D, Romdhane LB (2014) An efficient algorithm for community mining with overlap in social networks. Expert Syst Appl. https://doi.org/10.1016/j.eswa.2014.01.002

    Article  Google Scholar 

  54. Yang Z, Algesheimer R, Tessone CJ (2016) A comparative analysis of community detection algorithms on artificial networks. Sci Rep. https://doi.org/10.1038/srep30750

    Article  Google Scholar 

  55. Rossetti G, Cazabet R (2018) Community discovery in dynamic networks: a survey. https://doi.org/10.1145/3172867

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ali Hamzeh.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Saif, S.M., Samie, M.E. & Hamzeh, A. A subgraphs-density based overlapping community detection algorithm for large-scale complex networks. Computing 105, 151–185 (2023). https://doi.org/10.1007/s00607-022-01121-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00607-022-01121-1

Keywords

Mathematics Subject Classification

Navigation