skip to main content
10.1145/3472720.3483621acmconferencesArticle/Chapter ViewAbstractPublication PageshtConference Proceedingsconference-collections
research-article

Random-Walk Graph Embeddings and the Influence of Edge Weighting Strategies in Community Detection Tasks

Authors Info & Claims
Published:28 October 2021Publication History

ABSTRACT

Graph embedding methods have been developed over recent years with the goal of mapping graph data structures into low dimensional vector spaces so that conventional machine learning tasks can be efficiently evaluated. In particular, random walk based methods sample the graph using random walk sequences that capture a graph's structural properties. In this work, we study the influence of edge weighting strategies that bias the random walk process and we are able to demonstrate that under several settings the biased random walks enhance downstream community detection tasks.

References

  1. Lada A Adamic and Eytan Adar. 2003. Friends and neighbors on the web. Social networks , Vol. 25, 3 (2003), 211--230.Google ScholarGoogle Scholar
  2. Smriti Bhagat, Graham Cormode, and S Muthukrishnan. 2011. Node classification in social networks. In Social network data analytics . Springer, 115--148.Google ScholarGoogle Scholar
  3. Christopher Bishop. 2006. Pattern Recognition and Machine Learning. Pattern Recognition and Machine Learning (2006). Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Pasquale De Meo, Emilio Ferrara, Giacomo Fiumara, and Alessandro Provetti. 2011. Generalized louvain method for community detection in large networks. In 2011 11th international conference on intelligent systems design and applications. IEEE, 88--93.Google ScholarGoogle ScholarCross RefCross Ref
  5. Pasquale De Meo, Emilio Ferrara, Giacomo Fiumara, and Angela Ricciardello. 2012. A novel measure of edge centrality in social networks. Knowledge-based systems , Vol. 30 (2012), 136--150. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Santo Fortunato and Marc Barthelemy. 2007. Resolution limit in community detection. Proceedings of the national academy of sciences , Vol. 104, 1 (2007), 36--41.Google ScholarGoogle ScholarCross RefCross Ref
  7. Palash Goyal and Emilio Ferrara. 2018. Graph embedding techniques, applications, and performance: A survey. Knowledge-Based Systems , Vol. 151 (2018), 78--94.Google ScholarGoogle ScholarCross RefCross Ref
  8. Aditya Grover and Jure Leskovec. 2016. node2vec: Scalable Feature Learning for Networks. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, August 13--17, 2016. ACM , 855--864. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Muhammad Aqib Javed, Muhammad Shahzad Younis, Siddique Latif, Junaid Qadir, and Adeel Baig. 2018. Community detection in networks: A multidisciplinary review. Journal of Network and Computer Applications , Vol. 108 (2018), 87--111. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Glen Jeh and Jennifer Widom. 2002. SimRank: a measure of structural-context similarity. In Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining . 538--543. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Di Jin, Zhizhi Yu, Pengfei Jiao, Shirui Pan, Philip S Yu, and Weixiong Zhang. 2021. A survey of community detection approaches: From statistical modeling to deep learning. arXiv preprint arXiv:2101.01669 (2021).Google ScholarGoogle Scholar
  12. Alireza Khadivi, Ali Ajdari Rad, and Martin Hasler. 2011. Network community-detection enhancement by proper weighting. Physical Review E , Vol. 83, 4 (2011), 046104.Google ScholarGoogle ScholarCross RefCross Ref
  13. Andrea Lancichinetti, Santo Fortunato, and Filippo Radicchi. 2008. Benchmark graphs for testing community detection algorithms. Physical review E , Vol. 78, 4 (2008), 046110.Google ScholarGoogle Scholar
  14. Jure Leskovec and Andrej Krevl. 2014. SNAP Datasets: Stanford Large Network Dataset Collection. http://snap.stanford.edu/data .Google ScholarGoogle Scholar
  15. Linyuan Lü and Tao Zhou. 2011. Link prediction in complex networks: A survey. Physica A: statistical mechanics and its applications , Vol. 390, 6 (2011), 1150--1170.Google ScholarGoogle Scholar
  16. Xiaoyan Lu, Konstantin Kuzmin, Mingming Chen, and Boleslaw K Szymanski. 2018. Adaptive modularity maximization via edge weighting scheme. Information Sciences , Vol. 424 (2018), 55--68. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013a. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013).Google ScholarGoogle Scholar
  18. Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. 2013b. Distributed representations of words and phrases and their compositionality. Advances in neural information processing systems , Vol. 26 (2013), 3111--3119. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Symeon Papadopoulos, Yiannis Kompatsiaris, Athena Vakali, and Ploutarchos Spyridonos. 2012. Community detection in social media. Data Mining and Knowledge Discovery , Vol. 24, 3 (2012), 515--554. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Bryan Perozzi, Rami Al-Rfou, and Steven Skiena. 2014. DeepWalk: online learning of social representations. In The 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '14, New York, NY, USA - August 24 - 27, 2014. ACM , 701--710. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Nguyen Xuan Vinh, Julien Epps, and James Bailey. 2010. Information theoretic measures for clusterings comparison: Variants, properties, normalization and correction for chance. The Journal of Machine Learning Research , Vol. 11 (2010), 2837--2854. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Bernard L Welch. 1947. The generalization of ?Student's' problem when several different population variances are involved. Biometrika , Vol. 34, 1--2 (1947), 28--35.Google ScholarGoogle ScholarCross RefCross Ref
  23. Jaewon Yang and Jure Leskovec. 2015. Defining and evaluating network communities based on ground-truth. Knowledge and Information Systems , Vol. 42, 1 (2015), 181--213. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Random-Walk Graph Embeddings and the Influence of Edge Weighting Strategies in Community Detection Tasks

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        OASIS '21: Proceedings of the 2021 Workshop on Open Challenges in Online Social Networks
        October 2021
        44 pages
        ISBN:9781450386326
        DOI:10.1145/3472720

        Copyright © 2021 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 28 October 2021

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Upcoming Conference

        HT '24
        35th ACM Conference on Hypertext and Social Media
        September 10 - 13, 2024
        Poznan , Poland
      • Article Metrics

        • Downloads (Last 12 months)16
        • Downloads (Last 6 weeks)1

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader