research-article

Random-Walk Graph Embeddings and the Influence of Edge Weighting Strategies in Community Detection Tasks

Authors:
Andreas Kosmatopoulos

Information Technologies Institute, Centre for Research and Technology Hellas, Thessaloniki, Greece

Information Technologies Institute, Centre for Research and Technology Hellas, Thessaloniki, Greece
View Profile

,
Kostas Loumponias

Information Technologies Institute, Centre for Research and Technology Hellas, Thessaloniki, Greece

Information Technologies Institute, Centre for Research and Technology Hellas, Thessaloniki, Greece
View Profile

,
Despoina Chatzakou

Information Technologies Institute, Centre for Research and Technology Hellas, Thessaloniki, Greece

Information Technologies Institute, Centre for Research and Technology Hellas, Thessaloniki, Greece
View Profile

,
Theodora Tsikrika

Information Technologies Institute, Centre for Research and Technology Hellas, Thessaloniki, Greece

Information Technologies Institute, Centre for Research and Technology Hellas, Thessaloniki, Greece
View Profile

,
Stefanos Vrochidis

Information Technologies Institute, Centre for Research and Technology Hellas, Thessaloniki, Greece

Information Technologies Institute, Centre for Research and Technology Hellas, Thessaloniki, Greece
View Profile

,
Ioannis Kompatsiaris

Information Technologies Institute, Centre for Research and Technology Hellas, Thessaloniki, Greece

Information Technologies Institute, Centre for Research and Technology Hellas, Thessaloniki, Greece
View Profile

OASIS '21: Proceedings of the 2021 Workshop on Open Challenges in Online Social NetworksOctober 2021Pages 9–13https://doi.org/10.1145/3472720.3483621

Published:28 October 2021Publication History

OASIS '21: Proceedings of the 2021 Workshop on Open Challenges in Online Social Networks

Pages 9–13

ABSTRACT

Graph embedding methods have been developed over recent years with the goal of mapping graph data structures into low dimensional vector spaces so that conventional machine learning tasks can be efficiently evaluated. In particular, random walk based methods sample the graph using random walk sequences that capture a graph's structural properties. In this work, we study the influence of edge weighting strategies that bias the random walk process and we are able to demonstrate that under several settings the biased random walks enhance downstream community detection tasks.

References

Lada A Adamic and Eytan Adar. 2003. Friends and neighbors on the web. Social networks , Vol. 25, 3 (2003), 211--230.Google Scholar
Smriti Bhagat, Graham Cormode, and S Muthukrishnan. 2011. Node classification in social networks. In Social network data analytics . Springer, 115--148.Google Scholar
Christopher Bishop. 2006. Pattern Recognition and Machine Learning. Pattern Recognition and Machine Learning (2006). Google ScholarDigital Library
Pasquale De Meo, Emilio Ferrara, Giacomo Fiumara, and Alessandro Provetti. 2011. Generalized louvain method for community detection in large networks. In 2011 11th international conference on intelligent systems design and applications. IEEE, 88--93.Google ScholarCross Ref
Pasquale De Meo, Emilio Ferrara, Giacomo Fiumara, and Angela Ricciardello. 2012. A novel measure of edge centrality in social networks. Knowledge-based systems , Vol. 30 (2012), 136--150. Google ScholarDigital Library
Santo Fortunato and Marc Barthelemy. 2007. Resolution limit in community detection. Proceedings of the national academy of sciences , Vol. 104, 1 (2007), 36--41.Google ScholarCross Ref
Palash Goyal and Emilio Ferrara. 2018. Graph embedding techniques, applications, and performance: A survey. Knowledge-Based Systems , Vol. 151 (2018), 78--94.Google ScholarCross Ref
Aditya Grover and Jure Leskovec. 2016. node2vec: Scalable Feature Learning for Networks. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, August 13--17, 2016. ACM , 855--864. Google ScholarDigital Library
Muhammad Aqib Javed, Muhammad Shahzad Younis, Siddique Latif, Junaid Qadir, and Adeel Baig. 2018. Community detection in networks: A multidisciplinary review. Journal of Network and Computer Applications , Vol. 108 (2018), 87--111. Google ScholarDigital Library
Glen Jeh and Jennifer Widom. 2002. SimRank: a measure of structural-context similarity. In Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining . 538--543. Google ScholarDigital Library
Di Jin, Zhizhi Yu, Pengfei Jiao, Shirui Pan, Philip S Yu, and Weixiong Zhang. 2021. A survey of community detection approaches: From statistical modeling to deep learning. arXiv preprint arXiv:2101.01669 (2021).Google Scholar
Alireza Khadivi, Ali Ajdari Rad, and Martin Hasler. 2011. Network community-detection enhancement by proper weighting. Physical Review E , Vol. 83, 4 (2011), 046104.Google ScholarCross Ref
Andrea Lancichinetti, Santo Fortunato, and Filippo Radicchi. 2008. Benchmark graphs for testing community detection algorithms. Physical review E , Vol. 78, 4 (2008), 046110.Google Scholar
Jure Leskovec and Andrej Krevl. 2014. SNAP Datasets: Stanford Large Network Dataset Collection. http://snap.stanford.edu/data .Google Scholar
Linyuan Lü and Tao Zhou. 2011. Link prediction in complex networks: A survey. Physica A: statistical mechanics and its applications , Vol. 390, 6 (2011), 1150--1170.Google Scholar
Xiaoyan Lu, Konstantin Kuzmin, Mingming Chen, and Boleslaw K Szymanski. 2018. Adaptive modularity maximization via edge weighting scheme. Information Sciences , Vol. 424 (2018), 55--68. Google ScholarDigital Library
Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013a. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013).Google Scholar
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. 2013b. Distributed representations of words and phrases and their compositionality. Advances in neural information processing systems , Vol. 26 (2013), 3111--3119. Google ScholarDigital Library
Symeon Papadopoulos, Yiannis Kompatsiaris, Athena Vakali, and Ploutarchos Spyridonos. 2012. Community detection in social media. Data Mining and Knowledge Discovery , Vol. 24, 3 (2012), 515--554. Google ScholarDigital Library
Bryan Perozzi, Rami Al-Rfou, and Steven Skiena. 2014. DeepWalk: online learning of social representations. In The 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '14, New York, NY, USA - August 24 - 27, 2014. ACM , 701--710. Google ScholarDigital Library
Nguyen Xuan Vinh, Julien Epps, and James Bailey. 2010. Information theoretic measures for clusterings comparison: Variants, properties, normalization and correction for chance. The Journal of Machine Learning Research , Vol. 11 (2010), 2837--2854. Google ScholarDigital Library
Bernard L Welch. 1947. The generalization of ?Student's' problem when several different population variances are involved. Biometrika , Vol. 34, 1--2 (1947), 28--35.Google ScholarCross Ref
Jaewon Yang and Jure Leskovec. 2015. Defining and evaluating network communities based on ground-truth. Knowledge and Information Systems , Vol. 42, 1 (2015), 181--213. Google ScholarDigital Library

Index Terms

Random-Walk Graph Embeddings and the Influence of Edge Weighting Strategies in Community Detection Tasks
1. Computing methodologies
  1. Machine learning
    1. Machine learning approaches
      1. Learning latent representations
2. Networks
  1. Network types
    1. Overlay and other logical network structures
      1. Online social networks

Recommendations

Graphs with multiplicative vertex-coloring 2-edge-weightings

A k-weighting w of a graph is an assignment of an integer weight $$w(e)\in \{1,...k\}$$w(e)ź{1,...k} to each edge e. Such an edge weighting induces a vertex coloring c defined by $$c(v)=\mathop {\displaystyle {\prod }}\limits _{v\in e}w(e).$$c(v)=źvźew(...
Read More
Hub‐aware random walk graph embedding methods for classification
Abstract
In the last two decades, we are witnessing a huge increase of valuable big data structured in the form of graphs or networks. To apply traditional machine learning and data analytic techniques to such data it is necessary to transform graphs ...
Read More
Community Detection Using Restrained Random-Walk Similarity
In this paper, we propose a <italic>restrained random-walk similarity method</italic> for detecting the community structures of graphs. The basic premise of our method is that the starting vertices of finite-length random walks are judged to be in the ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
OASIS '21: Proceedings of the 2021 Workshop on Open Challenges in Online Social Networks
October 2021
44 pages
ISBN:9781450386326
DOI:10.1145/3472720
General Chairs:
Barbara Guidi
University of Pisa, Italy
,
Andrea Michienzi
University of Pisa, Italy
,
Laura Ricci
University of Pisa, Italy
Copyright © 2021 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 28 October 2021
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
community detection
deepwalk
edge weighting
latent representation
node2vec
Qualifiers
- research-article
Conference
Upcoming Conference
HT '24

Sponsor:

sigweb

35th ACM Conference on Hypertext and Social Media

September 10 - 13, 2024

Poznan , Poland
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 81
  Total Downloads
- Downloads (Last 12 months)16
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Random-Walk Graph Embeddings and the Influence of Edge Weighting Strategies in Community Detection Tasks

OASIS '21: Proceedings of the 2021 Workshop on Open Challenges in Online Social Networks

ABSTRACT

References

Cited By

Index Terms

Recommendations

Graphs with multiplicative vertex-coloring 2-edge-weightings

Hub‐aware random walk graph embedding methods for classification

Community Detection Using Restrained Random-Walk Similarity