Modelling Zeros in Blockmodelling

Park, Laurence A. F.; Ganji, Mohadeseh; Demirovic, Emir; Chan, Jeffrey; Stuckey, Peter; Bailey, James; Leckie, Christopher; Kotagiri, Rao

doi:10.1007/978-3-031-05936-0_15

Laurence A. F. Park¹³,
Mohadeseh Ganji¹⁴,
Emir Demirovic¹⁵,
Jeffrey Chan¹⁶,
Peter Stuckey¹⁷,
James Bailey¹⁸,
Christopher Leckie¹⁸ &
…
Rao Kotagiri¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13281))

Included in the following conference series:

Pacific-Asia Conference on Knowledge Discovery and Data Mining

2304 Accesses

Abstract

Blockmodelling is the process of determining community structure in a graph. Real graphs contain noise and so it is up to the blockmodelling method to allow for this noise and reconstruct the most likely role memberships and role relationships. Relationships are encoded in a graph using the absence and presence of edges. Two objects are considered similar if they each have edges to a third object. However, the information provided by missing edges is ambiguous and therefore can be measured in different ways. In this article, we examine the effect of the choice of block metric on blockmodelling accuracy and find that data relationships can be position based or set based. We hypothesise that this is due to the data containing either Hamming noise or Jaccard noise. Experiments performed on simulated data show that when no noise is present, the accuracy is independent of the choice of metric. But when noise is introduced, high accuracy results are obtained when the choice of metric matches the type of noise.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Stochastic Blockmodels Meets Overlapping Community Detection

Scalable Detection of Overlapping Communities and Role Assignments in Networks via Bayesian Probabilistic Generative Affiliation Modeling

Minimum Entropy Stochastic Block Models Neglect Edge Distribution Heterogeneity

Notes

1.
www-personal.umich.edu/~mejn/ vlado.fmf.uni-lj.si/pub/networks/pajek/.

References

Chan, J., Liu, W., Kan, A., Leckie, C., Bailey, J., Kotagiri, R.: Discovering latent blockmodels in sparse and noisy graphs using non-negative matrix factorisation. In: CIKM, pp. 811–816. ACM (2013)
Google Scholar
Fiala, J., Paulusma, D.: The computational complexity of the role assignment problem. In: Baeten, J.C.M., Lenstra, J.K., Parrow, J., Woeginger, G.J. (eds.) ICALP 2003. LNCS, vol. 2719, pp. 817–828. Springer, Heidelberg (2003). https://doi.org/10.1007/3-540-45061-0_64
Chapter Google Scholar
Hahsler, M.: An experimental comparison of seriation methods for one-mode two-way data. Eur. J. Oper. Res. 257(1), 133–143 (2017)
Article MathSciNet Google Scholar
Hurley, C.B.: Clustering visualizations of multidimensional data. J. Comput. Graph. Stat. 13(4), 788–806 (2004)
Article MathSciNet Google Scholar
Karrer, B., Newman, M.E.: Stochastic blockmodels and community structure in networks. Phys. Rev. E 83(1), 016107 (2011)
Article MathSciNet Google Scholar
Park, L.A.F., Bezdek, J.C., Leckie, C., Kotagiri, R., Bailey, J., Palaniswami, M.: Visual assessment of clustering tendency for incomplete data. IEEE TKDE 28(12), 3409–3422 (2016)
Google Scholar
Park, L.A.F., Read, J.: A blended metric for multi-label optimisation and evaluation. In: Berlingerio, M., Bonchi, F., Gärtner, T., Hurley, N., Ifrim, G. (eds.) ECML PKDD 2018. LNCS (LNAI), vol. 11051, pp. 719–734. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-10925-7_44
Chapter Google Scholar
Reichardt, J., White, D.R.: Role models for complex networks. The Eur. Phys. J. B 60(2), 217–224 (2007)
Article Google Scholar
Von Luxburg, U.: A tutorial on spectral clustering. Stat. Comput. 17(4), 395–416 (2007)
Article MathSciNet Google Scholar
Zhang, Y., Yeung, D.Y.: Overlapping community detection via bounded nonnegative matrix tri-factorization. In: Proceedings of the 18th ACM SIGKDD, pp. 606–614. ACM (2012)
Google Scholar

Download references

Author information

Authors and Affiliations

Centre for Research in Mathematics and Data Science, Western Sydney University, Sydney, Australia
Laurence A. F. Park
ANZ, Melbourne, Australia
Mohadeseh Ganji
TU Delft, Delft, The Netherlands
Emir Demirovic
School of Computing Technologies, RMIT University, Melbourne, Australia
Jeffrey Chan
Department of Data Science and AI, Monash University, Clayton, Australia
Peter Stuckey
School of Computing and Information Systems, The University of Melbourne, Parkville, Australia
James Bailey, Christopher Leckie & Rao Kotagiri

Authors

Laurence A. F. Park
View author publications
You can also search for this author in PubMed Google Scholar
Mohadeseh Ganji
View author publications
You can also search for this author in PubMed Google Scholar
Emir Demirovic
View author publications
You can also search for this author in PubMed Google Scholar
Jeffrey Chan
View author publications
You can also search for this author in PubMed Google Scholar
Peter Stuckey
View author publications
You can also search for this author in PubMed Google Scholar
James Bailey
View author publications
You can also search for this author in PubMed Google Scholar
Christopher Leckie
View author publications
You can also search for this author in PubMed Google Scholar
Rao Kotagiri
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Laurence A. F. Park .

Editor information

Editors and Affiliations

Laboratory of Artificial Intelligence and Decision Support, University of Porto, Porto, Portugal
João Gama
School of Computing and Artificial Intelligence, Southwest Jiaotong University, Chengdu, China
Tianrui Li
National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, China
Yang Yu
School of Computer Science and Technology, University of Science and Technology of China, Hefei, China
Enhong Chen
JD iCity, JD Technology & JD Intelligent Cities Research, Beijing, China
Yu Zheng
School of Computing and Artificial Intelligence, Southwest Jiaotong University, Chengdu, China
Fei Teng

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Park, L.A.F. et al. (2022). Modelling Zeros in Blockmodelling. In: Gama, J., Li, T., Yu, Y., Chen, E., Zheng, Y., Teng, F. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2022. Lecture Notes in Computer Science(), vol 13281. Springer, Cham. https://doi.org/10.1007/978-3-031-05936-0_15

Download citation

DOI: https://doi.org/10.1007/978-3-031-05936-0_15
Published: 11 May 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-05935-3
Online ISBN: 978-3-031-05936-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Modelling Zeros in Blockmodelling