Abstract
This article studies the recovery of static communities in a temporal network. We introduce a temporal stochastic block model where dynamic interaction patterns between node pairs follow a Markov chain. We render this model versatile by adding degree correction parameters, describing the tendency of each node to start new interactions. We show that in some cases the likelihood of this model is approximated by the regularized modularity of a time-aggregated graph. This time-aggregated graph involves a trade-off between new edges and persistent edges. A continuous relaxation reduces the regularized modularity maximization to a normalized spectral clustering. We illustrate by numerical experiments the importance of edge persistence, both on simulated and real data sets.
This work has been done within the project of Inria - Nokia Bell Labs “Distributed Learning and Control for Network Analysis” and was partially supported by COSTNET Cost Action CA15109.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Avrachenkov, K., Dreveton, M., Leskelä, L.: Estimation of static community memberships from temporal network data. arXiv preprint arXiv:2008.04790 (2020)
Bhattacharyya, S., Chatterjee, S.: General community detection with optimal recovery conditions for multi-relational sparse networks with dependent layers. arXiv preprint arXiv:2004.03480 (2020)
Billingsley, P.: Statistical methods in Markov chains. Ann. Math. Stat. 32(1), 12–40 (1961)
Brandes, U., et al.: On finding graph clusterings with maximum modularity. In: Brandstädt, A., Kratsch, D., Müller, H. (eds.) WG 2007. LNCS, vol. 4769, pp. 121–132. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-74839-7_12
Chi, Y., Song, X., Zhou, D., Hino, K., Tseng, B.L.: Evolutionary spectral clustering by incorporating temporal smoothness. In: Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2007, pp. 153–162. Association for Computing Machinery, New York (2007)
Fortunato, S.: Community detection in graphs. Phys. Rep. 486(3), 75–174 (2010)
Fournet, J., Barrat, A.: Contact patterns among high school students. PLOS One 9(9), 1–17 (2014)
Ghasemian, A., Zhang, P., Clauset, A., Moore, C., Peel, L.: Detectability thresholds and optimal algorithms for community structure in dynamic networks. Phys. Rev. X 6(3), 031005 (2016)
Holland, P., Laskey, K.B., Leinhardt, S.: Stochastic blockmodels: first steps. Soc. Netw. 5, 109–137 (1983)
Holme, P., Saramäki, J.: Temporal networks. Phys. Rep. 519(3), 97–125 (2012). Temporal Networks
Karrer, B., Newman, M.E.J.: Stochastic blockmodels and community structure in networks. Phys. Rev. E 83, 016107 (2011)
Kumar, A., Kannan, R.: Clustering with spectral norm and the k-means algorithm. In: 2010 IEEE 51st Annual Symposium on Foundations of Computer Science, pp. 299–308. IEEE (2010)
Liu, F., Choi, D., Xie, L., Roeder, K.: Global spectral clustering in dynamic networks. Proc. Natl. Acad. Sci. 115(5), 927–932 (2018)
Mastrandrea, R., Fournet, J., Barrat, A.: Contact patterns in a high school: a comparison between data collected using wearable sensors, contact diaries and friendship surveys. PLOS One 10(9), 1–26 (2015)
Matias, C., Miele, V.: Statistical clustering of temporal networks through a dynamic stochastic block model. J. R. Stat. Soc.: Ser. B (Stat. Methodol.) 79(4), 1119–1141 (2017)
Mucha, P.J., Richardson, T., Macon, K., Porter, M.A., Onnela, J.P.: Community structure in time-dependent, multiscale, and multiplex networks. Science 328(5980), 876–878 (2010)
Newman, M.E.J.: Spectral methods for community detection and graph partitioning. Phys. Rev. E 88, 042822 (2013)
Newman, M.E.: Equivalence between modularity optimization and maximum likelihood methods for community detection. Phys. Rev. E 94(5), 052315 (2016)
Newman, M.E., Girvan, M.: Finding and evaluating community structure in networks. Phys. Rev. E 69(2), 026113 (2004)
Pamfil, A.R., Howison, S.D., Lambiotte, R., Porter, M.A.: Relating modularity maximization and stochastic block models in multilayer networks. SIAM J. Math. Data Sci. 1(4), 667–698 (2019)
Reichardt, J., Bornholdt, S.: Statistical mechanics of community detection. Phys. Rev. E 74(1), 016110 (2006)
Rossetti, G., Cazabet, R.: Community discovery in dynamic networks: a survey. ACM Comput. Surv. 51(2), 1–37 (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
A Proofs of Main Statements
A Proofs of Main Statements
1.1 A.1 Maximum Likelihood Computations (Proposition 1)
Proof
(Proof of Proposition 1). By the temporal Markov property, the log-likelihood of the model can be written as \(\log \mathbb {P}(A \, | \,Z,\theta ) = \log \mathbb {P}(A^1 \, | \,Z,\theta ) + \sum _{t=2}^T \mathbb {P}(A^t \, | \,A^{t-1},Z,\theta )\). By denoting \(\rho _a^{\theta _i\theta _j} = \log \frac{\mu _a^{\theta _i\theta _j}}{\nu _a^{\theta _i\theta _j}}\), we find that
where \(c_1(A) = \frac{1}{2} \sum _{i,j} \sum _a \delta (A_{ij}^1, a) \log \nu _a^{\theta _i\theta _j}\) does not depend on the community structure. Similarly, by denoting \(R^{\theta _i\theta _j}_{ab} = \log \frac{P_{ab}^{\theta _i\theta _j}}{Q_{ab}^{\theta _i\theta _j}}\) we find that
where \(c_t(A) = \frac{1}{2} \sum _{i,j} \sum _{a,b} \delta (A_{ij}^{t-1}, a) \delta (A_{ij}^t, b) \log Q_{ab}^{\theta _i\theta _j}\) does not depend on the community structure. Simple computations show that
and
By collecting the above observations, we now find that \(\log \mathbb {P}(A \, | \,Z, \theta )\) equals
where \(c(A) = \sum _t c_t(A)\) does not depend on Z. Hence the claim follows. \(\square \)
1.2 A.2 Approximation of the MLE
Recall the structural assumptions (3)–(4) about the degree correction parameters. Because \(P_{01}, Q_{01} = o(1)\), a first-order Taylor expansion yields
as well as \( \ell _{01}^{\theta _i\theta _j} \approx \log \frac{P_{01}}{Q_{01}} \), \(\ell _{10}^{\theta _i\theta _j} \approx \log \frac{ 1 - P_{11} }{ 1 - Q_{11} }\) and \(\ell _{11}^{\theta _i\theta _j} \approx \log \frac{P_{11}}{Q_{11}} \). Using these approximations in the MLE expression leads to the maximisation of
where \(\tilde{a}_{ij}^{t} = \alpha \left( A^t_{ \mathrm {new} } \right) _{ij} + \beta \left( A_{ \mathrm {pers} }^t \right) _{ij}\). Since \(\mu \) and \(\nu \) are stationary distributions,
Therefore, using \(W_{ij} = \sum _{t=2}^T \tilde{a}_{ij}\) we have
Since the community labeling are sampled uniformly at random and using the normalization for the \(\theta _i\)’s, we have
together with \( \bar{m} = \frac{N^2}{2} \frac{ \mu _1 \left( \alpha (1-P_{11}) + \beta P_{11} \right) + (K-1) \nu _1 \left( \alpha (1-Q_{11}) + \beta Q_{11} \right) }{ K }. \) \(\square \)
1.3 A.3 Modularity and Normalized Spectral Clustering
The regularized modularity of a partition \(Z \in [K]^N\) of the graph A is defined as
where \(d = A 1_n\) and \(\gamma \) is a resolution parameter. This can be rewritten as
where \( \tilde{Z} \in \{0,1\}^{N\times K}\) is the membership matrix associated to the vector Z, that is \( \tilde{Z} _{ik} = 1\) for \(k = Z_i\), and \( \tilde{Z} _{ik} = 0\) otherwise. As maximizing the modularity over \(Z \in \mathcal {Z}_{N,K}\) is in general NP-complete [4], it is convenient to perform a continuous relaxation. Following [17], we transform the problem into
The predicted membership matrix \( \hat{Z} \) is then recovered by performing an approximated solution to the following k-means problem (see [12])
The Lagrangian associated to the optimization problem (8) is
where \(\varLambda \in \mathbb {R}^{K\times K}\) is a symmetric matrix of Lagrangian multipliers. Up to a change of basis, we can assume that \(\varLambda \) is diagonal. The solution of (8) verifies
which is a generalized eigenvalue problem: the columns of X are the generalized eigenvectors, and the diagonal elements of \(\varLambda \) are the eigenvalues. In particular, since the constant vector \(1_n\) verifies \((A - \gamma \frac{d d^T}{2m} ) 1_n = (1-\gamma ) D 1_n\), we conclude that the eigenvalues should be larger than \(1-\gamma \) for the partition to be meaningful.
Multiplying the first equation by \(1_n^T\) leads to \((1-\gamma ) d^T X = d^T X \varLambda \), and therefore \(d^T X = 0\) (using the previous remark on \(\varLambda \)). The system then simplifies in
Defining a re-scaled vector \(U = D^{-1/2}X\) shows that U verifies \(D^{-1/2}A D^{-1/2} U = U \varLambda \) and \(U^T U = I_K\). Thus, the columns of U are eigenvectors of \(D^{-1/2}A D^{-1/2}\) associated to the K largest eigenvalue (or equivalently, the eigenvectors of \(\mathcal {L}= I_N - D^{-1/2}A D^{-1/2}\) associated to the K smallest eigenvalues).
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Avrachenkov, K., Dreveton, M., Leskelä, L. (2021). Recovering Communities in Temporal Networks Using Persistent Edges. In: Mohaisen, D., Jin, R. (eds) Computational Data and Social Networks. CSoNet 2021. Lecture Notes in Computer Science(), vol 13116. Springer, Cham. https://doi.org/10.1007/978-3-030-91434-9_22
Download citation
DOI: https://doi.org/10.1007/978-3-030-91434-9_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-91433-2
Online ISBN: 978-3-030-91434-9
eBook Packages: Computer ScienceComputer Science (R0)