Abstract
Cascading failures on techno-socio-economic systems can have dramatic and catastrophic implications in society. A damage caused by a cascading failure, such as a power blackout, is complex to predict, understand, prevent and mitigate as such complex phenomena are usually a result of an interplay between structural and functional non-linear dynamics. Therefore, systematic and generic measurements of network reliability and repairability against cascading failures is of a paramount importance to build a more sustainable and resilient society. This paper contributes a probabilistic framework for measuring network reliability and repairability against cascading failures. In contrast to related work, the framework is designed on the basis that network reliability is multifaceted and therefore a single metric cannot adequately characterize it. The concept of ‘repairability envelope’ is introduced that illustrates trajectories of performance improvement and trade-offs for countermeasures against cascading failures. The framework is illustrated via four model-independent and application-independent metrics that characterize the topological damage, the network spread of the cascading failure, the evolution of its propagation, the correlation of different cascading failure outbreaks and other aspects by using probability density functions and cumulative distribution functions. The applicability of the framework is experimentally evaluated in a theoretical model of damage spread and an empirical one of power cascading failures. It is shown that the reliability and repairability in two systems of a totally different nature undergoing cascading failures can be better understood by the same generic measurements of the proposed framework.
Similar content being viewed by others
Notes
For instance, US government claims that cyber attacks are the cause of power outages in Ukraine in 2015: https://ics-cert.us-cert.gov/alerts/IR-ALERT-H-16-056-01 (last accessed: December 2016).
Available at https://github.com/SFINA/Flow-Monitor (last accessed: December 2016).
Available at https://github.com/SFINA (last accessed: December 2016).
Available at http://gephi.github.io (last accessed: December 2016).
These strategies correspond to the strategy 3 and 4 in the earlier work (Buzna et al. 2007)
Available at http://www.pserc.cornell.edu/matpower/docs/ref/matpower5.0/menu5.0.html (last accessed: December 2016).
Available at http://www.interpss.com (last accessed: December 2016).
Several such placements are evaluated in earlier work (Pournaras and Espejo-Uribe 2016).
References
Balasubramaniam, K., Venayagamoorthy, G. K., & Watson, N. (2013). Cellular neural network based situational awareness system for power grids. In The 2013 international joint conference on neural networks (IJCNN) (pp. 1–8): IEEE.
Buzna, L., Peters, K., Ammoser, H., Kühnert, C., & Helbing, D. (2007). Efficient response to cascading disaster spreading. Physical Review E, 75(5), 056,107.
Dobson, I., Carreras, B. A., Lynch, V. E., & Newman, D. E. (2007). Complex systems analysis of series of blackouts: Cascading failure, critical points, and self-organization. Chaos: An Interdisciplinary Journal of Nonlinear Science, 17(2), 026,103.
Dobson, I., Kim, J., & Wierzbicki, K.R. (2010). Testing branching process estimators of cascading failure with data from a simulation of transmission line outages. Risk Analysis, 30(4), 650– 662.
Huang, Z., Chen, Y., & Nieplocha, J. (2009). Massive contingency analysis with high performance computing. In 2009 IEEE power & energy society general meeting (pp. 1–8). IEEE.
Huseby, A.B., & Natvig, B. (2013). Discrete event simulation methods applied to advanced importance measures of repairable components in multistate network flow systems. Reliability Engineering & System Safety, 119, 186–198.
Kuo, W., & Zhu, X. (2012). Some recent advances on importance measures in reliability. IEEE Transactions on Reliability, 61(2), 344–360.
Liscouski, B., & Elliot, W. (2004). Final report on the august 14, 2003 blackout in the United States and Canada: causes and recommendations. A report to US Department of Energy 40(4).
Mazauric, D., Soltan, S., & Zussman, G. (2013). Computational analysis of cascading failures in power networks. ACM SIGMETRICS Performance Evaluation Review, 41 (1), 337–338.
Nedic, D.P., Dobson, I., Kirschen, D.S., Carreras, B.A., & Lynch, V.E. (2006). Criticality in a cascading failure blackout model. International Journal of Electrical Power & Energy Systems, 28(9), 627–633.
Pournaras, E. (2013). Multi-level reconfigurable self-organization in overlay services. PhD thesis, TU Delft, Delft University of Technology.
Pournaras, E., & Espejo-Uribe, J. (2016). Self-repairable smart grids via online coordination of smart transformers. IEEE Transactions on Industrial Informatics, PP (99), 1–1. (to appear).
Pournaras, E., Yao, M., Ambrosio, R., & Warnier, M. (2013). Organizational control reconfigurations for a robust smart power grid. Internet of things and inter-cooperative computational technologies for collective intelligence (pp. 189–206). Springer.
Pournaras, E., Brandt, B. E., Thapa, M., Acharya, D., Espejo-Uribe, J., Ballandies, M., & Helbing, D. (2017). Sfina - simulation framework for intelligent network adaptations. Simulation Modelling Practice and Theory (to appear).
Qin, Z. (2015). Construction of generic and adaptive restoration strategies. HKU Theses Online (HKUTO).
Ren, H., Xiong, J., Watts, D., Zhao, Y., & et al. (2013). Branching process based cascading failure probability analysis for a regional power grid in China with utility outage data. Energy and Power Engineering, 5(04), 914.
Todinov, M.T. (2013). Flow Networks: Analysis and optimization of repairable flow networks, networks with disturbed flows, static flow networks and reliability networks. Newnes.
Trajanovski, S., Martín-hernández, J., Winterbach, W., & Van Mieghem, P. (2013). Ulanowicz2009. Journal of Complex Networks, 1(1), 44–62.
Ulanowicz, R.E., Goerner, S.J., Lietaer, B., & Gomez, R. (2009). Quantifying sustainability: resilience, efficiency and the return of information theory. Ecological Complexity, 6(1), 27– 36.
Wang, W.X., & Chen, G. (2008). Universal robustness characteristic of weighted networks against cascading failure. Physical Review E, 77(2), 026,101.
Wang, X., Koç, Y., Kooij, R.E., & Van Mieghem, P. (2015). A network approach for power grid robustness against cascading failures, 2015 7th international workshop on reliable networks design and modeling (RNDM) (pp. 208–214). IEEE.
Yan, J., He, H., & Sun, Y. (2014). Integrated security analysis on cascading failure in complex networks. IEEE Transactions on Information Forensics and Security, 9(3), 451–463.
Youssef, M., Scoglio, C., & Pahwa, S. (2011). Robustness measure for power grids with respect to cascading failures, Proceedings of the 2011 international workshop on modeling, analysis, and control of complex networks (pp. 45–49). International Teletraffic Congress.
Zhang, W., Pei, W., & Guo, T. (2014). An efficient method of robustness analysis for power grid under cascading failure. Safety Science, 64, 121–126.
Zhou, M., & Zhou, S. (2007). Internet, open-source and power system simulation, Power engineering society general meeting, 2007 (pp. 1–5). IEEE.
Acknowledgements
This research is funded by the Professorship of Computational Social Science, ETH Zurich, Zurich, Switzerland. The authors are grateful to the rest of the SFINA development team for their comments and overall contributions to the project: Ben-Elias Brandt, Mark Ballandies and Dinesh Acharya.
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix A: Theoretical model: damage correlation
To better understand the spreading dynamics, the evolution of damage correlation for the two artificial networks is shown in Figs. 12 and 13. The two figures show how correlated the spreading of network damages are in all possible pairs of perturbations in the network and how the correlations evolve during the cascading failure.
Figure 12 confirms that the hubs of the Barabási-Albert network result in highly positively correlated processes of damage spread, which are actually due to the high levels of damage diffused in the network. When the recovery process takes place, there is a low increase in the average correlations for strategy B compared to baseline, for instance, 3.95% at the 30th cascade iteration compared to 0.5% at the 20th iteration.
Figure 13 confirms that the damage spread in small world networks has a strong locality influence. The damage of neighboring nodes results in correlated spread of damages as it can be clearer seen in Fig. 13a, e and i. As the spread of the damage increases according the trajectories of Fig. 1b, the correlation of the damages becomes more polarized and vary between highly positive and negative correlation in different parts of the network. Strategies decrease correlations compared to the baseline, for instance, 25.44% at the 30th cascade iteration and 41.5% at the 70th iteration for strategy B. However, damage correlations increase on average during the cascading failure, for example, 87% and 83.6% for baseline and strategy B respectively and from the 30th to the 70th iteration of the cascading failure.
B: Theoretical model: network visualizations
Figures 14 and 15 show the two artificial networks in the respective cascade iterations that are also shown for the damage correlations. The initial damaged node is node 7.
The Barabási-Albert network in Fig. 14 clearly shows that the damage of the peripheral node influences the neighboring nodes (Fig. 14d, f), however, as the central hub is only two hops away from the damaged node, the damage spreads and eventually affects severely the whole network already at the 20th iteration (Fig. 14g, i). The repairability strategies can only alleviate the damage level of individual nodes and therefore, they do not restrict the cascading disaster.
In contrast, the small world network of Fig. 15 shows that the damage spread is in general localized at a higher level than the Barabási-Albert network. Strategy B maintains the lowest damage level in the nodes as also confirmed by Fig. 1b.
Rights and permissions
About this article
Cite this article
Thapa, M., Espejo-Uribe, J. & Pournaras, E. Measuring network reliability and repairability against cascading failures. J Intell Inf Syst 52, 573–594 (2019). https://doi.org/10.1007/s10844-017-0477-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10844-017-0477-0