Abstract
The initiatives Organic Computing and Autonomic Computing introduced challenging visions for future computer systems. They address the growing complexity of these systems that demands for new ways to control them. Future systems should be able to adapt dynamically to the current conditions of their environment. They should be characterised by so-called self-x properties like self-configuring, self-healing, self-optimising, self-protecting, and context-aware. For the incorporation of self-healing capabilities into distributed systems the detection of failures is a crucial part. Recently we proposed a new failure detector that can be described as an adaptive accrual algorithm. It has been designed for flexible generic usability as a basis to realise self-healing of distributed systems. This paper introduces variations of the proposed basic algorithm to improve its performance and provides an evaluation of all algorithms using message delay and loss models of the internet.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Bertier, M., Marin, O., Sens, P.: Implementation and performance evaluation of an adaptable failure detector. In: DSN ’02: Proceedings of the 2002 International Conference on Dependable Systems and Networks, Washington, DC, USA, pp. 354–363. IEEE Computer Society Press, Los Alamitos (2002)
Bolot, J.-C.: End-to-end packet delay and loss behavior in the internet. In: SIGCOMM, pp. 289–298 (1993), http://doi.acm.org/10.1145/166237.166265
Chandra, T.D., Hadzilacos, V., Toueg, S.: The weakest failure detector for solving consensus. J. ACM 43(4), 685–722 (1996), doi:10.1145/234533.234549
Chandra, T.D., Toueg, S.: Unreliable failure detectors for reliable distributed systems. J. ACM 43(2), 225–267 (1996), doi:10.1145/226643.226647
Chen, W., Toueg, S., Aguilera, M.K.: On the quality of service of failure detectors. In: Proceedings of the International Conference on Dependable Systems and Networks (DSN 2000), New York, IEEE Computer Society Press, Los Alamitos (2000), citeseer.ist.psu.edu/chen00quality.html
Dam, K.K., Ni, L.M.: Design and implementation of a network emulator. Technical Report MSU-CPS-ACS-98-16, Department of Computer Science and Engineering, Michigan State University (May 1998)
Fetzer, C., Raynal, M., Tronel, F.: An adaptive failure detection protocol. In: PRDC ’01: Proceedings of the 2001 Pacific Rim International Symposium on Dependable Computing, Washington, DC, USA, p. 146. IEEE Computer Society Press, Los Alamitos (2001)
Fischer, M.J., Lynch, N.A., Paterson, M.S.: Impossibility of distributed consensus with one faulty process. J. ACM 32(2), 374–382 (1985), doi:10.1145/3149.214121
Härdle, W.: Smoothing Techniques with Implementation in S. Springer, Berlin (1991)
Hayashibara, N., Défago, X., Yared, R., Katayama, T.: The f accrual failure detector. In: SRDS, pp. 66–78. IEEE Computer Society Press, Los Alamitos (2004)
Horn, P.: Autonomic computing: Ibms perspective on the state of information technology (2001), http://www.research.ibm.com/autonomic/
Horstmann, M., Kirtland, M.: Dcomarchitecture. Technical report (July 1997), http://msdn.microsoft.com/library/backgrnd/html/msdn_dcomarch.htm
Jacobson, V.: Congestion avoidance and control. In: SIGCOMM ’88: Symposium proceedings on Communications architectures and protocols, Stanford, California, United States, pp. 314–329. ACM Press, New York (1988), doi:10.1145/52324.52356
Kephart, J.O.: Research challenges of autonomic computing. In: ICSE ’05: Proceedings of the 27th international conference on Software engineering, St. Louis, MO, USA, pp. 15–22 (2005), doi:10.1145/1062455.1062464
Lynch, N.: A hundred impossibility proofs for distributed computing. In: PODC ’89: Proceedings of the eighth annual ACM Symposium on Principles of distributed computing, Edmonton, Alberta, Canada, pp. 1–28. ACM Press, New York (1989), doi:10.1145/72981.72982
Mukherjee, A.: On the dynamics and significance of low frequency components of internet load. Technical Report MIS-CIS-92-83, University of Pennsylvania (December 1992)
Müller-Schloer, C., von der Malsburg, C., Würtz, R.P.: Organic computing. Informatik Spektrum 27(4), 332–336 (2004), doi:10.1007/s00287-004-0409-6
Pietzowski, A., Trumler, W., Ungerer, T.: An artificial immune system and its integration into an organic middleware for self-protection. In: GECCO ’06: Proceedings of the 8th annual conference on Genetic and evolutionary computation, Seattle, Washington, USA, pp. 129–130. ACM Press, New York (2006), doi:10.1145/1143997.1144020
Richter, U., Mnif, M., Branke, J., Müller-Schloer, C., Schmeck, H.: Towards a generic observer/controller architecture for organic computing. In: Hochberger, C., Liskowsky, R. (eds.) INFORMATIK 2006 – Informatik für Menschen, Dresden, Germany, sep 2006. GI-Edition – Lecture Notes in Informatics, vol. P-93, pp. 112–119. Köllen Verlag, Bonn (2006)
Sanghi, D., Agrawala, A.K., Gudmundsson, O., Jain, B.N.: Experimental assessment of end-to-end behavior on internet. In: INFOCOM, pp. 867–874 (1993)
Satzger, B., Pietzowski, A., Trumler, W., Ungerer, T.: A new adaptive accrual failure detector for dependable distributed systems. In: SAC ’07: Proceedings of the 2006 ACM symposium on Applied computing, Seoul, South Korea, ACM Press, New York (2007)
Schmeck, H.: Organic computing-vision and challenge for system design. In: Proceedings of the Parallel Computing in Electrical Engineering, International Conference on (PARELEC 2004), Washington, DC, USA, pp. 3–3. IEEE Computer Society Press, Los Alamitos (2004)
Schmeck, H.: Organic computing. Künstliche Intelligenz 5(3), 68–69 (2005)
Silverman, B.W.: Density Estimation for Statistics and Data Analysis. Chapman & Hall/CRC, Boca Raton (April 1986)
Trumler, W., Bagci, F., Petzold, J., Ungerer, T.: Amun - autonomic middleware for ubiquitous environments applied to the smart doorplate. In: Advanced Engineering Informatics, vol. 19, Washington, DC, USA, pp. 243–252. Elsevier, Amsterdam (2005)
Trumler, W., Klaus, R., Ungerer, T.: Self-configuration via cooperative social behavior. In: Yang, L.T., Jin, H., Ma, J., Ungerer, T. (eds.) ATC 2006. LNCS, vol. 4158, pp. 90–99. Springer, Heidelberg (2006)
Trumler, W., Thiemann, T., Ungerer, T.: An artificial hormone system for self-organization of networked nodes. In: IFIP Conference on Biologically Inspired Cooperative Computing, Santiago de Chile, August 2006, pp. 85–94. Springer, Heidelberg (2006)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Satzger, B., Pietzowski, A., Trumler, W., Ungerer, T. (2007). Variations and Evaluations of an Adaptive Accrual Failure Detector to Enable Self-healing Properties in Distributed Systems. In: Lukowicz, P., Thiele, L., Tröster, G. (eds) Architecture of Computing Systems - ARCS 2007. ARCS 2007. Lecture Notes in Computer Science, vol 4415. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71270-1_13
Download citation
DOI: https://doi.org/10.1007/978-3-540-71270-1_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71267-1
Online ISBN: 978-3-540-71270-1
eBook Packages: Computer ScienceComputer Science (R0)