Abstract
Although fault tolerant systems are badly needed, their development is not a trivial task. Considering fault tolerant distributed systems, this difficulty is even bigger, and any artifact that could make this task easier becomes highly valuable. In this paper, we propose and model a distributed consensusservice and a fault detection service, namely, DisCusS and FuSe, that can be used as building blocks in the development of distributed fault tolerant applications. We also show the compliance of FuSe to FT-CORBA fault detection, and give some insights on the use of the proposed consensus service in a possible FT-CORBA implementation. Moreover, this paper presents some comparative tests of the influence of adaptive and non-adaptive fault detectors over consensus.
Keywords
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Oestereich, B.: Developing Software with UML. Addison-Wesley, Harlow (1999)
Marin, O., Bertier, M., Pierre, S.: Implementation and performance evaluation of an adaptable failure detector. In: Proc. of the Int. Conference on Dependable Systems and Networks (DSN 2002), Washington, D.C., USA, June 2002, p. 354 (2002)
Brasileiro, F., Greve, F., Mostefaoui, A., Raynal, M.: Consensus in one communication step. In: Malyshkin, V.E. (ed.) PaCT 2001. LNCS, vol. 2127, pp. 42–50. Springer, Heidelberg (2001)
Chandra, T.D., Toueg, S.: Unreliable failure detectors for reliable distributed systems. Journal of the ACM 43(2), 225–267 (1996)
Chen, W., Toueg, S., Aguilera, M.K.: On the quality of service of failure detectors. IEEE Transactions on Computers 51(5), 561–580 (2002)
Dolev, D., Dwork, C., Stockmeyer, L.: On the minimal synchronism needed for distributed consensus. Journal of the ACM (JACM) 34(1), 77–97 (1987)
Dwork, C., Lynch, N., Stockmeyer, L.: Consensus in the presence of partial synchrony. Journal of the ACM (JACM) 35(2), 288–323 (1988)
Esteffanel, L.A.B., Jansch-Põrto, I.: On the evaluation of failure detectors performance. In: Proc. of IX Brazilian Symposium of Fault Tolerant Computing, Florianópolis, Brazil (March 2001)
Felber, P.: The CORBA Object Group Service: A Service Approach to Object Groups in CORBA. PhD thesis, École Polytechnique Fédérale de Lausanne, Switzerland (1998)
Fischer, M.J., Lynch, N.A., Paterson, M.S.: Impossibility of distributed consensus with one faulty process. Journal of the ACM (JACM) 32(2), 374–382 (1985)
Gamma, E., Helm, R., Johnson, R., Vlissides, J.: Design Patterns: Elements of Reusable Object-Oriented Software. Addison-Wesley Professional Computing Series. Addison-Wesley, Reading (1994)
Gartner, F.C.: Fundamentals of fault-tolerant distributed computing in asynchronous environments. ACM Computing Surveys 31(1), 1–26 (1999)
Guerraoui, R.: Revisiting the relationship between non-blocking atomic commitment and consensus. In: Helary, J.-M., Raynal, M. (eds.) WDAG 1995. LNCS, vol. 972, pp. 87–100. Springer, Heidelberg (1995)
Guerraoui, R., Oliveira, R., Schiper, A.: Stubborn communication channels. Technical Report 98/272, École Polytechnique Fédérale de Lausanne, Switzerland (March 1998)
Guerraoui, R., Schiper, A.: The generic consensus service. IEEE Transactions on Software Engineering 27(1), 29–41 (2001)
Lamport, L.: The part-time parliament. ACM Transactions on Computer Systems (TOCS) 16(2), 133–169 (1998)
Lung, C., Fraga, J., Farines, J.-M., Ogg, M., Ricciardi, A.: Cosnamingft - a fault-tolerant corba naming service. In: Proc. 18th IEEE International Symposium on Reliable Distributed Systems (SRDS 1999), pp. 254–262. IEEE Computer Society, Los Alamitos (1999)
Hurfin, M., Macêdo, R., Raynal, M., Tronel, F.: A general framework to solve agreement problems. In: Proc. 18th IEEE International Symposium on Reliable Distributed Systems (SRDS 1999), pp. 55–65. IEEE Computer Society, Los Alamitos (1999)
The Common Object Request Broker: Architecture and Specification. Technical Report 2.6, Object Management Group (December 2001)
Fault tolerant corba. corba 2.6. Technical Report 2.6, Object Management Group (December 2001)
Schiper, A.: Early consensus in an asynchronous system with a weak failure detector. Distributed Computing 10(3), 149–157 (1997)
Schmidt, D.C.: Internet site (December 2002), http://www.cs.wustl.edu/schmidt/tao.html
Sergent, N., Défago, X., Schiper, A.: Impact of a failure detection mechanism on the performance of consensus. In: Proc. IEEE Pacific Rim Symp. on Dependable Computing (PRDC), Seoul, Korea (December 2001)
Sommerlad, P., Buschmann, F.: Manager design pattern. In: 3rd annual PLoP, Allenton Park, Illinois (September 1996)
Sotoma, I., Madeira, E.R.M.: ADAPTATION - Algorithms to ADAPTive FAulT MonItOriNg and Their Implementation on CORBA. In: Proc. IEEE of the 3rd International Symposium on Distributed Objects and Applications (DOA 2001), Rome, Italy, September 2001, pp. 219–228 (2001)
Sotoma, I., Madeira, E.R.M.: DPCP(Discard Past Consider Present) - A Novel Approach to Adaptive Fault Detection in Distributed Systems. In: Proc. of the 8th IEEE Workshop on Future Trends of Distributed Computing Systems (FTDCS2001), Bologna, Italy, November 2001, pp. 76–82 (2001)
Urbán, P., Défago, X., Schiper, A.: Contention-aware metrics for distributed algorithms: Comparison of atomic broadcast algorithms. In: Proc. of the 9th IEEE International Conference on Computer Communications and Networks (IC3N 2000), October 2000, pp. 582–589 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Camargos, L.J., Madeira, E.R.M. (2003). DisCusS and FuSe: Considering Modularity, Genericness, and Adaptation in the Development of Consensus and Fault Detection Services. In: de Lemos, R., Weber, T.S., Camargo, J.B. (eds) Dependable Computing. LADC 2003. Lecture Notes in Computer Science, vol 2847. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45214-0_18
Download citation
DOI: https://doi.org/10.1007/978-3-540-45214-0_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20224-0
Online ISBN: 978-3-540-45214-0
eBook Packages: Springer Book Archive