Abstract
We introduce FTRepMI, a simple fault-tolerant protocol for providing sequential consistency amongst replicated objects in a grid, without using any centralized components. FTRepMI supports dynamic joins and graceful leaves of processes holding a replica, as well as fail-stop crashes. Performance evaluation shows that FTRepMI behaves efficiently, both on a single cluster and on a distributed cluster environment.
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Oprescu, AM., Kielmann, T., Fokkink, W. (2008). FTRepMI: Fault-Tolerant, Sequentially-Consistent Object Replication for Grid Applications. In: Garg, V., Wattenhofer, R., Kothapalli, K. (eds) Distributed Computing and Networking. ICDCN 2009. Lecture Notes in Computer Science, vol 5408. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-92295-7_44
DOI: https://doi.org/10.1007/978-3-540-92295-7_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-92294-0
Online ISBN: 978-3-540-92295-7
eBook Packages: Computer Science, Computer Science (R0)