Data Management in the APPA System

Journal of Grid Computing

Abstract

Combining Grid and P2P technologies can provide high-level data sharing in large-scale distributed environments. However, this combination must deal with two hard problems: the scale of the network and the dynamic behavior of the nodes. In this paper, we present our solution in APPA (Atlas Peer-to-Peer Architecture), a data management system with high-level services for building large-scale distributed applications. We focus on data availability and data discovery, which are two main requirements for implementing large-scale Grids. We have validated APPA’s services through a combination of experimentation over Grid5000, a very large Grid experimental platform, and simulation using SimJava. The results show very good performance in terms of communication cost and response time.

Author information

Correspondence to Reza Akbarinia.

Additional information

Work partially funded by ARA “Massive Data” of the French Ministry of Research and the European STREP Grid4All project.

About this article

Cite this article

Akbarinia, R., Martins, V. Data Management in the APPA System. J Grid Computing 5, 303–317 (2007). https://doi.org/10.1007/s10723-007-9070-z
