Abstract
Queries over scientific data often involve expensive analyses that require substantial computational resources, such as those available in Grids. We are developing a customizable query processor built on top of an established Grid infrastructure, the NorduGrid middleware, and have implemented a framework for managing long-running queries in a Grid environment. With the framework, the user does not specify the detailed job and parallelization descriptions required by NorduGrid. Instead, the user specifies queries in terms of an application-oriented schema that describes the contents of files managed by the Grid and accessed through wrappers. When the system receives a query, it generates NorduGrid job descriptions and submits them to NorduGrid for execution. The framework takes the limitations of NorduGrid into account. It includes a submission mechanism, a job babysitter, and a generic data exchange mechanism. The submission mechanism generates a number of jobs for parallel execution of a user query over wrapped data files. The babysitter submits the generated jobs to NorduGrid for execution, monitors their execution status, and downloads the results. The generic exchange mechanism provides a way to exchange objects through files between Grid execution nodes and user applications.
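The interplay of the submission mechanism and the babysitter can be illustrated with a minimal sketch. It is not the framework's implementation and does not use any NorduGrid API: the names `Job`, `GridClient`, `generate_jobs`, `babysit`, and `FakeGrid` are hypothetical, the one-job-per-file split is an assumption made only for illustration, and a real babysitter would wait between polls.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Dict, List, Protocol


class Status(Enum):
    PENDING = "pending"
    RUNNING = "running"
    FINISHED = "finished"
    FAILED = "failed"


@dataclass
class Job:
    query: str
    input_file: str
    status: Status = Status.PENDING
    grid_id: str = ""


class GridClient(Protocol):
    """Hypothetical interface standing in for the Grid middleware."""

    def submit(self, job: Job) -> str: ...
    def status(self, grid_id: str) -> Status: ...
    def download(self, grid_id: str) -> str: ...


def generate_jobs(query: str, files: List[str]) -> List[Job]:
    # Assumption: one job per wrapped data file.
    return [Job(query, f) for f in files]


def babysit(client: GridClient, jobs: List[Job], max_polls: int = 100) -> Dict[str, str]:
    """Submit all jobs, poll their status, and download finished results."""
    for job in jobs:
        job.grid_id = client.submit(job)
    results: Dict[str, str] = {}
    terminal = (Status.FINISHED, Status.FAILED)
    for _ in range(max_polls):
        active = [j for j in jobs if j.status not in terminal]
        if not active:
            break
        for job in active:
            job.status = client.status(job.grid_id)
            if job.status is Status.FINISHED:
                results[job.input_file] = client.download(job.grid_id)
    return results


class FakeGrid:
    """In-memory stand-in: every job finishes on its second status poll."""

    def __init__(self) -> None:
        self._jobs: Dict[str, Job] = {}
        self._polls: Dict[str, int] = {}

    def submit(self, job: Job) -> str:
        grid_id = f"job-{len(self._jobs)}"
        self._jobs[grid_id] = job
        self._polls[grid_id] = 0
        return grid_id

    def status(self, grid_id: str) -> Status:
        self._polls[grid_id] += 1
        return Status.FINISHED if self._polls[grid_id] >= 2 else Status.RUNNING

    def download(self, grid_id: str) -> str:
        job = self._jobs[grid_id]
        return f"result of {job.query!r} on {job.input_file}"


if __name__ == "__main__":
    jobs = generate_jobs("example query", ["events1.root", "events2.root"])
    print(babysit(FakeGrid(), jobs))
```

Keeping the Grid behind a small interface lets the polling logic be exercised without a real Grid, which is why the sketch ships with an in-memory stand-in.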
This work is funded by The Swedish Research Council (VR) under contract 343-2003-955.
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Fomkin, R., Risch, T. (2006). Framework for Querying Distributed Objects Managed by a Grid Infrastructure. In: Pierson, JM. (eds) Data Management in Grids. DMG 2005. Lecture Notes in Computer Science, vol 3836. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11611950_6
DOI: https://doi.org/10.1007/11611950_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-31212-3
Online ISBN: 978-3-540-32452-2
eBook Packages: Computer Science, Computer Science (R0)