research-article

PROQID: partial restarts of queries in distributed databases

Authors:
Jon Olav Hauglid

National University of Science and Technology, Trondheim, Norway

National University of Science and Technology, Trondheim, Norway
View Profile

,
Kjetil Nørvåg

National University of Science and Technology, Trondheim, Norway

National University of Science and Technology, Trondheim, Norway
View Profile

CIKM '08: Proceedings of the 17th ACM conference on Information and knowledge managementOctober 2008Pages 1251–1260https://doi.org/10.1145/1458082.1458247

Published:26 October 2008Publication History

CIKM '08: Proceedings of the 17th ACM conference on Information and knowledge management

Pages 1251–1260

ABSTRACT

In a number of application areas, distributed database systems can be used to provide persistent storage of data while providing efficient access for both local and remote data. With an increasing number of sites (computers) involved in a query, the probability of failure at query time increases. Recovery has previously only focused on database updates while query failures have been handled by complete restart of the query. This technique is not always applicable in the context of large queries and queries with deadlines. In this paper we present an approach for partial restart of queries that incurs minimal extra network traffic during query recovery. Based on results from experiments on an implementation of the partial restart technique in a distributed database system, we demonstrate its applicability and significant reduction of query cost in the presence of failures.

References

M. N. Alpdemir et al. OGSA-DQP: a service for distributed querying on the Grid. In Proceedings of EDBT'2004, 2004.Google Scholar
R. S. Barga et al. Recovery guarantees for internet applications. ACM Trans. Internet Techn., 4(3):289--328, 2004. Google ScholarDigital Library
P. Bonnet and A. Tomasic. Partial answers for unavailable data sources. In Proceedings of FQAS'98, 1998. Google ScholarDigital Library
R. Braumandl, M. Keidl, A. Kemper, D. Kossmann, A. Kreutz, S. Seltzsam, and K. Stocker. ObjectGlobe: ubiquitous query processing on the Internet. VLDB Journal, 10(1):48--71, 2001. Google ScholarDigital Library
B. Chandramouli, C. N. Bond, S. Babu, and J. Yang. Query suspend and resume. In Proceedings of the SIGMOD'2007, 2007. Google ScholarDigital Library
S. Chaudhuri, R. Kaushik, R. Ramamurthy, and A. Pol. Stop-and-restart style execution for long running decision support queries. In Proceedings of VLDB'2007, 2007. Google ScholarDigital Library
S. Chaudhuri, R. Krishnamurthy, S. Potamianos, and K. Shim. Optimizing queries with materialized views. In Proceedings of ICDE'1995, 1995. Google ScholarDigital Library
S. Dar, M. J. Franklin, B. T. Jónsson, D. Srivastava, and M. Tan. Semantic data caching and replacement. In Proceedings of VLDB'1996, 1996. Google ScholarDigital Library
A. Gounaris et al. Adapting to changing resource performance in Grid query processing. In Proceedings of DMG'05, 2005. Google ScholarDigital Library
J.-H. Hwang et al. High-availability algorithms for distributed stream processing. In Proceedings of ICDE'2005, 2005. Google ScholarDigital Library
J.-H. Hwang et al. A cooperative, self-configuring high-availability solution for stream processing. In Proceedings of ICDE'2007, 2007.Google ScholarCross Ref
N. Kabra and D. J. DeWitt. Efficient mid-query re-optimization of sub-optimal query execution plans. In Proceedings of SIGMOD'1998, 1998. Google ScholarDigital Library
D. Kossmann. The state of the art in distributed query processing. ACM Computing Surveys, 32(4):422--469, 2000. Google ScholarDigital Library
W. Labio et al. Efficient resumption of interrupted warehouse loads. In Proceedings of SIGMOD'2000, 2000. Google ScholarDigital Library
Q. Ren, M. H. Dunham, and V. Kumar. Semantic caching and query processing. IEEE Trans. on Knowl. and Data Eng., 15(1):192--210, 2003. Google ScholarDigital Library
A. N. Saharia and Y. M. Babad. Enhancing data warehouse performance through query caching. SIGMIS Database, 31(3):43--63, 2000. Google ScholarDigital Library
J. Smith and P. Watson. Fault-tolerance in distributed query processing. In Proceedings of IDEAS'2005, 2005. Google ScholarDigital Library
R. Wang, B. Salzberg, and D. B. Lomet. Log-based recovery for middleware servers. In Proceedings of SIGMOD'2007, 2007. Google ScholarDigital Library
A. N. Wilschut and P. M. G. Apers. Dataflow query execution in a parallel main-memory environment. Distributed and Parallel Databases, 1(1):103--128, 1993. Google ScholarDigital Library

Index Terms

PROQID: partial restarts of queries in distributed databases
1. Information systems
  1. Data management systems
    1. Database management system engines
      1. Database query processing
2. Theory of computation
  1. Theory and algorithms for application domains
    1. Database theory
      1. Database query processing and optimization (theory)

Recommendations

A low-overhead recovery technique using quasi-synchronous checkpointing
ICDCS '96: Proceedings of the 16th International Conference on Distributed Computing Systems (ICDCS '96)

In this paper, we propose a quasi-synchronous checkpointing algorithm and a low-overhead recovery algorithm based on it. The checkpointing algorithm preserves process autonomy by allowing them to take checkpoints asynchronously and uses communication-...
Read More
A quasi-synchronous checkpointing algorithm that prevents contention for stable storage

Checkpointing and rollback recovery are established techniques for handling failures in distributed systems. Under synchronous checkpointing, each process involved in the distributed computation takes checkpoint almost simultaneously. This causes ...
Read More
Asynchronous recovery without using vector timestamps

A checkpoint of a process involved in a distributed computation is said to be useful if it is part of a consistent global checkpoint. In this paper, we present a quasi-synchronous checkpointing algorithm that makes every checkpoint useful. We also ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
CIKM '08: Proceedings of the 17th ACM conference on Information and knowledge management
October 2008
1562 pages
ISBN:9781595939913
DOI:10.1145/1458082
General Chair:
James G. Shanahan
Church and Duncan Group Inc, USA
,
Program Chairs:
Sihem Amer-Yahia
Yahoo! Research, USA
,
Ioana Manolescu
INRIA, France
,
Yi Zhang
University of California, Santa Cruz, USA
,
David A. Evans
JustSystems Evans Research, USA
,
Alek Kolcz
Microsoft Live Labs, USA
,
Key-Sun Choi
KAIST, Korea
,
Abdur Chowdury
Twitter, USA
Copyright © 2008 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 26 October 2008
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
distributed querying
fault-tolerance
query restart
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate1,861of8,427submissions,22%
Upcoming Conference
CIKM '24

Sponsor:

sigir

sigir

The 33rd ACM International Conference on Information and Knowledge Management

October 21 - 25, 2024

Boise , ID , USA
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 6
  Total Citations
  View Citations
- 234
  Total Downloads
- Downloads (Last 12 months)3
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

PROQID: partial restarts of queries in distributed databases

CIKM '08: Proceedings of the 17th ACM conference on Information and knowledge management

ABSTRACT

References

Cited By

Index Terms

Recommendations

A low-overhead recovery technique using quasi-synchronous checkpointing

A quasi-synchronous checkpointing algorithm that prevents contention for stable storage

Asynchronous recovery without using vector timestamps