Uniformly-distributed random generation of join orders

Galindo-Legaria, César A.; Pellenkoft, Arjan; Kersten, Martin L.

doi:10.1007/3-540-58907-4_22

César A. Galindo-Legaria¹,², Arjan Pellenkoft¹ & Martin L. Kersten¹

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 893)

Included in the following conference series:

International Conference on Database Theory

Abstract

In this paper we study the space of operator trees that can be used to answer a join query, with the goal of generating elements from this space at random. We solve the problem for queries with acyclic query graphs. We first count, in O(n³) time, the exact number of trees that can be used to evaluate a given query on n relations. The intermediate results of the counting procedure then serve to generate random, uniformly distributed operator trees in O(n²) time per tree. We also establish a mapping between the N operator trees for a query and the integers 1 through N (i.e., a ranking) and describe ranking and unranking procedures with complexity O(n²) and O(n² log n), respectively.
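
The count-then-unrank idea in the abstract can be illustrated with a small Python sketch. This is not the paper's O(n³) procedure: it is a brute-force enumeration over connected subsets of relations (exponential in n), intended only to show how stored counts let a rank be mapped to a tree, so that drawing a uniform random rank yields a uniformly distributed tree. It assumes that operator trees are ordered (A ⋈ B differs from B ⋈ A), that cross products are excluded, that the query graph is connected and acyclic, and that ranks run from 0 to N−1 rather than 1 to N. The name `join_tree_space` and the nested-tuple tree representation are hypothetical.

```python
import random
from functools import lru_cache

def join_tree_space(n, edges):
    """Return (total, unrank) for the ordered, cross-product-free join trees
    of a connected query graph on relations 0..n-1 (illustrative brute force)."""
    adj = [0] * n                      # adjacency as bitmasks
    for u, v in edges:
        adj[u] |= 1 << v
        adj[v] |= 1 << u

    def connected(mask):
        # Flood fill inside `mask`, starting from its lowest set bit.
        seen = frontier = mask & -mask
        while frontier:
            reach = 0
            for i in range(n):
                if (frontier >> i) & 1:
                    reach |= adj[i]
            frontier = reach & mask & ~seen
            seen |= frontier
        return seen == mask

    @lru_cache(maxsize=None)
    def splits(mask):
        # Ordered (left, right) partitions of `mask` into two connected parts.
        out, sub = [], (mask - 1) & mask
        while sub:
            if connected(sub) and connected(mask ^ sub):
                out.append((sub, mask ^ sub))
            sub = (sub - 1) & mask
        return tuple(out)

    @lru_cache(maxsize=None)
    def count(mask):
        if (mask & (mask - 1)) == 0:   # single relation: one trivial tree
            return 1
        return sum(count(l) * count(r) for l, r in splits(mask))

    def unrank(rank, mask=(1 << n) - 1):
        if not 0 <= rank < count(mask):
            raise ValueError("rank out of range")
        if (mask & (mask - 1)) == 0:
            return mask.bit_length() - 1          # leaf: relation index
        for l, r in splits(mask):
            block = count(l) * count(r)           # trees rooted at this split
            if rank < block:
                left_rank, right_rank = divmod(rank, count(r))
                return (unrank(left_rank, l), unrank(right_rank, r))
            rank -= block

    return count((1 << n) - 1), unrank

# Chain query A - B - C, with relations numbered 0, 1, 2.
total, unrank = join_tree_space(3, [(0, 1), (1, 2)])
print(total)                              # 8
print(unrank(random.randrange(total)))    # one uniformly chosen join tree
```

Because `unrank` is a bijection between the ranks 0..N−1 and the trees, sampling the rank uniformly is enough to sample the tree uniformly; no rejection step is needed.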

C. Galindo-Legaria was supported by an ERCIM postdoctoral fellowship.

Author information

Authors and Affiliations

CWI, P.O. Box 94079, 1090 GB Amsterdam, The Netherlands
César A. Galindo-Legaria, Arjan Pellenkoft & Martin L. Kersten
SINTEF DELAB, N-7034 Trondheim, Norway
César A. Galindo-Legaria

Editor information

Georg Gottlob, Moshe Y. Vardi

Cite this paper

Galindo-Legaria, C.A., Pellenkoft, A., Kersten, M.L. (1995). Uniformly-distributed random generation of join orders. In: Gottlob, G., Vardi, M.Y. (eds) Database Theory — ICDT '95. ICDT 1995. Lecture Notes in Computer Science, vol 893. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-58907-4_22

DOI: https://doi.org/10.1007/3-540-58907-4_22
Published: 02 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-58907-5
Online ISBN: 978-3-540-49136-1
eBook Packages: Springer Book Archive