skip to main content
10.1145/3294052.3319683acmconferencesArticle/Chapter ViewAbstractPublication PagespodsConference Proceedingsconference-collections
research-article

HyperBench: A Benchmark and Tool for Hypergraphs and Empirical Findings

Authors Info & Claims
Published:25 June 2019Publication History

ABSTRACT

To cope with the intractability of answering Conjunctive Queries (CQs) and solving Constraint Satisfaction Problems (CSPs), several notions of hypergraph decompositions have been proposed - giving rise to different notions of width, noticeably, plain, generalized, and fractional hypertree width (hw, ghw, and fhw). Given the increasing interest in using such decomposition methods in practice, a publicly accessible repository of decomposition software, as well as a large set of benchmarks, and a web-accessible workbench for inserting, analysing, and retrieving hypergraphs are called for. We address this need by providing (i) concrete implementations of hypergraph decompositions (including new practical algorithms), (ii) a new, comprehensive benchmark of hypergraphs stemming from disparate CQ and CSP collections, and (iii) HyperBench, our new web-interface for accessing the benchmark and the results of our analyses. In addition, we describe a number of actual experiments we carried out with this new infrastructure.

References

  1. Christopher R. Aberger, Andrew Lamb, Susan Tu, Andres Nö tzli, Kunle Olukotun, and Christopher Ré. 2017. EmptyHeaded: A Relational Engine for Graph Processing. ACM Trans. Database Syst., Vol. 42, 4 (2017), 20:1--20:44. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Christopher R. Aberger, Susan Tu, Kunle Olukotun, and Christopher Ré. 2016. Old Techniques for New Join Algorithms: A Case Study in RDF Processing. CoRR, Vol. abs/1602.03557 (2016). arxiv: 1602.03557 http://arxiv.org/abs/1602.03557Google ScholarGoogle Scholar
  3. Isolde Adler, Georg Gottlob, and Martin Grohe. 2007. Hypertree width and related hypergraph invariants. Eur. J. Comb., Vol. 28, 8 (2007), 2167--2181. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Kamal Amroun, Zineb Habbas, and Wassila Aggoune-Mtalaa. 2016. A compressed Generalized Hypertree Decomposition-based solving technique for non-binary Constraint Satisfaction Problems. AI Commun., Vol. 29, 2 (2016), 371--392.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Molham Aref, Balder ten Cate, Todd J. Green, Benny Kimelfeld, Dan Olteanu, Emir Pasalic, Todd L. Veldhuizen, and Geoffrey Washburn. 2015. Design and Implementation of the LogicBlox System. In Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31 - June 4, 2015, Timos K. Sellis, Susan B. Davidson, and Zachary G. Ives (Eds.). ACM, 1371--1382. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Patricia C. Arocena, Boris Glavic, Radu Ciucanu, and René e J. Miller. 2015. The iBench Integration Metadata Generator. PVLDB, Vol. 9, 3 (2015), 108--119. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Gilles Audemard, Frédéric Boussemart, Christophe Lecoutre, and Cédric Piette. 2016. XCSP3: an XML-based format designed to represent combinatorial constrained problems. http://www.xcsp.org/Google ScholarGoogle Scholar
  8. Nurzhan Bakibayev, Tomá s Kociský, Dan Olteanu, and Jakub Zavodny. 2013. Aggregation and Ordering in Factorised Databases. PVLDB, Vol. 6, 14 (2013), 1990--2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Michael Benedikt. 2017. CQ benchmarks. Personal Communication.Google ScholarGoogle Scholar
  10. Michael Benedikt, George Konstantinidis, Giansalvatore Mecca, Boris Motik, Paolo Papotti, Donatello Santoro, and Efthymia Tsamoura. 2017. Benchmarking the Chase. In Proceedings of the 36th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems, PODS 2017, Chicago, IL, USA, May 14--19, 2017, Emanuel Sallinger, Jan Van den Bussche, and Floris Geerts (Eds.). ACM, 37--52.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Jeremias Berg, Neha Lodha, Matti J"arvisalo, and Stefan Szeider. 2017. MaxSAT Benchmarks based on Determining Generalized Hypertree-width. MaxSAT Evaluation 2017: Solver and Benchmark Descriptions, Vol. B-2017--2 (2017), 22.Google ScholarGoogle Scholar
  12. Angela Bonifati, Wim Martens, and Thomas Timm. 2017. An Analytical Study of Large SPARQL Query Logs. PVLDB, Vol. 11, 2 (2017), 149--161. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Nofar Carmeli, Batya Kenig, and Benny Kimelfeld. 2017. Efficiently Enumerating Minimal Triangulations. In Proceedings of the 36th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems, PODS 2017, Chicago, IL, USA, May 14--19, 2017, Emanuel Sallinger, Jan Van den Bussche, and Floris Geerts (Eds.). ACM, 273--287.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Ashok K. Chandra and Philip M. Merlin. 1977. Optimal Implementation of Conjunctive Queries in Relational Data Bases. In Proceedings of the 9th Annual ACM Symposium on Theory of Computing, May 4--6, 1977, Boulder, Colorado, USA, John E. Hopcroft, Emily P. Friedman, and Michael A. Harrison (Eds.). ACM, 77--90. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Rina Dechter. 2003. Constraint Processing .Elsevier. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Uriel Feige and Mohammad Mahdian. 2006. Finding small balanced separators. In Proceedings of the 38th Annual ACM Symposium on Theory of Computing, Seattle, WA, USA, May 21--23, 2006, Jon M. Kleinberg (Ed.). ACM, 375--384. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Johannes Klaus Fichte, Markus Hecher, Neha Lodha, and Stefan Szeider. 2018. An SMT Approach to Fractional Hypertree Width. In Principles and Practice of Constraint Programming - 24th International Conference, CP 2018, Lille, France, August 27--31, 2018, Proceedings (Lecture Notes in Computer Science), John N. Hooker (Ed.), Vol. 11008. Springer, 109--127.Google ScholarGoogle Scholar
  18. Wolfgang Fischl, Georg Gottlob, and Reinhard Pichler. 2017. Tractable Cases for Recognizing Low Fractional Hypertree Width. viXra.org e-prints, Vol. viXra:1708.0373 (2017). http://vixra.org/abs/1708.0373Google ScholarGoogle Scholar
  19. Wolfgang Fischl, Georg Gottlob, and Reinhard Pichler. 2018. General and Fractional Hypertree Decompositions: Hard and Easy Cases. In Proceedings of the 37th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems, Houston, TX, USA, June 10--15, 2018, Jan Van den Bussche and Marcelo Arenas (Eds.). ACM, 17--32.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Floris Geerts, Giansalvatore Mecca, Paolo Papotti, and Donatello Santoro. 2014. Mapping and cleaning. In IEEE 30th International Conference on Data Engineering, Chicago, ICDE 2014, IL, USA, March 31 - April 4, 2014, Isabel F. Cruz, Elena Ferrari, Yufei Tao, Elisa Bertino, and Goce Trajcevski (Eds.). IEEE Computer Society, 232--243.Google ScholarGoogle ScholarCross RefCross Ref
  21. Lucantonio Ghionna, Luigi Granata, Gianluigi Greco, and Francesco Scarcello. 2007. Hypertree Decompositions for Query Optimization. In Proceedings of the 23rd International Conference on Data Engineering, ICDE 2007, The Marmara Hotel, Istanbul, Turkey, April 15--20, 2007, Rada Chirkova, Asuman Dogac, M. Tamer Ö zsu, and Timos K. Sellis (Eds.). IEEE Computer Society, 36--45.Google ScholarGoogle ScholarCross RefCross Ref
  22. Lucantonio Ghionna, Gianluigi Greco, and Francesco Scarcello. 2011. H-DB: a hybrid quantitative-structural sql optimizer. In Proceedings of the 20th ACM Conference on Information and Knowledge Management, CIKM 2011, Glasgow, United Kingdom, October 24--28, 2011, Craig Macdonald, Iadh Ounis, and Ian Ruthven (Eds.). ACM, 2573--2576. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Georg Gottlob, Gianluigi Greco, Nicola Leone, and Francesco Scarcello. 2016. Hypertree Decompositions: Questions and Answers. In Proceedings of the 35th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems, PODS 2016, San Francisco, CA, USA, June 26 - July 01, 2016, Tova Milo and Wang-Chiew Tan (Eds.). ACM, 57--74. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Georg Gottlob, Nicola Leone, and Francesco Scarcello. 2002. Hypertree Decompositions and Tractable Queries. J. Comput. Syst. Sci., Vol. 64, 3 (2002), 579--627. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Georg Gottlob, Zoltá n Mikló s, and Thomas Schwentick. 2009. Generalized hypertree decompositions: NP-hardness and tractable variants. J. ACM, Vol. 56, 6 (2009), 30:1--30:32. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Georg Gottlob and Marko Samer. 2008. A backtracking-based algorithm for hypertree decomposition. ACM Journal of Experimental Algorithmics, Vol. 13 (2008), 1:1.1--1:1.19. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Martin Grohe and Dá niel Marx. 2014. Constraint Solving via Fractional Edge Covers. ACM Trans. Algorithms, Vol. 11, 1 (2014), 4:1--4:20. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Yuanbo Guo, Zhengxiang Pan, and Jeff Heflin. 2005. LUBM: A benchmark for OWL knowledge base systems. J. Web Semant., Vol. 3, 2--3 (2005), 158--182. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Zineb Habbas, Kamal Amroun, and Daniel Singer. 2015. A Forward-Checking algorithm based on a Generalised Hypertree Decomposition for solving non-binary constraint satisfaction problems. J. Exp. Theor. Artif. Intell., Vol. 27, 5 (2015), 649--671.Google ScholarGoogle ScholarCross RefCross Ref
  30. Shrainik Jain, Dominik Moritz, Daniel Halperin, Bill Howe, and Ed Lazowska. 2016. SQLShare: Results from a Multi-Year SQL-as-a-Service Experiment. In Proceedings of the 2016 International Conference on Management of Data, SIGMOD Conference 2016, San Francisco, CA, USA, June 26 - July 01, 2016, Fatma Ö zcan, Georgia Koutrika, and Sam Madden (Eds.). ACM, 281--293. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Shant Karakashian, Robert J. Woodward, and Berthe Y. Choueiry. 2011. Reformulating R(*, m)C with Tree Decomposition. In Proceedings of the Ninth Symposium on Abstraction, Reformulation, and Approximation, SARA 2011, Parador de Cardona, Cardona, Catalonia, Spain, July 17--18, 2011., Michael R. Genesereth and Peter Z. Revesz (Eds.). AAAI, 62--69. http://www.aaai.org/ocs/index.php/SARA/SARA11/paper/view/4234Google ScholarGoogle Scholar
  32. Mahmoud Abo Khamis, Hung Q. Ngo, Christopher Ré, and Atri Rudra. 2016b. Joins via Geometric Resolutions: Worst Case and Beyond. ACM Trans. Database Syst., Vol. 41, 4 (2016), 22:1--22:45. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Mahmoud Abo Khamis, Hung Q. Ngo, and Atri Rudra. 2016a. FAQ: Questions Asked Frequently. In Proceedings of the 35th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems, PODS 2016, San Francisco, CA, USA, June 26 - July 01, 2016, Tova Milo and Wang-Chiew Tan (Eds.). ACM, 13--28. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Mohammed Lalou, Zineb Habbas, and Kamal Amroun. 2009. Solving Hypertree Structured CSP: Sequential and Parallel Approaches. In Proceedings of the 16th RCRA workshop on Experimental Evaluation of Algorithms for Solving Problems with Combinatorial Explosion, RCRA@AI*IA 2009, Reggio Emilia, Italy, December 11--12, 2009 (CEUR Workshop Proceedings), Marco Gavanelli and Toni Mancini (Eds.), Vol. 589. CEUR-WS.org. http://ceur-ws.org/Vol-589/paper11.pdfGoogle ScholarGoogle Scholar
  35. Viktor Leis, Andrey Gubichev, Atanas Mirchev, Peter A. Boncz, Alfons Kemper, and Thomas Neumann. 2015. How Good Are Query Optimizers, Really? PVLDB, Vol. 9, 3 (2015), 204--215. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Viktor Leis, Bernhard Radke, Andrey Gubichev, Atanas Mirchev, Peter A. Boncz, Alfons Kemper, and Thomas Neumann. 2018. Query optimization through the looking glass, and what we found running the Join Order Benchmark. VLDB J., Vol. 27, 5 (2018), 643--668. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Stanislav Malyshev, Markus Krö tzsch, Larry Gonzá lez, Julius Gonsior, and Adrian Bielefeldt. 2018. Getting the Most Out of Wikidata: Semantic Technology Usage in Wikipedia's Knowledge Graph. In The Semantic Web - ISWC 2018 - 17th International Semantic Web Conference, Monterey, CA, USA, October 8--12, 2018, Proceedings, Part II (Lecture Notes in Computer Science), Denny Vrandecic, Kalina Bontcheva, Mari Carmen Suá rez-Figueroa, Valentina Presutti, Irene Celino, Marta Sabou, Lucie-Aimé e Kaffee, and Elena Simperl (Eds.), Vol. 11137. Springer, 376--394.Google ScholarGoogle Scholar
  38. Dá niel Marx. 2010. Approximating fractional hypertree width. ACM Trans. Algorithms, Vol. 6, 2 (2010), 29:1--29:17. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Lukas Moll, Siamak Tazari, and Marc Thurley. 2012. Computing hypergraph width measures exactly. Inf. Process. Lett., Vol. 112, 6 (2012), 238--242. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Dan Olteanu and Jakub Zá vodný. 2015. Size Bounds for Factorised Representations of Query Results. ACM Trans. Database Syst., Vol. 40, 1 (2015), 2:1--2:44.Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Adam Perelman and Christopher Ré. 2015. DunceCap: Compiling Worst-Case Optimal Query Plans. In Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31 - June 4, 2015, Timos K. Sellis, Susan B. Davidson, and Zachary G. Ives (Eds.). ACM, 2075--2076.Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Francc ois Picalausa and Stijn Vansummeren. 2011. What are real SPARQL queries like?. In Proceedings of the International Workshop on Semantic Web Information Management, SWIM 2011, Athens, Greece, June 12, 2011, Roberto De Virgilio, Fausto Giunchiglia, and Letizia Tanca (Eds.). ACM, 7. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Rachel Pottinger and Alon Y. Halevy. 2001. MiniCon: A scalable algorithm for answering queries using views. VLDB J., Vol. 10, 2--3 (2001), 182--198. Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. Francesco Scarcello, Gianluigi Greco, and Nicola Leone. 2007. Weighted hypertree decompositions and optimal query plans. J. Comput. Syst. Sci., Vol. 73, 3 (2007), 475--506. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Werner Schafhauser. 2006. New heuristic methods for tree decompositions and generalized hypertree decompositions . Master's thesis. Technische Universit"at Wien.Google ScholarGoogle Scholar
  46. Aaron Schild and Christian Sommer. 2015. On Balanced Separators in Road Networks. In Experimental Algorithms - 14th International Symposium, SEA 2015, Paris, France, June 29 - July 1, 2015, Proceedings (Lecture Notes in Computer Science), Evripidis Bampis (Ed.), Vol. 9125. Springer, 286--297. Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. Transaction Processing Performance Council (TPC). 2014. TPC-H decision support benchmark. http://www.tpc.org/tpch/default.aspGoogle ScholarGoogle Scholar
  48. Susan Tu and Christopher Ré. 2015. DunceCap: Query Plans Using Generalized Hypertree Decompositions. In Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31 - June 4, 2015, Timos K. Sellis, Susan B. Davidson, and Zachary G. Ives (Eds.). ACM, 2077--2078. Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. V. N. Vapnik and A. Ya. Chervonenkis. 1971. On the Uniform Convergence of Relative Frequencies of Events to Their Probabilities. Theory of Probability & Its Applications, Vol. 16, 2 (jan 1971), 264--280.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. HyperBench: A Benchmark and Tool for Hypergraphs and Empirical Findings

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          PODS '19: Proceedings of the 38th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems
          June 2019
          494 pages
          ISBN:9781450362276
          DOI:10.1145/3294052

          Copyright © 2019 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 25 June 2019

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          PODS '19 Paper Acceptance Rate29of87submissions,33%Overall Acceptance Rate642of2,707submissions,24%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader