skip to main content
10.1145/1807085.1807114acmconferencesArticle/Chapter ViewAbstractPublication PagespodsConference Proceedingsconference-collections
research-article

On probabilistic fixpoint and Markov chain query languages

Published:06 June 2010Publication History

ABSTRACT

We study highly expressive query languages such as datalog, fixpoint, and while-languages on probabilistic databases. We generalize these languages such that computation steps (e.g. datalog rules) can fire probabilistically. We define two possible semantics for such query languages, namely inflationary semantics where the results of each computation step are added to the current database and noninflationary queries that induce a random walk in-between database instances. We then study the complexity of exact and approximate query evaluation under these semantics.

References

  1. S. Abiteboul, R. Hull, and V. Vianu. Foundations of Databases. Addison-Wesley, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. P. Agrawal, O. Benjelloun, A. Das Sarma, C. Hayworth, S. U. Nabar, T. Sugihara, and J. Widom. "Trio: A System for Data, Uncertainty, and Lineage". In VLDB, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. L. Antova, C. Koch, and D. Olteanu. "From Complete to Incomplete Information and Back". In Proc. SIGMOD, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. L. Antova, C. Koch, and D. Olteanu. "Query Language Support for Incomplete Information in the MayBMS System". In Proc. VLDB, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. V. Bansal. "Computational Methods for Analyzing Human Genetic Variations". PhD thesis, University of California, San Diego, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. O. Benjelloun, A. D. Sarma, C. Hayworth, and J. Widom. "An Introduction to ULDBs and the Trio System". IEEE Data Engineering Bulletin, 2006.Google ScholarGoogle Scholar
  7. D. P. Bertsekas and J. N. Tsitsiklis. Introduction to Probability. MIT Press, 2008.Google ScholarGoogle Scholar
  8. N. Dalvi and D. Suciu. "Efficient query evaluation on probabilistic databases". VLDB Journal, 16(4):523--544, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. L. De Raedt, A. Kimmig, and H. Toivonen. "ProbLog: A Probabilistic Prolog and Its Application in Link Discovery". In IJCAI, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. D. Freedman. Markov Chains. Springer-Verlag, 1983.Google ScholarGoogle Scholar
  11. N. Fuhr. "Probabilistic Datalog - A Logic For Powerful Retrieval Methods". In Proc. SIGIR, pages 282--290, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. M. Goetz and C. Koch. "A Compositional Framework for Complex Queries over Uncertain Data". In Proc. ICDT, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. T. J. Green and V. Tannen. "Models for Incomplete and Probabilistic Information". IEEE Data Eng. Bull., 29(1):17--24, 2006.Google ScholarGoogle Scholar
  14. M. Jerrum and A. Sinclair. "The Markov chain Monte Carlo method: an approach to approximate counting and integration". Approximation algorithms for NP-hard problems, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. C. Koch. "Approximating Predicates and Expressive Queries on Probabilistic Databases". In Proc. PODS, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. C. Koch. "On Query Algebras for Probabilistic Databases". SIGMOD Record, 37(4):78--85, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. C. Koch. "A Compositional Query Algebra for Second-Order Logic and Uncertain Databases". In Proc. ICDT, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. C. H. Papadimitriou. Computational complexity. Addison-Wesley, 1994.Google ScholarGoogle Scholar
  19. D. Randall. "Mixing (a tutorial on Markov Chains)". In FOCS, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. C. Re, N. Dalvi, and D. Suciu. "Efficient Top-k Query Evaluation on Probabilistic Data". In ICDE, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  21. P. Sen and A. Deshpande. "Representing and Querying Correlated Tuples in Probabilistic Databases". In ICDE, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  22. D. Sorensen and D. Gianola. "Likelihood, Bayesian, and MCMC Methods in Quantitative Genetics". Springer-Verlag, New York, July 2002.Google ScholarGoogle ScholarCross RefCross Ref
  23. Stanford Trio Project. "TriQL - The Trio Query Language", 2006.Google ScholarGoogle Scholar
  24. L. Valiant. "The complexity of computing the permanent". Theoretical Computer Science, 8(2):189--201, 1979.Google ScholarGoogle ScholarCross RefCross Ref
  25. M. Y. Vardi. "The Complexity of Relational Query Languages". In Proc. STOC, pages 137--146, 1982. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. V. V. Vazirani. Approximation Algorithms. Springer, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. On probabilistic fixpoint and Markov chain query languages

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in
            • Published in

              cover image ACM Conferences
              PODS '10: Proceedings of the twenty-ninth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
              June 2010
              350 pages
              ISBN:9781450300339
              DOI:10.1145/1807085

              Copyright © 2010 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 6 June 2010

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • research-article

              Acceptance Rates

              PODS '10 Paper Acceptance Rate27of113submissions,24%Overall Acceptance Rate642of2,707submissions,24%

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader