skip to main content
10.1145/3035918.3064017acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections

QIRANA: A Framework for Scalable Query Pricing

Published:09 May 2017Publication History

ABSTRACT

Users are increasingly engaging in buying and selling data over the web. Facilitated by the proliferation of online marketplaces that bring such users together, data brokers need to serve requests where they provide results for user queries over the underlying datasets, and price them fairly according to the information disclosed by the query. In this work, we present a novel pricing system, called QIRANA, that performs query-based data pricing for a large class of SQL queries (including aggregation) in real time. QIRANA provides prices with formal guarantees: for example, it avoids prices that create arbitrage opportunities. Our framework also allows flexible pricing, by allowing the data seller to choose from a variety of pricing functions, as well as specify relation and attribute-level parameters that control the price of queries and assign different value to different portions of the data. We test QIRANA on a variety of real-world datasets and query workloads, and we show that it can efficiently compute the prices for queries over large-scale data.

References

  1. S. Abiteboul and O. M. Duschka. Complexity of answering queries using materialized views. In PODS, pages 254--263. ACM Press, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Y. Ahmad, O. Kennedy, C. Koch, and M. Nikolic. Dbtoaster: Higher-order delta processing for dynamic, frequently fresh views. Proceedings of the VLDB Endowment, 5(10):968--979, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. Balazinska, B. Howe, and D. Suciu. Data markets in the cloud: An opportunity for the database community. PVLDB, 4(12), 2011.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Banjo. ban.jo.Google ScholarGoogle Scholar
  5. Big Data Exchange. www.bigdataexchange.com.Google ScholarGoogle Scholar
  6. J. A. Blakeley, N. Coburn, P. Larson, et al. Updating derived relations: Detecting irrelevant and autonomously computable updates. ACM Transactions on Database Systems (TODS), 14(3):369--400, 1989. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Bloomberg Market Data. www.bloomberg.com/enterprise/content-data/market-data.Google ScholarGoogle Scholar
  8. O. P. Buneman and E. K. Clemons. Efficiently monitoring relational databases. ACM Transactions on Database Systems (TODS), 4(3):368--382, 1979. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. S. Chaudhuri, R. Krishnamurthy, S. Potamianos, and K. Shim. Optimizing queries with materialized views. In Data Engineering, 1995. Proceedings of the Eleventh International Conference on, pages 190--200. IEEE, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. N. N. Dalvi, C. Ré, and D. Suciu. Probabilistic databases: diamonds in the dirt. Commun. ACM, 52(7):86--94, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. DataFinder. datafinder.com.Google ScholarGoogle Scholar
  12. DBLP dataset. https://snap.stanford.edu/data/com-DBLP.html.Google ScholarGoogle Scholar
  13. S. Deep and P. Koutris. The design of arbitrage-free data pricing schemes. arXiv preprint arXiv:1606.09376, 2016.Google ScholarGoogle Scholar
  14. S. Diamond and S. Boyd. CVXPY: A Python-embedded modeling language for convex optimization. Journal of Machine Learning Research, 17(83):1--5, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. A. Gupta, I. S. Mumick, and V. S. Subrahmanian. Maintaining views incrementally. ACM SIGMOD Record, 22(2):157--166, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. P. Koutris, P. Upadhyaya, M. Balazinska, B. Howe, and D. Suciu. Query-based data pricing. In M. Benedikt, M. Krötzsch, and M. Lenzerini, editors, PODS, pages 167--178. ACM, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. P. Koutris, P. Upadhyaya, M. Balazinska, B. Howe, and D. Suciu. Querymarket demonstration: Pricing for online data markets. PVLDB, 5(12):1962--1965, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. P. Koutris, P. Upadhyaya, M. Balazinska, B. Howe, and D. Suciu. Toward practical query pricing with querymarket. In K. A. Ross, D. Srivastava, and D. Papadias, editors, ACMSIGMOD 2013, pages 613--624. ACM, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Lattice Data Inc. lattice.io.Google ScholarGoogle Scholar
  20. C. Li and G. Miklau. Pricing aggregate queries in a data marketplace. In WebDB, 2012.Google ScholarGoogle Scholar
  21. B. Lin and D. Kifer. On arbitrage-free pricing for general data queries. PVLDB, 7(9):757--768, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. A. Muschalle, F. Stahl, A. Löser, and G. Vossen. Pricing approaches for data markets. In International Workshop on Business Intelligence for the Real-Time Enterprise, pages 129--144. Springer, 2012.Google ScholarGoogle Scholar
  23. B. O'Donoghue, E. Chu, N. Parikh, and S. Boyd. SCS: Splitting conic solver, version 1.2.6. https://github.com/cvxgrp/scs, Apr. 2016.Google ScholarGoogle Scholar
  24. QLik Data Market. www.qlik.com/us/products/qlik-data-market.Google ScholarGoogle Scholar
  25. K. A. Ross, D. Srivastava, and S. Sudarshan. Materialized view maintenance and integrity constraint checking: Trading space for time. In ACM SIGMOD Record, volume 25, pages 447--458. ACM, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. SSB Benchmark. http://www.cs.umb.edu/ poneil/StarSchemaB.PDF.Google ScholarGoogle Scholar
  27. M. Staudt and M. Jarke. Incremental maintenance of externally materialized views. In VLDB, volume 96, pages 3--6, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. TPC-H Benchmark. http://www.tpc.org/tpch.Google ScholarGoogle Scholar
  29. Twitter GNIP Audience API. gnip.com/insights/audience.Google ScholarGoogle Scholar
  30. P. Upadhyaya, M. Balazinska, and D. Suciu. Price-optimal querying with data apis. In PVLDB, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. USA Car crash 2011 dataset. https://datamarket.azure.com/dataset/bigml/carcrashusa2011.Google ScholarGoogle Scholar
  32. Windows Azure Marketplace. www.datamarket.azure.com.Google ScholarGoogle Scholar
  33. world dataset. https://dev.mysql.com/doc/world-setup/en/.Google ScholarGoogle Scholar
  34. Y. Zhuge, H. Garcia-Molina, J. Hammer, and J. Widom. View maintenance in a warehousing environment. ACM SIGMOD Record, 24(2):316--327, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. QIRANA: A Framework for Scalable Query Pricing

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGMOD '17: Proceedings of the 2017 ACM International Conference on Management of Data
      May 2017
      1810 pages
      ISBN:9781450341974
      DOI:10.1145/3035918

      Copyright © 2017 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 9 May 2017

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate785of4,003submissions,20%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader