skip to main content
10.1145/3430984.3431011acmotherconferencesArticle/Chapter ViewAbstractPublication PagescodsConference Proceedingsconference-collections
research-article

Bootstrapping Chatbot Interfaces to Databases

Published:02 January 2021Publication History

ABSTRACT

In this paper, we present an automated technique for creating a chatbot interface to databases. To the best of our knowledge, our technique is the first one for bootstrapping chatbots for question answering on relational databases. Our method leverages the natural language classifiers present in industrial chatbot platforms for natural language to structure query translation. We use our framework to create usable chatbots starting from the databases in a short time. We instantiated several chatbots in different domains and demonstrate the usefulness of it.

References

  1. [n.d.]. 5 Metrics every chatbot should track. https://www.topbots.com/5-bot-metrics-every-chatbot-should-track/. ([n. d.]).Google ScholarGoogle Scholar
  2. [n.d.]. Ontology. http://tomgruber.org/writing/ontology-definition-2007.htm.Google ScholarGoogle Scholar
  3. [n.d.]. W3C. http://www.w3.org/TR/owl-guide/.Google ScholarGoogle Scholar
  4. Ricardo Baeza-Yates, Carlos Hurtado, and Marcelo Mendoza. 2004. Query Recommendation Using Query Logs in Search Engines(EDBT’04). Springer-Verlag, Berlin, Heidelberg, 588–596. https://doi.org/10.1007/978-3-540-30192-9_58Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Huanhuan Cao, Daxin Jiang, Jian Pei, Qi He, Zhen Liao, Enhong Chen, and Hang Li. 2008. Context-aware Query Suggestion by Mining Click-through and Session Data. In KDD (Las Vegas, Nevada, USA) (KDD ’08). ACM, New York, NY, USA, 875–883. https://doi.org/10.1145/1401890.1401995Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Basil Ell, Denny Vrandečić, and Elena Simperl. 2015. SPARTIQULATION: Verbalizing SPARQL Queries. In The Semantic Web: ESWC 2012 Satellite Events, Elena Simperl, Barry Norton, Dunja Mladenic, Emanuele Della Valle, Irini Fundulaki, Alexandre Passant, and Raphaël Troncy (Eds.). Springer Berlin Heidelberg, Berlin, Heidelberg, 117–131.Google ScholarGoogle Scholar
  7. Raquel Fernández and Jonathan Ginzburg. 2002. Non-sentential utterances: Grammar and dialogue dynamics in corpus annotation. In Proceedings of the 19th international conference on Computational linguistics-Volume 1. Association for Computational Linguistics, 1–7.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Raquel Fernández, Jonathan Ginzburg, and Shalom Lappin. 2007. Classifying non-sentential utterances in dialogue: A machine learning approach. Computational Linguistics 33, 3 (2007), 397–427.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Matthew Henderson, Blaise Thomson, and Jason D Williams. 2014. The third dialog state tracking challenge. In Spoken Language Technology Workshop (SLT), 2014 IEEE. IEEE, 324–329.Google ScholarGoogle ScholarCross RefCross Ref
  10. William J. Holton. 2015. Translating Database Queries to English for Enhancing Database Education. (02 2015).Google ScholarGoogle Scholar
  11. Manasa Jammi, Jaydeep Sen, Ashish Mittal, Sagar Verma, Vardaan Pahuja, Rema Ananthanarayanan, Pranay Lohia, Hima Karanam, Diptikalyan Saha, and Karthik Sankaranarayanan. 2018. Tooling Framework for Instantiating Natural Language Querying System. Proceedings of the VLDB Endowment 11, 12 (2018).Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Heeyoung Lee, Yves Peirsman, Angel Chang, Nathanael Chambers, Mihai Surdeanu, and Dan Jurafsky. 2011. Stanford’s multi-pass sieve coreference resolution system at the CoNLL-2011 shared task. In Proceedings of the Fifteenth Conference on Computational Natural Language Learning: Shared Task. Association for Computational Linguistics, 28–34.Google ScholarGoogle Scholar
  13. Chuan Lei, Fatma Ozcan, Abdul Quamar, Ashish Mittal, Jaydeep Sen, Diptikalyan Saha, and Karthik Sankaranarayanan. 2018. Ontology-Based Natural Language Query Interfaces for Data Exploration. Data Engineering (2018), 52.Google ScholarGoogle Scholar
  14. Fei Li and H. V. Jagadish. 2014. Constructing an Interactive Natural Language Interface for Relational Databases. Proc. VLDB Endow. 8, 1 (2014), 73–84.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Jiwei Li, Michel Galley, Chris Brockett, Jianfeng Gao, and Bill Dolan. 2016. A Persona-Based Neural Conversation Model. CoRR abs/1603.06155(2016). arxiv:1603.06155http://arxiv.org/abs/1603.06155Google ScholarGoogle Scholar
  16. Chia-Wei Liu, Ryan Lowe, Iulian V. Serban, Michael Noseworthy, Laurent Charlin, and Joelle Pineau. 2016. How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation. arXiv:arXiv:1603.08023Google ScholarGoogle Scholar
  17. W. S. Luk and Steve Kloster. 1986. ELFS: English Language from SQL. ACM Trans. Database Syst. 11, 4 (Dec. 1986), 447–472. https://doi.org/10.1145/7239.384276Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Christopher D Manning, Mihai Surdeanu, John Bauer, Jenny Rose Finkel, Steven Bethard, and David McClosky. 2014. The stanford corenlp natural language processing toolkit.. In ACL (System Demonstrations). 55–60.Google ScholarGoogle Scholar
  19. Ashish Mittal, Jaydeep Sen, Diptikalyan Saha, and Karthik Sankaranarayanan. 2018. An Ontology based Dialog Interface to Database. In Proceedings of the 2018 International Conference on Management of Data. ACM, 1749–1752.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Vincent Ng. 2010. Supervised noun phrase coreference research: The first fifteen years. In Proceedings of the 48th annual meeting of the association for computational linguistics. Association for Computational Linguistics, 1396–1411.Google ScholarGoogle Scholar
  21. Axel-Cyrille Ngonga Ngomo, Lorenz Bühmann, Christina Unger, Jens Lehmann, and Daniel Gerber. 2013. Sorry, I Don’T Speak SPARQL: Translating SPARQL Queries into Natural Language. In Proceedings of the 22Nd International Conference on World Wide Web (Rio de Janeiro, Brazil) (WWW ’13). ACM, New York, NY, USA, 977–988. https://doi.org/10.1145/2488388.2488473Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Hoifung Poon and Pedro Domingos. 2008. Joint unsupervised coreference resolution with Markov logic. In Proceedings of the conference on empirical methods in natural language processing. Association for Computational Linguistics, 650–659.Google ScholarGoogle ScholarCross RefCross Ref
  23. Ana-Maria Popescu, Oren Etzioni, and Henry Kautz. 2003. Towards a Theory of Natural Language Interfaces to Databases. In IUI.Google ScholarGoogle Scholar
  24. Dinesh Raghu, Sathish Indurthi, Jitendra Ajmera, and Sachindra Joshi. 2015. A Statistical Approach for Non-Sentential Utterance Resolution for Interactive QA System. In 16th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Vol. 335.Google ScholarGoogle ScholarCross RefCross Ref
  25. Diptikalyan Saha, Avrilia Floratou, Karthik Sankaranarayanan, Umar Farooq Minhas, Ashish R. Mittal, and Fatma Özcan. 2016. ATHENA: An Ontology-driven System for Natural Language Querying over Relational Data Stores. Proc. VLDB Endow. 9, 12 (Aug. 2016), 1209–1220. https://doi.org/10.14778/2994509.2994536Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Diptikalyan Saha, Neelamadhav Gantayat, Senthil Mani, and Barry Mitchell. 2017. Natural language querying in SAP-ERP platform. In Proceedings of the 2017 11th Joint Meeting on Foundations of Software Engineering. ACM, 878–883.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Jaydeep Sen, Ashish R Mittal, Diptikalyan Saha, and Karthik Sankaranarayanan. 2018. Functional Partitioning of Ontologies for Natural Language Query Completion in Question Answering Systems.. In IJCAI. 4331–4337.Google ScholarGoogle Scholar
  28. Bayu Setiaji and Ferry Wahyu Wibowo. 2016. Chatbot Using A Knowledge in Database. In 2016 7th International Conference on Intelligent Systems, Modelling and Simulation. IEEE, 72–77.Google ScholarGoogle Scholar
  29. K. Shabaza, Jim D. O’Shea, Keeley A. Crockett, and A. Latham. 2015. Aneesah: A Conversational Natural Language Interface to Databases. In Proceedings of The World Congress on Engineering 2015. LNCS, 227–232.Google ScholarGoogle Scholar
  30. Alane Suhr, Srinivasan Iyer, and Yoav Artzi. 2018. Learning to Map Context-Dependent Sentences to Executable Formal Queries. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), Vol. 1. 2238–2249.Google ScholarGoogle ScholarCross RefCross Ref
  31. Sujatha R Upadhyaya and P Sreenivasa Kumar. 2005. ERONTO: a tool for extracting ontologies from extended E/R diagrams. In Proceedings of the 2005 ACM symposium on Applied computing. ACM, 666–670.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Oriol Vinyals and Quoc V. Le. 2015. A Neural Conversational Model. CoRR abs/1506.05869(2015). arxiv:1506.05869http://arxiv.org/abs/1506.05869Google ScholarGoogle Scholar
  33. Yushi Wang, Jonathan Berant, and Percy Liang. 2015. Building a Semantic Parser Overnight. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, ACL 2015, July 26-31, 2015, Beijing, China, Volume 1: Long Papers. 1332–1342. http://aclweb.org/anthology/P/P15/P15-1129.pdfGoogle ScholarGoogle Scholar
  34. Xiaojun Xu, Chang Liu, and Dawn Song. 2017. Sqlnet: Generating structured queries from natural language without reinforcement learning. arXiv preprint arXiv:1711.04436(2017).Google ScholarGoogle Scholar
  35. Victor Zhong, Caiming Xiong, and Richard Socher. 2017. Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning. CoRR abs/1709.00103(2017). arxiv:1709.00103http://arxiv.org/abs/1709.00103Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    CODS-COMAD '21: Proceedings of the 3rd ACM India Joint International Conference on Data Science & Management of Data (8th ACM IKDD CODS & 26th COMAD)
    January 2021
    453 pages

    Copyright © 2021 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 2 January 2021

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited

    Acceptance Rates

    Overall Acceptance Rate197of680submissions,29%
  • Article Metrics

    • Downloads (Last 12 months)22
    • Downloads (Last 6 weeks)3

    Other Metrics

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format