skip to main content
research-article

A Discourse-Based Approach for Arabic Question Answering

Published:04 November 2016Publication History
Skip Abstract Section

Abstract

The treatment of complex questions with explanatory answers involves searching for arguments in texts. Because of the prominent role that discourse relations play in reflecting text producers’ intentions, capturing the underlying structure of text constitutes a good instructor in this issue. From our extensive review, a system for automatic discourse analysis that creates full rhetorical structures in large-scale Arabic texts is currently unavailable. This is due to the high computational complexity involved in processing a large number of hypothesized relations associated with large texts. Therefore, more practical approaches should be investigated. This article presents a new Arabic Text Parser oriented for question-answering systems dealing with لماذا “why” and كيف “how to” questions. The Text Parser presented here considers the sentence as the basic unit of text and incorporates a set of heuristics to avoid computational explosion. With this approach, the developed question-answering system reached a significant improvement over the baseline with a Recall of 68% and MRR of 0.62.

References

  1. Mohammed Akour, Sameer Abufardeh, Kenneth Magel, and Qasemm Al-Radaideh. 2011. QArabPro: A rule based question answering system for reading comprehension tests in Arabic. American Journal of Applied Science 8, 6, 652--661.Google ScholarGoogle ScholarCross RefCross Ref
  2. Fatima Al Kohlani. 2010. The Function of Discourse Markers in Arabic Newspaper Opinion Articles. PhD thesis, Georgetown University, Washington.Google ScholarGoogle Scholar
  3. Raffaella Bernardi, Jijkoun Valentin, Mishne Gilad, and De Rijke Maarten. 2003. Selectively using linguistic resources throughout the question answering pipeline. In Proceedings of the 2nd CoLogNET. 50--60.Google ScholarGoogle Scholar
  4. Diane Blakemore. 2003. Discourse and relevance theory. In The Handbook of Discourse Analysis. Oxford: Blackwell, 100--115.Google ScholarGoogle Scholar
  5. Eric Breck, John Burger, Lisa Ferro, Warren Greiff, Mani Light, and Jason Rennie. 2000. Another sys called Qanda. In Proceedings of the 9th Text REtrieval Conference, NIST Special Publication 500-246. Maryland, 369--379.Google ScholarGoogle Scholar
  6. Simon Corston-Oliver. 1998. Computing Representations of the Structure of Written Discourse. PhD thesis, University of California, Santa Barbara. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Shehdeh Fareh and Jihad Hamdan. 1999. The translation of arabic ‘wa’ into English: Some problems and implications. In Dirasat: Human and Social Science, 26, 590--603.Google ScholarGoogle Scholar
  8. Vanessa Feng and Graeme Hirst. 2012. Text-level discourse parsing with rich linguistic features. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL’12). Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Bruce Fraser. 1996. Pragmatic markers. Pragmatics 6, 2, 167--190.Google ScholarGoogle ScholarCross RefCross Ref
  10. Ryuichiro Higashinaka and Hideki Isozaki. 2008. Corpus-based question answering for why-question. In Proceedings of the 3rd International Joint Conference on Natural Language Processing. 1, 418--425.Google ScholarGoogle Scholar
  11. Ahmed Ibrahim and Tarek Elghazaly. 2012. Arabic text summarization using rhetorical structure theory. In 8th International Conference on Informatics and Systems (INFOS’12). IEEE, 34--38.Google ScholarGoogle Scholar
  12. Julian Kupice. 1999. MURAX: Finding and organizing answers form text search. Natural Language Information Retrieval. Netherlands, 311--332.Google ScholarGoogle Scholar
  13. William Mann and Sandra Thompson. 1988. A rhetorical structure theory: Toward a functional theory of text organization. Text-Interdisciplinary Journal for the study of Discourse 8, 3, 243--281.Google ScholarGoogle Scholar
  14. Daniel Marcu. 2000a. The rhetorical parsing of unrestricted texts: A surface-based approach. Computational Linguistics 26, 3, 395--448. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Daniel Marcu. 2000b. The Theory and Practice of Discourse Parsing and Summarization. MIT Press, London. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Hassan Mathkour, Ameur Touir, and Waleed Al-Sanea. 2008. Parsing arabic texts using rhetorical structure theory. Journal of Computer Science 4, 9, 713--720.Google ScholarGoogle ScholarCross RefCross Ref
  17. Jawad Sadek and Farid Meziane. 2016. Extracting arabic causal relations using linguistic patterns. Journal ACM Translations on Asian and Low-Resource Language Information Processing 15, 3, Article 14. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Jawad Sadek. 2013. Automatic detection of Arabic causal relations. In Proceedings of the 18th International Conference on Application of Natural Language to Information Systems (NLDB’13). UK, 400--403.Google ScholarGoogle ScholarCross RefCross Ref
  19. Jawad Sadek, Fairouz Chakkour, and Farid Meziane. 2012. Arabic rhetorical relations extraction for answering “why” and “how to” questions. In Proceedings of the 17th International Conference on Application of Natural Language to Information Systems (NLDB’12). The Netherlands, 385--390. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Ted Sanders and Leo Noordman. 2000. The role of coherence relations and their linguistic markers in text processing. Discourse Processes 29, 1, 37--60.Google ScholarGoogle ScholarCross RefCross Ref
  21. Deborah Schiffrin, Deborah Tannen, and Heidi Hamilton. 2001. Discourse Markers: Language, Meaning, and Context. Basil Blackwell, Oxford.Google ScholarGoogle Scholar
  22. Bernard Schneuwly. 1997. Textual organizers and text types: Ontogenetic aspects in writing. Processing Interclausal Relationships: Studies in the Production and Comprehension of Text, 245--263.Google ScholarGoogle Scholar
  23. Erwin Segal, Judith Duchan, and Paula Scott. 1991. The role of interclausal connectives in narrative structuring: Evidence from adults’ interpretation of simple stories. Discourse Processes 14, 27--54.Google ScholarGoogle ScholarCross RefCross Ref
  24. Hideki Shima and Teruko Mitamura. 2007. JAVELIN III: Answering non-factoid question in Japanese. In Proceedings of NTCIR-6 Workshop Meeting. Tokyo, 464--468.Google ScholarGoogle Scholar
  25. Radu Soricut and Daniel Marcu. 2003. Sentence level discourse parsing using syntactic and lexical information. In Proceedings of the Human Language Technology and North American Association for Computational Linguistics. Canada, 149--156. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Mihai Surdeanu, Massimiliano Giaramita, and Hugo Zaragoza. 2008. Learning to rank answers on large online qa collections. In Proceedings of ACL’08. 719--727.Google ScholarGoogle Scholar
  27. Daphne Theijssen. 2007. Feature for Automatic Discourse Analysis of Paragraphs. Master's thesis. Radboud Universiteit Nijmegen, The Netherlands.Google ScholarGoogle Scholar
  28. Sander Timmerman. 2007. Automatic Recognition of Structural Relations in Dutch Text. Master's thesis. University of Twente, The Netherlands.Google ScholarGoogle Scholar
  29. Suzan Verberne. 2007. Paragraph retrieval for why-question answering. In Proceedings of the 30th Annual International ACMSIGR Conference on Research and Development in Information Retrieval. New York, 922--927. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Suzan Verberne, Lou Boves, and Nelleke Osstdijk. 2007. Discourse-based answering of why-questions. Traitement Automatique Des Langues, Special Issue on Computational Approaches to Discourse and Document Processing 47, 2, 21--41.Google ScholarGoogle Scholar
  31. William Wright and Paul Caspari. 1896. A Grammar of the Arabic Language. Cambridge University Press, UK.Google ScholarGoogle Scholar
  32. Mai Zaki 2011. The Semantics and Pragmatics of Demonstratives in English and Arabic. PhD thesis, University of Middlesex.Google ScholarGoogle Scholar

Index Terms

  1. A Discourse-Based Approach for Arabic Question Answering

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Asian and Low-Resource Language Information Processing
      ACM Transactions on Asian and Low-Resource Language Information Processing  Volume 16, Issue 2
      TALLIP Notes and Regular Papers
      June 2017
      136 pages
      ISSN:2375-4699
      EISSN:2375-4702
      DOI:10.1145/3008658
      Issue’s Table of Contents

      Copyright © 2016 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 4 November 2016
      • Accepted: 1 August 2016
      • Revised: 1 March 2016
      • Received: 1 October 2015
      Published in tallip Volume 16, Issue 2

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader