Skip to main content
Log in

Representation, Analysis, and Extraction of Knowledge from Unstructured Natural Language Texts

  • SOFTWARE–HARDWARE SYSTEMS
  • Published:
Cybernetics and Systems Analysis Aims and scope

Abstract

This article overviews means of description logics for representing knowledge contained in natural language texts and a classification of description logics by constructors of concepts and roles. It also considers basic conceptions of temporal description logics. An approach to the construction of systems for analyzing natural language texts based on problems of parts-of-speech tagging, dependency parsing, and coreference resolution is described. Examples of using natural language knowledge bases to solve applied problems are given, in particular, those of checking the integrity of a text and revealing contradictions.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  1. F. Baader, D. Calvanese, D. L. McGuinness, D. Nardi, and P. F. Patel-Schneider (eds.), The Description Logic Handbook: Theory, Implementation, and Applications, Cambridge University Press (2007).

    Google Scholar 

  2. S. L. Kryvyi and H. I. Hoherchak, “Logic in mathematics and informatics,” in: Proc. of the First Ukrainian Conf. “Logic and Its Applications” (UCLA’2019) (Kyiv, Nov 26–28, 2019), AVANPOST-PRYM, Kyiv (2019), pp. 47–55.

  3. C. Lutz, F. Wolter, and M. Zakharyaschev, “Temporal description logics: A survey,” in: Proc. of the 15th Intern. Symp. on Temporal Representation and Reasoning (Montreal, Canada, June 16–18, 2008), IEEE Computer Society (2008), pp. 3–14. https://doi.org/10.1109/TIME.2008.14.

  4. C. Lutz, H. Sturm, F. Wolter, and M. Zakharyaschev, “Tableaux for temporal description logic with constant domains,” in: R. Goré, A. Leitsch, and T. Nipkow (eds.), Automated Reasoning, IJCAR 2001; Lecture Notes in Computer Science, Vol. 2083, Springer, Berlin–Heidelberg, 121–136 (2001). https://doi.org/10.1007/3-540-45744-5_10.

  5. S. Lai, K. S. Leung, and Y. Leung, “SUNNYNLP at SemEval-2018 Task 10: A Support-Vector-Machine-based method for detecting semantic difference using taxonomy and word embedding features,” in: Proc. of the 12th Intern. Workshop on Semantic Evaluation (SemEval-2018) (New Orleans, USA, June 5–6, 2018), Association for Computational Linguistics (2018), pp. 741–746. https://doi.org/10.18653/v1/S18-1118.

  6. J. Zhan and H. Zhao, “Span model for open information extraction on accurate corpus,” in: Proc. of the AAAI Conf. on Artificial Intelligence, Vol. 34, No. 05 (2020), pp. 9523–9530. https://doi.org/10.1609/aaai.v34i05.6497.

  7. A. Gangemi, V. Presutti, D. Reforgiato Recupero, A. G. Nuzzolese, F. Draicchio, and M. Mongiovi, “Semantic Web machine reading with FRED,” Semantic Web, Vol. 8, Iss. 6, 873–893 (2017). https://doi.org/10.3233/SW-160240.

  8. D. Reforgiato Recupero, A. G. Nuzzolese, S. Consoli, V. Presutti, M. Mongiovi, and S. Peroni, “Extracting knowledge from text using SHELDON, a Semantic Holistic framEwork for LinkeD ONtology data,” in: WWW’15 Companion: Proc. of the 24th Intern. Conf. on World Wide Web (Florence, Italy, May 2015), Association for Computing Machinery, New York (2015), pp. 235–238. https://doi.org/10.1145/2740908.2742842.

  9. H. I. Hoherchak, “Knowledge bases and description logics applications to natural language texts analysis,” Problems in Programming, No. 2–3, 259–269 (2020). https://doi.org/10.15407/pp2020.02-03.259.

  10. S. L. Kryvyi, N. P. Darchuk, and O. I. Provotar, “Ontological similar systems for analysis of texts of natural language,” Problems in Programming, No. 2–3, 132–139 (2018).

  11. A. V. Palagin, S. L. Kryvyi, and N. G. Petrenko, ”Knowledge-oriented information systems with the processing of natural language objects: the basis of ethodology, architectural and structural organization,“ Upr. Sis. Mash., No. 3, 42–55 (2009).

  12. A. V. Palagin, S. L. Kryvyi, and N. G. Petrenko, “On the automation of the process of extracting knowledge from natural language texts,” Natural and Artificial Intelligence Intern. Book Series, Inteligent Processing, No. 9, 44–52, ITHEA, Sofia (2012).

  13. A. V. Palagin, S. L. Kryvyi, D. S. Bibikov, “Natural language sentence processing using dictionaries and words frequency,” Natural and Artificial Intelligence Intern. Book Series, Inteligent Processing, No. 9, 44–52, ITHEA, Sofia (2010).

  14. R. McDonald, J. Nivre, Y. Quirmbach-Brundage, Y. Goldberg, D. Das, K. Ganchev, K. Hall, S. Petrov, H. Zhang, O. Tackstrom, C. Bedini, N. B. Castelló, and J. Lee, “Universal dependency annotation for multilingual parsing,” in: Proc. of the 51st Annual Meeting of the Association for Computational Linguistics (Sofia, Bulgaria, August 4–9, 2013), Vol. 2: Short Papers, Association for Computational Linguistics (2013), pp. 92–97.

  15. K. Mrini, F. Dernoncourt, T. Bui, W. Chang, N. Nakashole, “Rethinking self-attention: An interpretable self-attentive encoder-decoder parser,” in: T. Cohn, Y. He, and Y. Liu (eds.), Findings of the Association for Computational Linguistics: EMNLP 2020, Association for Computational Linguistics (2020), pp. 731–742. https://doi.org/10.18653/v1/2020.findings-emnlp.65.

  16. W. Che, Y. Lui, Y. Wang, B. Zheng, and T. Liu, “Towards better UD parsing: Deep contextualized word embeddings, ensemble, and treebank concatenation,” in: Proc. of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies (Brussels, Belgium, October 31 – November 1, 2018), Association for Computational Linguistics (2018), pp. 55–64. https://doi.org/10.18653/v1/K18-2005.

  17. N. Darchuk, “Automatic syntactic analysis of texts from Ukrainian language corpus,” Ukrainian Linguistics, Issue 43, 11–19 (2013).

  18. M. Vilain, J. Burger, J. Aberdeen, D. Connolly, and L. Hirschman, “A model-theoretic coreference scoring scheme,” in: MUC6’95: Proc. of the 6th Message Understanding Conf. (Maryland, USA, November 6–8, 1995), Association for Computational Linguistics (1995), pp. 45–52. https://doi.org/10.3115/1072399.1072405.

  19. V. Stoyanov, N. Gilbert, C. Cardie, and E. Riloff, “Conundrums in noun phrase coreference resolution: Making sense of the state-of-the-art,” in: Proc. of the Joint Conf. of the 47th Annual Meeting of the ACL and the 4th Intern. Joint Conf. on Natural Language Processing of the AFNLP (Singapore, August 2–7, 2009), Association for Computational Linguistics (2009), pp. 656–664. https://doi.org/10.3115/1690219.1690238.

  20. X. Luo, “On coreference resolution performance metrics,” in: HLT’05: Proc. of the Conf. on Human Language Technology and Empirical Methods in Natural Language Processing (Vancouver, Canada, October, 2005), Association for Computational Linguistics (2005), pp. 25–32. https://doi.org/10.3115/1220575.1220579.

  21. J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, “BERT: Pre-training of deep bidirectional transformers for language understanding,” in: Proc. of the 2019 Conf. of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Minneapolis, USA, June 2–7, 2019), Vol. 1 (Long and Short Papers), Association for Computational Linguistics (2019), pp. 4171–4186. https://doi.org/10.18653/v1/N19-1423.

  22. L. Xu and J. D. Choi, “Revealing the myth of higher-order inference in coreference resolution,” in: Proc. of the 2020 Conf. on Empirical Methods in Natural Language Processing (EMNLP) (online, November 16–20, 2020), Association for Computational Linguistics (2020), pp. 8527–8533. https://doi.org/10.18653/v1/2020.emnlp-main.686.

  23. N. V. Lukashevich, Thesauri in Information Retrieval Problems [in Russian], Izd-vo Mosk. Un-ta, Moscow (2011).

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to H. Hoherchak.

Additional information

Translated from Kibernetyka ta Systemnyi Analiz, No. 3, May–June, 2021, pp. 164–183.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hoherchak, H., Darchuk, N. & Kryvyi, S. Representation, Analysis, and Extraction of Knowledge from Unstructured Natural Language Texts. Cybern Syst Anal 57, 481–500 (2021). https://doi.org/10.1007/s10559-021-00373-7

Download citation

  • Received:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10559-021-00373-7

Keywords

Navigation