Representation, Analysis, and Extraction of Knowledge from Unstructured Natural Language Texts

Hoherchak, H.; Darchuk, N.; Kryvyi, S.

doi:10.1007/s10559-021-00373-7

Representation, Analysis, and Extraction of Knowledge from Unstructured Natural Language Texts

SOFTWARE–HARDWARE SYSTEMS
Published: 27 May 2021

Volume 57, pages 481–500, (2021)
Cite this article

Cybernetics and Systems Analysis Aims and scope

H. Hoherchak¹,
N. Darchuk¹ &
S. Kryvyi¹

66 Accesses
7 Citations
Explore all metrics

Abstract

This article overviews means of description logics for representing knowledge contained in natural language texts and a classification of description logics by constructors of concepts and roles. It also considers basic conceptions of temporal description logics. An approach to the construction of systems for analyzing natural language texts based on problems of parts-of-speech tagging, dependency parsing, and coreference resolution is described. Examples of using natural language knowledge bases to solve applied problems are given, in particular, those of checking the integrity of a text and revealing contradictions.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multi-level Modeling of Structural Elements of Natural Language Texts and Its Applications

Extracting and structuring information from the electronic medical text: state of the art and trendy directions

Article 01 August 2023

Mohamed Yassine Landolsi, Lobna Hlaoua & Lotfi Ben Romdhane

Extracting Schema Knowledge from Text Documents: Part I. Problem Formulation and Method Overview

Article 01 December 2021

M. I. Suvorova, M. V. Kobozeva, … S. Y. Toldova

References

F. Baader, D. Calvanese, D. L. McGuinness, D. Nardi, and P. F. Patel-Schneider (eds.), The Description Logic Handbook: Theory, Implementation, and Applications, Cambridge University Press (2007).
Google Scholar
S. L. Kryvyi and H. I. Hoherchak, “Logic in mathematics and informatics,” in: Proc. of the First Ukrainian Conf. “Logic and Its Applications” (UCLA’2019) (Kyiv, Nov 26–28, 2019), AVANPOST-PRYM, Kyiv (2019), pp. 47–55.
C. Lutz, F. Wolter, and M. Zakharyaschev, “Temporal description logics: A survey,” in: Proc. of the 15th Intern. Symp. on Temporal Representation and Reasoning (Montreal, Canada, June 16–18, 2008), IEEE Computer Society (2008), pp. 3–14. https://doi.org/10.1109/TIME.2008.14.
C. Lutz, H. Sturm, F. Wolter, and M. Zakharyaschev, “Tableaux for temporal description logic with constant domains,” in: R. Goré, A. Leitsch, and T. Nipkow (eds.), Automated Reasoning, IJCAR 2001; Lecture Notes in Computer Science, Vol. 2083, Springer, Berlin–Heidelberg, 121–136 (2001). https://doi.org/10.1007/3-540-45744-5_10.
S. Lai, K. S. Leung, and Y. Leung, “SUNNYNLP at SemEval-2018 Task 10: A Support-Vector-Machine-based method for detecting semantic difference using taxonomy and word embedding features,” in: Proc. of the 12th Intern. Workshop on Semantic Evaluation (SemEval-2018) (New Orleans, USA, June 5–6, 2018), Association for Computational Linguistics (2018), pp. 741–746. https://doi.org/10.18653/v1/S18-1118.
J. Zhan and H. Zhao, “Span model for open information extraction on accurate corpus,” in: Proc. of the AAAI Conf. on Artificial Intelligence, Vol. 34, No. 05 (2020), pp. 9523–9530. https://doi.org/10.1609/aaai.v34i05.6497.
A. Gangemi, V. Presutti, D. Reforgiato Recupero, A. G. Nuzzolese, F. Draicchio, and M. Mongiovi, “Semantic Web machine reading with FRED,” Semantic Web, Vol. 8, Iss. 6, 873–893 (2017). https://doi.org/10.3233/SW-160240.
D. Reforgiato Recupero, A. G. Nuzzolese, S. Consoli, V. Presutti, M. Mongiovi, and S. Peroni, “Extracting knowledge from text using SHELDON, a Semantic Holistic framEwork for LinkeD ONtology data,” in: WWW’15 Companion: Proc. of the 24th Intern. Conf. on World Wide Web (Florence, Italy, May 2015), Association for Computing Machinery, New York (2015), pp. 235–238. https://doi.org/10.1145/2740908.2742842.
H. I. Hoherchak, “Knowledge bases and description logics applications to natural language texts analysis,” Problems in Programming, No. 2–3, 259–269 (2020). https://doi.org/10.15407/pp2020.02-03.259.
S. L. Kryvyi, N. P. Darchuk, and O. I. Provotar, “Ontological similar systems for analysis of texts of natural language,” Problems in Programming, No. 2–3, 132–139 (2018).
A. V. Palagin, S. L. Kryvyi, and N. G. Petrenko, ”Knowledge-oriented information systems with the processing of natural language objects: the basis of ethodology, architectural and structural organization,“ Upr. Sis. Mash., No. 3, 42–55 (2009).
A. V. Palagin, S. L. Kryvyi, and N. G. Petrenko, “On the automation of the process of extracting knowledge from natural language texts,” Natural and Artificial Intelligence Intern. Book Series, Inteligent Processing, No. 9, 44–52, ITHEA, Sofia (2012).
A. V. Palagin, S. L. Kryvyi, D. S. Bibikov, “Natural language sentence processing using dictionaries and words frequency,” Natural and Artificial Intelligence Intern. Book Series, Inteligent Processing, No. 9, 44–52, ITHEA, Sofia (2010).
R. McDonald, J. Nivre, Y. Quirmbach-Brundage, Y. Goldberg, D. Das, K. Ganchev, K. Hall, S. Petrov, H. Zhang, O. Tackstrom, C. Bedini, N. B. Castelló, and J. Lee, “Universal dependency annotation for multilingual parsing,” in: Proc. of the 51st Annual Meeting of the Association for Computational Linguistics (Sofia, Bulgaria, August 4–9, 2013), Vol. 2: Short Papers, Association for Computational Linguistics (2013), pp. 92–97.
K. Mrini, F. Dernoncourt, T. Bui, W. Chang, N. Nakashole, “Rethinking self-attention: An interpretable self-attentive encoder-decoder parser,” in: T. Cohn, Y. He, and Y. Liu (eds.), Findings of the Association for Computational Linguistics: EMNLP 2020, Association for Computational Linguistics (2020), pp. 731–742. https://doi.org/10.18653/v1/2020.findings-emnlp.65.
W. Che, Y. Lui, Y. Wang, B. Zheng, and T. Liu, “Towards better UD parsing: Deep contextualized word embeddings, ensemble, and treebank concatenation,” in: Proc. of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies (Brussels, Belgium, October 31 – November 1, 2018), Association for Computational Linguistics (2018), pp. 55–64. https://doi.org/10.18653/v1/K18-2005.
N. Darchuk, “Automatic syntactic analysis of texts from Ukrainian language corpus,” Ukrainian Linguistics, Issue 43, 11–19 (2013).
M. Vilain, J. Burger, J. Aberdeen, D. Connolly, and L. Hirschman, “A model-theoretic coreference scoring scheme,” in: MUC6’95: Proc. of the 6th Message Understanding Conf. (Maryland, USA, November 6–8, 1995), Association for Computational Linguistics (1995), pp. 45–52. https://doi.org/10.3115/1072399.1072405.
V. Stoyanov, N. Gilbert, C. Cardie, and E. Riloff, “Conundrums in noun phrase coreference resolution: Making sense of the state-of-the-art,” in: Proc. of the Joint Conf. of the 47th Annual Meeting of the ACL and the 4th Intern. Joint Conf. on Natural Language Processing of the AFNLP (Singapore, August 2–7, 2009), Association for Computational Linguistics (2009), pp. 656–664. https://doi.org/10.3115/1690219.1690238.
X. Luo, “On coreference resolution performance metrics,” in: HLT’05: Proc. of the Conf. on Human Language Technology and Empirical Methods in Natural Language Processing (Vancouver, Canada, October, 2005), Association for Computational Linguistics (2005), pp. 25–32. https://doi.org/10.3115/1220575.1220579.
J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, “BERT: Pre-training of deep bidirectional transformers for language understanding,” in: Proc. of the 2019 Conf. of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Minneapolis, USA, June 2–7, 2019), Vol. 1 (Long and Short Papers), Association for Computational Linguistics (2019), pp. 4171–4186. https://doi.org/10.18653/v1/N19-1423.
L. Xu and J. D. Choi, “Revealing the myth of higher-order inference in coreference resolution,” in: Proc. of the 2020 Conf. on Empirical Methods in Natural Language Processing (EMNLP) (online, November 16–20, 2020), Association for Computational Linguistics (2020), pp. 8527–8533. https://doi.org/10.18653/v1/2020.emnlp-main.686.
N. V. Lukashevich, Thesauri in Information Retrieval Problems [in Russian], Izd-vo Mosk. Un-ta, Moscow (2011).
Google Scholar

Download references

Author information

Authors and Affiliations

Taras Shevchenko National University of Kyiv, Kyiv, Ukraine
H. Hoherchak, N. Darchuk & S. Kryvyi

Authors

H. Hoherchak
View author publications
You can also search for this author in PubMed Google Scholar
N. Darchuk
View author publications
You can also search for this author in PubMed Google Scholar
S. Kryvyi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to H. Hoherchak.

Additional information

Translated from Kibernetyka ta Systemnyi Analiz, No. 3, May–June, 2021, pp. 164–183.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hoherchak, H., Darchuk, N. & Kryvyi, S. Representation, Analysis, and Extraction of Knowledge from Unstructured Natural Language Texts. Cybern Syst Anal 57, 481–500 (2021). https://doi.org/10.1007/s10559-021-00373-7

Download citation

Received: 10 November 2020
Published: 27 May 2021
Issue Date: May 2021
DOI: https://doi.org/10.1007/s10559-021-00373-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Representation, Analysis, and Extraction of Knowledge from Unstructured Natural Language Texts

Abstract

Access this article

Similar content being viewed by others

Multi-level Modeling of Structural Elements of Natural Language Texts and Its Applications

Extracting and structuring information from the electronic medical text: state of the art and trendy directions

Extracting Schema Knowledge from Text Documents: Part I. Problem Formulation and Method Overview

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Representation, Analysis, and Extraction of Knowledge from Unstructured Natural Language Texts

Abstract

Access this article

Similar content being viewed by others

Multi-level Modeling of Structural Elements of Natural Language Texts and Its Applications

Extracting and structuring information from the electronic medical text: state of the art and trendy directions

Extracting Schema Knowledge from Text Documents: Part I. Problem Formulation and Method Overview

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation