ABSTRACT
Information extraction (IE) refers to the task of turning text documents into a structured form, in order to make the information contained therein automatically processable. Ontology Mediated Information Extraction (OMIE) is a new paradigm for IE that seeks to exploit the semantic knowledge expressed in ontologies to improve query answering over unstructured data (properly raw text). In this paper we present Mastro System-T, an OMIE tool born from a joint collaboration between the University of Rome "La Sapienza" and IBM Research Almaden and its first application in a financial domain, namely to facilitate the access to and the sharing of data extracted from the EDGAR system.
- Diego Calvanese, Giuseppe De Giacomo, Domenico Lembo, Maurizio Lenzerini, and Riccardo Rosati. 2007. Tractable reasoning and efficient query answering in description logics: The DL-Lite family. Journal of Automated reasoning 39, 3 (2007), 385--429.Google ScholarDigital Library
- Laura Chiticariu, Rajasekar Krishnamurthy, Yunyao Li, Sriram Raghavan, Frederick R Reiss, and Shivakumar Vaithyanathan. 2010. SystemT: an algebraic approach to declarative information extraction. In Proc. of the 48th Annual Meeting of the Association for Computational Linguistics (ACL). Association for Computational Linguistics, 128--137.Google Scholar
- Jim Cowie and Wendy Lehnert. 1996. Information extraction. Commun. ACM 39, 1 (1996), 80--91.Google ScholarDigital Library
- Souripriya Das, Seema Sundara, and Richard Cyganiak. 2012. R2RML: RDB to RDF Mapping Language. W3C Recommendation. W3C. Available at http://www.w3.org/TR/r2rml/.Google Scholar
- Giuseppe De Giacomo, Domenico Lembo, Maurizio Lenzerini, Antonella Poggi, Riccardo Rosati, Marco Ruzzi, and Domenico Fabio Savo. 2012. MASTRO: A Reasoner for Effective Ontology-Based Data Access. In Proc. of the 1st Int. Workshop on OWL Reasoner Evaluation (ORE).Google Scholar
- Ronald Fagin, Benny Kimelfeld, Frederick Reiss, and Stijn Vansummeren. 2015. Document spanners: A formal approach to information extraction. J. ACM 62, 2 (2015), 1--51.Google ScholarDigital Library
- Dayne Freitag. 2000. Machine learning for information extraction in informal domains. Machine learning 39, 2-3 (2000), 169--202.Google Scholar
- Giulio Ganino, Domenico Lembo, Massimo Mecella, and Federico Scafoglieri. 2018. Ontology population for open-source intelligence: A GATE-based solution. Software: Practice and Experience 48, 12 (2018), 2302--2330.Google ScholarCross Ref
- Tom Gruber. 2018. Ontology. In Encyclopedia of Database Systems, Second Edition. Springer.Google Scholar
- Steve Harris and Andy Seaborne. 2013. SPARQL 1.1 Query Language. W3C Recommendation. W3C. Available at http://www.w3.org/TR/sparql11-query.Google Scholar
- Alexander Hogenboom, Frederik Hogenboom, Flavius Frasincar, Kim Schouten, and Otto Van Der Meer. 2013. Semantics-based information extraction for detecting economic events. Multimedia Tools and Applications 64, 1 (2013), 27--52.Google ScholarDigital Library
- Rajasekar Krishnamurthy, Yunyao Li, Sriram Raghavan, Frederick Reiss, Shivakumar Vaithyanathan, and Huaiyu Zhu. 2009. SystemT: a system for declarative information extraction. ACM SIGMOD Record 37, 4 (2009), 7--13.Google ScholarDigital Library
- Domenico Lembo, Daniele Pantaleone, Valerio Santarelli, and Domenico Fabio Savo. 2016. Easy OWL drawing with the graphol visual ontology language. In Proc. of the 15th Int. Conf. on Principles of Knowledge Representation and Reasoning (KR). 573--576.Google Scholar
- Domenico Lembo, Daniele Pantaleone, Valerio Santarelli, and Domenico Fabio Savo. 2018. Drawing OWL 2 ontologies with Eddy the editor. AI Commun. 31, 1 (2018), 97--113.Google ScholarDigital Library
- Domenico Lembo and Federico Maria Scafoglieri. 2020. Ontology-based Document Spanning Systems for Information Extraction. Int. Journal of Semantic Computing (2020).Google Scholar
- Deborah L McGuinness, Frank Van Harmelen, et al. 2004. OWL web ontology language overview. W3C Recommendation 10, 10 (2004), 2004.Google Scholar
- Boris Motik, Achille Fokoue, Ian Horrocks, Zhe Wu, Carsten Lutz, and Bernardo Cuenca Grau. 2009. OWLWeb Ontology Language Profiles. W3C Recommendation. W3C. Available at http://www.w3.org/TR/owl-profiles/.Google Scholar
- Boris Motik, Bijan Parsia, and Peter F. Patel-Schneider. 2012. OWL 2 Web Ontology Language Structural Specification and Functional-Style Syntax (Second Edition). W3C Recommendation. W3C. Available at http://www.w3.org/TR/owl2-syntax/.Google Scholar
- Antonella Poggi, Domenico Lembo, Diego Calvanese, Giuseppe De Giacomo, Maurizio Lenzerini, and Riccardo Rosati. 2008. Linking Data to Ontologies. Journal on Data Semantics X (2008), 133--173.Google Scholar
- Borislav Popov, Atanas Kiryakov, Damyan Ognyanoff, Dimitar Manov, Angel Kirilov, and Miroslav Goranov. 2003. Towards semantic web information extraction. In Proc. of the Human Language Technologies Workshop at ISWC 2003, Vol. 20.Google Scholar
- Frederick Reiss, Sriram Raghavan, Rajasekar Krishnamurthy, Huaiyu Zhu, and Shivakumar Vaithyanathan. 2008. An algebraic approach to rule-based information extraction. In 2008 IEEE 24th Int. Conf. on Data Engineering. IEEE, 933--942.Google ScholarDigital Library
- Horacio Saggion, Adam Funk, Diana Maynard, and Kalina Bontcheva. 2007. Ontology-Based Information Extraction for Business Intelligence. In Proc. of the 6th Int. Semantic Web Conf. and the, 2nd Asian Semantic Web Conf. (ISWC + ASWC). 843--856.Google ScholarDigital Library
- Federico Maria Scafoglieri and Domenico Lembo. 2019. A formal framework for coupling document spanners with ontologies. In 2019 IEEE 2nd Int. Conf. on Artificial Intelligence and Knowledge Engineering (AIKE). IEEE, 155--162.Google ScholarCross Ref
- Michael K. Smith, Chris Welty, and Deborah L. McGuiness. 2004. OWL Web Ontology Language Guide. W3C Recommendation. W3C. Available at http://www.w3.org/TR/owl-guide/.Google Scholar
- Guohui Xiao, Diego Calvanese, Roman Kontchakov, Domenico Lembo, Antonella Poggi, Riccardo Rosati, and Michael Zakharyaschev. 2018. Ontology-based data access: A survey. In Proc. of the 27th Int. Joint Conf. on Artificial Intelligence (IJCAI). 5511--5519.Google ScholarCross Ref
Index Terms
- Ontology mediated information extraction in financial domain with Mastro System-T
Recommendations
Ontology View Extraction: An Approach Based on Ontological Meta-properties
ICTAI '14: Proceedings of the 2014 IEEE 26th International Conference on Tools with Artificial IntelligenceOntologies have been applied in Computer Science to ensure the semantic interoperability among multiple systems. With the increasing of ontologies availability, many approaches for promoting the share and reuse of ontologies have been investigated in ...
Ontology-based information extraction and information retrieval in health care domain
DaWaK'07: Proceedings of the 9th international conference on Data Warehousing and Knowledge DiscoveryOntology-based information extraction is considered as an effective method to improve the performance of information extraction (IE) systems. In order to build a better IE system using ontology-based technique, the two challenges should be most taken ...
Towards a System for Ontology-Based Information Extraction from PDF Documents
OTM '08: Proceedings of the OTM 2008 Confederated International Conferences, CoopIS, DOA, GADA, IS, and ODBASE 2008. Part II on On the Move to Meaningful Internet SystemsOntologies enable to directly encode domain knowledge in software applications, so ontology-based systems can exploit the meaning of information for providing advanced and intelligent functionalities. One of the most interesting and promising ...
Comments