Incremental Induction of Classification Rules for Cultural Heritage Documents

Basile, Teresa M. A.; Ferilli, Stefano; Di Mauro, Nicola; Esposito, Floriana

doi:10.1007/978-3-540-24677-0_94

Teresa M. A. Basile¹⁹,
Stefano Ferilli¹⁹,
Nicola Di Mauro¹⁹ &
…
Floriana Esposito¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3029))

Included in the following conference series:

International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems

1304 Accesses

Abstract

This work presents the application of a first-order logic incremental learning system, INTHELEX, to learn rules for the automatic identification of a wide range of significant document classes and their related components. Specifically, the material includes multi-format cultural heritage documents concerning European films from the 20’s and 30’s provided by the EU project COLLATE. Incrementality plays a key role when the set of documents is continuously augmented. To ensure that there is no performance loss with respect to classical one-step systems, a comparison with Progol was carried out. Experimental results prove that the proposed approach is a viable solution, for both its performance and its effectiveness in the document processing domain.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 74.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Becker, J.M.: Inductive learning of decision rules with exceptions: Methodology and experimentation. B.s. diss., Dept. of Computer Science, University of Illinois at Urbana- Champaign, Urbana, Illinois, USA (1985)
Google Scholar
Dietterich, T.G.: Approximate statistical test for comparing supervised classification learning algorithms. Neural Computation 10(7), 1895–1923 (1998)
Article Google Scholar
Esposito, F., Semeraro, G., Fanizzi, N., Ferilli, S.: Multistrategy Theory Revision: Induction and abduction in INTHELEX. Machine Learning Journal 38(1/2), 133–156 (2000)
Article MATH Google Scholar
Esposito, F., Malerba, D., Lisi, F.A.: Machine learning for intelligent processing of printed documents. Journal of Intelligent Information Systems 14(2/3), 175–198 (2000)
Article Google Scholar
Esposito, F., Fanizzi, N., Ferilli, S., Semeraro, G.: Refining logic theories under oiimplication. In: Ohsuga, S., Raś, Z.W. (eds.) ISMIS 2000. LNCS (LNAI), vol. 1932, pp. 109–118. Springer, Heidelberg (2000)
Chapter Google Scholar
Kouzes, R.T., Myers, J.D., Wulf, W.A.: Collaboratories: Doing science on the internet. IEEE Computer 29(8) (1996)
Google Scholar
Lamma, E., Mello, P., Riguzzi, F., Esposito, F., Ferilli, S., Semeraro, G.: Cooperation of abduction and induction in logic programming. In: Kakas, A.C., Flach, P. (eds.) Abductive and Inductive Reasoning: Essays on their Relation and Integration, Kluwer, Dordrecht (2000)
Google Scholar
Lloyd, J.W.: Foundations of Logic Programming, 2nd edn. Springer, Berlin (1987)
MATH Google Scholar
Michalski, R.S.: Inferential theory of learning. developing foundations for multistrategy learning. In: Michalski, R.S., Tecuci, G. (eds.) Machine Learning. A Multistrategy Approach, vol. IV, pp. 3–61. Morgan Kaufmann, San Mateo (1994)
Google Scholar
Muggleton, S.: Inverse entailment and Progol. New Generation Computing, Special issue on Inductive Logic Programming 13(3-4), 245–286 (1995)
Google Scholar
Semeraro, G., Esposito, F., Malerba, D., Fanizzi, N., Ferilli, S.: A logic framework for the incremental inductive synthesis of Datalog theories. In: Fuchs, N.E. (ed.) LOPSTR 1997. LNCS, vol. 1463, pp. 300–321. Springer, Heidelberg (1998)
Chapter Google Scholar
Semeraro, G., Fanizzi, N., Ferilli, S., Esposito, F.: Document classification and interpretation through the inference of logic-based models. In: Constantopoulos, P., Sølvberg, I.T. (eds.) ECDL 2001. LNCS, vol. 2163, pp. 59–70. Springer, Heidelberg (2001)
Chapter Google Scholar
Zucker, J.D.: Semantic abstraction for concept representation and learning. In: Michalski, R.S., Saitta, L. (eds.) Proceedings of the 4th International Workshop on Multistrategy Learning, Desenzano del Garda, Italy (1998)
Google Scholar

Download references

Author information

Authors and Affiliations

Dipartimento di Informatica, Università degli Studi di Bari, via E. Orabona, 4, 70125, Bari, Italia
Teresa M. A. Basile, Stefano Ferilli, Nicola Di Mauro & Floriana Esposito

Authors

Teresa M. A. Basile
View author publications
You can also search for this author in PubMed Google Scholar
Stefano Ferilli
View author publications
You can also search for this author in PubMed Google Scholar
Nicola Di Mauro
View author publications
You can also search for this author in PubMed Google Scholar
Floriana Esposito
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute for Information Technology, National Research Council of Canada, 1200 Montreal Read, M-50, K1A 0R6, Ottawa, Ontario, Canada
Bob Orchard
Institute for Information Technology, National Research Council, Canada
Chunsheng Yang
Department of Computer Science, Texas State University-San Marcos, Nueces 247, 601 University Drive, TX 78666-4616, San Marcos, USA
Moonis Ali

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Basile, T.M.A., Ferilli, S., Di Mauro, N., Esposito, F. (2004). Incremental Induction of Classification Rules for Cultural Heritage Documents. In: Orchard, B., Yang, C., Ali, M. (eds) Innovations in Applied Artificial Intelligence. IEA/AIE 2004. Lecture Notes in Computer Science(), vol 3029. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24677-0_94

Download citation

DOI: https://doi.org/10.1007/978-3-540-24677-0_94
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22007-7
Online ISBN: 978-3-540-24677-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics