Abstract
This work presents the application of a first-order logic incremental learning system, INTHELEX, to learn rules for the automatic identification of a wide range of significant document classes and their related components. Specifically, the material includes multi-format cultural heritage documents concerning European films from the 20’s and 30’s provided by the EU project COLLATE. Incrementality plays a key role when the set of documents is continuously augmented. To ensure that there is no performance loss with respect to classical one-step systems, a comparison with Progol was carried out. Experimental results prove that the proposed approach is a viable solution, for both its performance and its effectiveness in the document processing domain.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Becker, J.M.: Inductive learning of decision rules with exceptions: Methodology and experimentation. B.s. diss., Dept. of Computer Science, University of Illinois at Urbana- Champaign, Urbana, Illinois, USA (1985)
Dietterich, T.G.: Approximate statistical test for comparing supervised classification learning algorithms. Neural Computation 10(7), 1895–1923 (1998)
Esposito, F., Semeraro, G., Fanizzi, N., Ferilli, S.: Multistrategy Theory Revision: Induction and abduction in INTHELEX. Machine Learning Journal 38(1/2), 133–156 (2000)
Esposito, F., Malerba, D., Lisi, F.A.: Machine learning for intelligent processing of printed documents. Journal of Intelligent Information Systems 14(2/3), 175–198 (2000)
Esposito, F., Fanizzi, N., Ferilli, S., Semeraro, G.: Refining logic theories under oiimplication. In: Ohsuga, S., Raś, Z.W. (eds.) ISMIS 2000. LNCS (LNAI), vol. 1932, pp. 109–118. Springer, Heidelberg (2000)
Kouzes, R.T., Myers, J.D., Wulf, W.A.: Collaboratories: Doing science on the internet. IEEE Computer 29(8) (1996)
Lamma, E., Mello, P., Riguzzi, F., Esposito, F., Ferilli, S., Semeraro, G.: Cooperation of abduction and induction in logic programming. In: Kakas, A.C., Flach, P. (eds.) Abductive and Inductive Reasoning: Essays on their Relation and Integration, Kluwer, Dordrecht (2000)
Lloyd, J.W.: Foundations of Logic Programming, 2nd edn. Springer, Berlin (1987)
Michalski, R.S.: Inferential theory of learning. developing foundations for multistrategy learning. In: Michalski, R.S., Tecuci, G. (eds.) Machine Learning. A Multistrategy Approach, vol. IV, pp. 3–61. Morgan Kaufmann, San Mateo (1994)
Muggleton, S.: Inverse entailment and Progol. New Generation Computing, Special issue on Inductive Logic Programming 13(3-4), 245–286 (1995)
Semeraro, G., Esposito, F., Malerba, D., Fanizzi, N., Ferilli, S.: A logic framework for the incremental inductive synthesis of Datalog theories. In: Fuchs, N.E. (ed.) LOPSTR 1997. LNCS, vol. 1463, pp. 300–321. Springer, Heidelberg (1998)
Semeraro, G., Fanizzi, N., Ferilli, S., Esposito, F.: Document classification and interpretation through the inference of logic-based models. In: Constantopoulos, P., Sølvberg, I.T. (eds.) ECDL 2001. LNCS, vol. 2163, pp. 59–70. Springer, Heidelberg (2001)
Zucker, J.D.: Semantic abstraction for concept representation and learning. In: Michalski, R.S., Saitta, L. (eds.) Proceedings of the 4th International Workshop on Multistrategy Learning, Desenzano del Garda, Italy (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Basile, T.M.A., Ferilli, S., Di Mauro, N., Esposito, F. (2004). Incremental Induction of Classification Rules for Cultural Heritage Documents. In: Orchard, B., Yang, C., Ali, M. (eds) Innovations in Applied Artificial Intelligence. IEA/AIE 2004. Lecture Notes in Computer Science(), vol 3029. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24677-0_94
Download citation
DOI: https://doi.org/10.1007/978-3-540-24677-0_94
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22007-7
Online ISBN: 978-3-540-24677-0
eBook Packages: Springer Book Archive