Abstract
This paper proposes a methodology for building fuzzy multimedia ontologies dedicated to image annotation. The built ontology incorporates visual, conceptual, contextual and spatial knowledge about image concepts in order to model image semantics in an effective way. Indeed, our approach uses visual and conceptual information to build a semantic hierarchy that will serve as a backbone of our ontology. Contextual and spatial information about image concepts are then computed and incorporated in the ontology in order to model richer semantic relationships between these concepts. Fuzzy description logics are used as a formalism to represent our ontology and the inherent uncertainty and imprecision of this kind of information. Subsequently, we propose a new approach for image annotation based on hierarchical image classification and a multi-stage reasoning framework for reasoning about the consistency of the produced annotation. In this approach, fuzzy ontological reasoning is used in order to achieve a semantically relevant decision on the belonging of a given image to the set of concepts from the annotation vocabulary. An empirical evaluation of our approach on Pascal VOC’2009 and Pascal VOC’2010 datasets has shown a significant improvement on the average precision results.
Similar content being viewed by others
Notes
n, m are natural numbers, such that n ≥ 0, m > 0. d is an unary fuzzy domain predicate.
A candidate annotation \(\mathcal{P}\) consists of a set of candidate concepts {\(c_j \in \mathcal C \cup \mathcal C', j=1..n_{i_i}\)} and their confidence values {\(\alpha_j, j=1..n_{i_i}\)}, predicted as describing the image content.
References
Baader F, Calvanese D, McGuinness DL, Nardi D, Patel-Schneider PF (eds) (2003) The description logic handbook: theory, implementation, and applications
Bannour H, Hudelot C (2011) Towards ontologies for image interpretation and annotation. In: Content-based multimedia indexing (CBMI’11)
Bannour H, Hudelot C (2012) Building semantic hierarchies faithful to image semantics. In: International conference on advances in multimedia modeling (MMM’12), pp 4–15
Bannour H, Hudelot C (2012) Hierarchical image annotation using semantic hierarchies. In: Proceedings of the 21st ACM international conference on information and knowledge management (CIKM’12), pp 2431–2434
Barnard K, Duygulu P, Forsyth D, de Freitas N, Blei DM, Jordan MI (2003) Matching words and pictures. J Mach Learn Res 3:1107–1135
Bart E, Porteous I, Perona P, Welling M (2008) Unsupervised learning of visual taxonomies. In: Computer vision and pattern recognition (CVPR)
Bloch I (2005) Fuzzy spatial relationships for image processing and interpretation: a review. Image Vis Comput 23(2):89–110
Bobillo F, Straccia U (2011) Reasoning with the finitely many-valued lukasiewicz fuzzy description logic sroiq. Inform Sci 181(4):758–778
Carneiro G, Chan AB, Moreno PJ, Vasconcelos N (2007) Supervised learning of semantic classes for image annotation and retrieval. IEEE Trans Pattern Anal Mach Intell 29(3):394–410
Choi MJ, Lim J, Torralba A, Willsky A (2010) Exploiting hierarchical context on a large database of object categories. In: Computer vision and pattern recognition (CVPR), pp 129–136
Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297
Dasiopoulou S, Kompatsiaris I, Strintzis M (2009) Applying fuzzy DLs in the extraction of image semantics. In: Spaccapietra S, Delcambre L (eds) Journal on data semantics XIV. Lecture notes in computer science, vol 5880. Springer Berlin, Heidelberg, pp 105–132
Dasiopoulou S, Tzouvaras V, Kompatsiaris I, Strintzis MG (2010) Enquiring mpeg-7 based multimedia ontologies. Multimed Tools Appl 46:331–370
Deng J, Dong W, Socher R, Li LJ, Li K, Fei-Fei L (2009) Imagenet: a large-scale hierarchical image database. In: Computer vision and pattern recognition (CVPR)
Everingham M, Van Gool L, Williams CKI, Winn J, Zisserman A (2009) The PASCAL visual object classes challenge 2009 (VOC2009) results. http://www.pascal-network.org/challenges/VOC/voc2009/workshop/index.html
Everingham M, Van Gool L, Williams CKI, Winn J, Zisserman A (2010) The PASCAL visual object classes challenge 2010 (VOC2010) results. http://www.pascal-network.org/challenges/VOC/voc2010/workshop/index.html
Fan J, Gao Y, Luo H (2008) Integrating concept ontology and multitask learning to achieve more effective classifier training for multilevel image annotation. IEEE Trans Image Process 17(3):407–426
Griffin G, Perona P (2008) Learning and using taxonomies for fast visual categorization. In: Computer vision and pattern recognition (CVPR)
Gruber TR (1995) Toward principles for the design of ontologies used for knowledge sharing. Int J Hum-Comput Stud 43(5):907–928
Gupta A, Mannem P (2012) From image annotation to image description. Neural Inf Process 7667:196–204
Hauptmann A, Yan R, Lin WH (2007) How many high-level concepts will fill the semantic gap in news video retrieval? In: International conference on image and video retrieval (CIVR)
Hollink L, Nguyen G, Schreiber G, Wielemaker J, Wielinga B, Worring M (2004) Adding spatial semantics to image annotations. In: International workshop on knowledge markup and semantic annotation
Horridge M, Bechhofer S (2011) The owl api: a java api for owl ontologies. Semant Web 2(1):11–21
Hudelot C, Atif J, Bloch I (2008) Fuzzy spatial relation ontology for image interpretation. Fuzzy Set Syst 159:1929–1951
Hudelot C, Atif J, Bloch I (2010) Integrating bipolar fuzzy mathematical morphology in description logics for spatial reasoning. In: European conference on artificial intelligence (ECAI), pp 497–502
Kompatsiaris Y, Hobson P (2008) Semantic multimedia and ontologies: theory and applications. Springer
Lavrenko V, Manmatha R, Jeon J (2003) A model for learning the semantics of pictures. In: Neural information processing systems. MIT, Cambridge
Li FF, Perona P (2005) A bayesian hierarchical model for learning natural scene categories. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR’05), vol 2. Washington, DC, USA, pp 524–531
Li LJ, Wang C, Lim Y, Blei DM, Li FF (2010) Building and using a semantivisual image hierarchy. In: Computer vision and pattern recognition (CVPR)
Liu Y, Zhang D, Lu G, Ma WY (2007) A survey of content-based image retrieval with high-level semantics. Pattern Recogn 40(1):262–282
Lowe DG (1999) Object recognition from local scale-invariant features. In: International conference on computer vision (ICCV)
Marszalek M, Schmid C (2007) Semantic hierarchies for visual object recognition. In: Computer vision and pattern recognition (CVPR)
Simou N, Tzouvaras V, Avrithis Y, Stamou G, Kollias S (2005) A visual descriptor ontology for multimedia reasoning. In: WIAMIS
Simou N, Athanasiadis T, Stoilos G, Kollias SD (2008) Image indexing and retrieval using expressive fuzzy description logics. Signal Image Video Process 2(4):321–335
Smeulders AWM, Worring M, Santini S, Gupta A, Jain R (2000) Content-based image retrieval at the end of the early years. IEEE Trans Pattern Anal Mach Intell 22(12):1349–1380
Spaccapietra S, Cullot N, Parent C, Vangenot C (2004) On spatial ontologies. In: Brazilian symposium on geoinformatics
Stoilos G, Stamou GB (2007) Extending fuzzy description logics for the semantic web. In: Workshop on OWL: experiences and directions (OWLED)
Straccia U (2001) Reasoning within fuzzy description logics. J Artif Intell Res 14:137–166
Straccia U (2006) A fuzzy description logic for the semantic web. In: Sanchez E (ed) Fuzzy logic and the semantic web. Capturing intelligence, vol 1. Elsevier, pp 73–90
Straccia U (2010) An ontology mediated multimedia information retrieval system. In: Multiple-valued logic (ISMVL), pp 319–324
Straccia U (2012) Description logics with fuzzy concrete domains. In: Computing research repository (CoRR). arXiv:abs/1207.1410
Tousch AM, Herbin S, Audibert JY (2012) Semantic hierarchies for image annotation: a survey. Pattern Recogn 45(1):333–345
Wu L, Hua XS, Yu N, Ma WY, Li S (2012) Flickr distance: a relationship measure for visual concepts. IEEE Trans Pattern Anal Mach Intell 34(5):863 –875
Xiao J, Hays J, Ehinger KA, Oliva A, Torralba A (2010) Sun database: large-scale scene recognition from abbey to zoo. In: Computer vision and pattern recognition (CVPR). IEEE, pp 3485–3492
Yang J, Yu K, Huang T (2010) Efficient highly over-complete sparse coding using a mixture model. In: Proceedings of the 11th European conference on computer vision: part V, ECCV’10, pp 113–126
Yao B, Yang X, Lin L, Lee MW, Zhu SC (2010) I2t: Image parsing to text description. Proc IEEE 98(8):1485–1508
Zhou X, Yu K, Zhang T, Huang T (2010) Image classification using super-vector coding of local image descriptors. In: European conference on computer vision (ECCV)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Bannour, H., Hudelot, C. Building and using fuzzy multimedia ontologies for semantic image annotation. Multimed Tools Appl 72, 2107–2141 (2014). https://doi.org/10.1007/s11042-013-1491-z
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-013-1491-z