A Fast Visual Word Frequency - Inverse Image Frequency for Detector of Rare Concepts

Dumont, Emilie; Glotin, Hervé; Paris, Sébastien; Zhao, Zhong-Qiu

doi:10.1007/978-3-642-15751-6_39

A Fast Visual Word Frequency - Inverse Image Frequency for Detector of Rare Concepts

Emilie Dumont^23,24,
Hervé Glotin^23,24,
Sébastien Paris²³ &
…
Zhong-Qiu Zhao²⁵

Conference paper

493 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6242))

Abstract

In this paper we propose an original image retrieval model inspired from the vector space information retrieval model. We build for different features and different scales a visual concept dictionary composed by visual words intended to represent a semantic concept, and then we represent an image by the frequency of the visual words within the image. Then the image similarity is computed as in the textual domain where a textual document is represented by a vector in which each component is the frequency of occurrence of a specific textual word in that document. We then adapt the common text-based paradigm by using the TF-IDF weighting scheme to construct a WF-IIF weighting scheme in our Multi-Scale Visual Dictionary (MSVD) vector space model.

The experiments are conducted on the 2009 Visual Concept Detection ImageCLEF Campaign. We compare WF-IIF to usual direct Support-Vector Machine (SVM) algorithm. We demonstrate that SVM and WF-IIF are in average over all the concept giving the same Area Under the Curve (AUC). We then discuss the fusion process that should enhance the whole system, and of some particular properties of MSVD, that shall be less dependant of the training set size of each concept than the SVM.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Picard, R.W.: Toward a visual thesaurus. In: Springer Werlag Workshops in Computing, MIRO (1995)
Google Scholar
Picard, R.W.: A society of models for video and image libraries (1996)
Google Scholar
Zhang, R., Zhang, Z.M.: Hidden semantic concept discovery in region based image retrieval. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 996–1001 (2004)
Google Scholar
Lim, J.H.: Categorizing visual contents by matching visual “keywords”. In: Huijsmans, D.P., Smeulders, A.W.M. (eds.) VISUAL 1999. LNCS, vol. 1614, pp. 367–374. Springer, Heidelberg (1999)
Chapter Google Scholar
Fauqueur, J., Boujemaa, N.: Mental image search by boolean composition of region categories. In: Multimedia Tools and Applications, pp. 95–117 (2004)
Google Scholar
Souvannavong, F., Hohl, L., Mérialdo, B., Huet, B.: Enhancing latent semantic analysis video object retrieval with structural information. In: IEEE International Conference on Image Processing, ICIP 2004, Singapore, October 24-27 (2004)
Google Scholar
Salton, G., Mcgill, M.J.: Introduction to Modern Information Retrieval. McGraw-Hill, Inc., New York (1986)
Google Scholar
Mitchell, T.: Machine Learning (October 1997)
Google Scholar
Seymore, K., Chen, S., Rosenfeld, R., Chen, S., Rosenfeld, R.: Nonlinear interpolation of topic models for language model adaptation. In: Proceedings of ICSLP-1998, vol. 6, pp. 2503–2506 (1998)
Google Scholar
Jensen, R., Shen, Q.: Fuzzy-rough data reduction with ant colony optimization. Fuzzy Sets and Systems (March 2004)
Google Scholar
Nowak, S., Dunker, P.: Overview of the CLEF 2009 large-scale visual concept detection and annotation task. In: Peters, C., et al. (eds.) CLEF 2009 Workshop, Part II. LNCS, vol. 6242, pp. 94–109. Springer, Heidelberg (2010)
Google Scholar
Smach, F., Lemaître, C., Gauthier, J.P., Miteran, J., Atri, M.: Generalized fourier descriptors with applications to objects recognition in svm context. J. Math. Imaging Vis. 30(1), 43–71 (2008)
Article Google Scholar
Glotin, H., Zhao, Z., Ayache, S.: Efficient image concept indexing by harmonic and arithmetic profiles entropy. In: Proceedings of 2009 IEEE International Conference on Image Processing (ICIP 2009), Cairo, Egypt, November 7-11 (2009)
Google Scholar
Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines (2001), http://www.csie.ntu.edu.tw/~cjlin/libsvm
van de Sande, K., Gevers, T., Smeulders, A.: The university of Amsterdam’s concept detection system at imageCLEF 2009. In: Peters, C., et al. (eds.) CLEF 2009 Workshop, Part II. LNCS, vol. 6242, pp. 261–268. Springer, Heidelberg (2010)
Google Scholar

Download references

Author information

Authors and Affiliations

Sciences and Information Lab. LSIS UMR CNRS 6168, France
Emilie Dumont, Hervé Glotin & Sébastien Paris
University of Sud Toulon-Var, France
Emilie Dumont & Hervé Glotin
College of Computer Science and Information Engineering, Hefei University of Technology, China
Zhong-Qiu Zhao

Authors

Emilie Dumont
View author publications
You can also search for this author in PubMed Google Scholar
Hervé Glotin
View author publications
You can also search for this author in PubMed Google Scholar
Sébastien Paris
View author publications
You can also search for this author in PubMed Google Scholar
Zhong-Qiu Zhao
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

ISTI-CNR, Area Ricerca CNR, Via Moruzzi, 1, 56124, Pisa, Italy
Carol Peters
Idiap Research Institute, Rue Marconi 19, 1920, Martigny, Switzerland
Barbara Caputo
LSI-UNED, Juan del Rosal, 16, 28040, Madrid, Spain
Julio Gonzalo
Centre for Digital Video Processing, School of Computing, Dublin City University, Dublin 9, Ireland
Gareth J. F. Jones
Oregon Health and Science University, 3181 SW Sam Jackson Park Road, 97239-3098, Portland, OR, USA
Jayashree Kalpathy-Cramer
University of Applied Sciences Western Switzerland, TechnoArk 3, 3960, Sierre, Switzerland
Henning Müller
Centrum Wiskunde and Infoormatica, Science Park 123, 1098, Amsterdam, XG, The Netherlands
Theodora Tsikrika

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dumont, E., Glotin, H., Paris, S., Zhao, ZQ. (2010). A Fast Visual Word Frequency - Inverse Image Frequency for Detector of Rare Concepts. In: Peters, C., et al. Multilingual Information Access Evaluation II. Multimedia Experiments. CLEF 2009. Lecture Notes in Computer Science, vol 6242. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15751-6_39

Download citation

DOI: https://doi.org/10.1007/978-3-642-15751-6_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15750-9
Online ISBN: 978-3-642-15751-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics