Skip to main content

A Fast Visual Word Frequency - Inverse Image Frequency for Detector of Rare Concepts

  • Conference paper
  • 493 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6242))

Abstract

In this paper we propose an original image retrieval model inspired from the vector space information retrieval model. We build for different features and different scales a visual concept dictionary composed by visual words intended to represent a semantic concept, and then we represent an image by the frequency of the visual words within the image. Then the image similarity is computed as in the textual domain where a textual document is represented by a vector in which each component is the frequency of occurrence of a specific textual word in that document. We then adapt the common text-based paradigm by using the TF-IDF weighting scheme to construct a WF-IIF weighting scheme in our Multi-Scale Visual Dictionary (MSVD) vector space model.

The experiments are conducted on the 2009 Visual Concept Detection ImageCLEF Campaign. We compare WF-IIF to usual direct Support-Vector Machine (SVM) algorithm. We demonstrate that SVM and WF-IIF are in average over all the concept giving the same Area Under the Curve (AUC). We then discuss the fusion process that should enhance the whole system, and of some particular properties of MSVD, that shall be less dependant of the training set size of each concept than the SVM.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Picard, R.W.: Toward a visual thesaurus. In: Springer Werlag Workshops in Computing, MIRO (1995)

    Google Scholar 

  2. Picard, R.W.: A society of models for video and image libraries (1996)

    Google Scholar 

  3. Zhang, R., Zhang, Z.M.: Hidden semantic concept discovery in region based image retrieval. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 996–1001 (2004)

    Google Scholar 

  4. Lim, J.H.: Categorizing visual contents by matching visual “keywords”. In: Huijsmans, D.P., Smeulders, A.W.M. (eds.) VISUAL 1999. LNCS, vol. 1614, pp. 367–374. Springer, Heidelberg (1999)

    Chapter  Google Scholar 

  5. Fauqueur, J., Boujemaa, N.: Mental image search by boolean composition of region categories. In: Multimedia Tools and Applications, pp. 95–117 (2004)

    Google Scholar 

  6. Souvannavong, F., Hohl, L., Mérialdo, B., Huet, B.: Enhancing latent semantic analysis video object retrieval with structural information. In: IEEE International Conference on Image Processing, ICIP 2004, Singapore, October 24-27 (2004)

    Google Scholar 

  7. Salton, G., Mcgill, M.J.: Introduction to Modern Information Retrieval. McGraw-Hill, Inc., New York (1986)

    Google Scholar 

  8. Mitchell, T.: Machine Learning (October 1997)

    Google Scholar 

  9. Seymore, K., Chen, S., Rosenfeld, R., Chen, S., Rosenfeld, R.: Nonlinear interpolation of topic models for language model adaptation. In: Proceedings of ICSLP-1998, vol. 6, pp. 2503–2506 (1998)

    Google Scholar 

  10. Jensen, R., Shen, Q.: Fuzzy-rough data reduction with ant colony optimization. Fuzzy Sets and Systems (March 2004)

    Google Scholar 

  11. Nowak, S., Dunker, P.: Overview of the CLEF 2009 large-scale visual concept detection and annotation task. In: Peters, C., et al. (eds.) CLEF 2009 Workshop, Part II. LNCS, vol. 6242, pp. 94–109. Springer, Heidelberg (2010)

    Google Scholar 

  12. Smach, F., Lemaître, C., Gauthier, J.P., Miteran, J., Atri, M.: Generalized fourier descriptors with applications to objects recognition in svm context. J. Math. Imaging Vis. 30(1), 43–71 (2008)

    Article  Google Scholar 

  13. Glotin, H., Zhao, Z., Ayache, S.: Efficient image concept indexing by harmonic and arithmetic profiles entropy. In: Proceedings of 2009 IEEE International Conference on Image Processing (ICIP 2009), Cairo, Egypt, November 7-11 (2009)

    Google Scholar 

  14. Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines (2001), http://www.csie.ntu.edu.tw/~cjlin/libsvm

  15. van de Sande, K., Gevers, T., Smeulders, A.: The university of Amsterdam’s concept detection system at imageCLEF 2009. In: Peters, C., et al. (eds.) CLEF 2009 Workshop, Part II. LNCS, vol. 6242, pp. 261–268. Springer, Heidelberg (2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Dumont, E., Glotin, H., Paris, S., Zhao, ZQ. (2010). A Fast Visual Word Frequency - Inverse Image Frequency for Detector of Rare Concepts. In: Peters, C., et al. Multilingual Information Access Evaluation II. Multimedia Experiments. CLEF 2009. Lecture Notes in Computer Science, vol 6242. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15751-6_39

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-15751-6_39

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-15750-9

  • Online ISBN: 978-3-642-15751-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics