University of Amsterdam at the Visual Concept Detection and Annotation Tasks

van de Sande, Koen E. A.; Gevers, Theo

doi:10.1007/978-3-642-15181-1_18

Koen E. A. van de Sande⁵ &
Theo Gevers⁵

Part of the book series: The Information Retrieval Series ((INRE,volume 32))

988 Accesses
1 Citations

Abstract

Visual concept detection is important to access visual information on the level of objects and scene types. The current state–of–the–art in visual concept detection and annotation tasks is based on the bag–of–words model. Within the bag–of–words model, points are first sampled according to some strategy, then the area around these points are described using color descriptors. These descriptors are then vector–quantized against a codebook of prototypical descriptors, which results in a fixed–length representation of the image. Based on these representations, visual concept models are trained. In this chapter, we discuss the design choices within the bag–of–words model and their implications for concept detection accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Burghouts GJ, Geusebroek JM (2009) Performance evaluation of local color invariants. Computer Vision and Image Understanding 113:48–62
Article Google Scholar
Chang CC, Lin CJ (2001) LIBSVM: a library for support vector machines. Software available at http://www.csie.ntu.edu.tw/ cjlin/libsvm
Everingham M, Van Gool L, Williams C, Winn J, Zisserman A (2010) The pascal visual object classes (voc) challenge. International Journal of Computer Vision 88(2):303–338
Article Google Scholar
Fei-Fei L, Perona P (2005) A bayesian hierarchical model for learning natural scene categories. In: IEEE Conference on Computer Vision and Pattern Recognition, vol 2, pp 524–531
Google Scholar
Geusebroek JM, van den Boomgaard R, Smeulders AWM, Geerts H (2001) Color invariance. IEEE Transactions on Pattern Analysis and Machine Intelligence 23(12):1338–1350
Article Google Scholar
Jurie F, Triggs B (2005) Creating efficient codebooks for visual recognition. In: IEEE International Conference on Computer Vision, pp 604–610
Google Scholar
Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: IEEE Conference on Computer Vision and Pattern Recognition, vol 2, pp 2169–2178
Google Scholar
Leung TK, Malik J (2001) Representing and recognizing the visual appearance of materials using three–dimensional textons. International Journal of Computer Vision 43(1):29–44
Article MATH Google Scholar
Lin HT, Lin CJ, Weng RC (2007) A note on Platt’s probabilistic outputs for support vector machines. Machine Learning 68(3):267–276
Article Google Scholar
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60(2):91–110
Article Google Scholar
Marszałek M, Schmid C, Harzallah H, van de Weijer J (2007) Learning object representations for visual object class recognition. Visual Recognition Challenge workshop, in conjunction with IEEE International Conference on Computer Vision
Google Scholar
Nowak S, Dunker P (2009) Overview of the clef 2009 large scale visual concept detection and annotation task. In: Working notes CLEF 2009, Corfu, Greece
Google Scholar
Van de Sande KEA, Gevers T, Snoek CGM (2008) A comparison of color features for visual concept classification. In: ACM International Conference on Image and Video Retrieval. ACM press, pp 141–150
Google Scholar
Van de Sande KEA, Gevers T, Snoek CGM (2010) Evaluating color descriptors for object and scene recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 32(9)
Google Scholar
Snoek CGM, Worring M (2009) Concept–based video retrieval. Foundations and Trends in Information Retrieval 4(2):215–322
Google Scholar
Snoek CGM, van de Sande KEA, de Rooij O, Huurnink B, van Gemert JC, Uijlings JRR, et al (2008) The MediaMill TRECVID 2008 semantic video search engine. In: Proceedings of the TRECVID Workshop
Google Scholar
Snoek CGM, van de Sande KEA, de Rooij O, Huurnink B, Uijlings JRR, van Liempt M, Bugalho M, Trancoso I, Yan F, Tahir MA, Mikolajczyk K, Kittler J, de Rijke M, Geusebroek JM, Gevers T, Worring M, Koelma DC, Smeulders AWM (2009) The MediaMill TRECVID 2009 semantic video search engine. In: Proceedings of the TRECVID Workshop
Google Scholar
Tuytelaars T, Mikolajczyk K (2008) Local invariant feature detectors: A survey. Foundations and Trends in Computer Graphics and Vision 3(3):177–280
Article Google Scholar
Uijlings JRR, Smeulders AWM, Scha RJH (2009) Real–time bag–of–words, approximately. In: ACM International Conference on Image and Video Retrieval. ACM press
Google Scholar
Van Gemert JC, Veenman CJ, Smeulders AWM, Geusebroek JM (2010) Visual word ambiguity. IEEE Transactions on Pattern Analysis and Machine Intelligence 32(7):1271–1283
Article Google Scholar
Vapnik VN (2000) The nature of statistical learning theory, 2nd edn. Springer
Google Scholar
Wang D, Liu X, Luo L, Li J, Zhang B (2007) Video diver: generic video indexing with diverse features. In: ACM International Workshop on Multimedia Information Retrieval. ACM press, Augsburg, Germany, pp 61–70
Chapter Google Scholar
Zhang J, Marszałek M, Lazebnik S, Schmid C (2007) Local features and kernels for classification of texture and object categories: A comprehensive study. International Journal of Computer Vision 73(2):213–238
Article Google Scholar

Download references

Author information

Authors and Affiliations

University of Amsterdam, Science Park 107, 1098 XG, Amsterdam, The Netherlands
Koen E. A. van de Sande & Theo Gevers

Authors

Koen E. A. van de Sande
View author publications
You can also search for this author in PubMed Google Scholar
Theo Gevers
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Koen E. A. van de Sande .

Editor information

Editors and Affiliations

HES-SO Business Information Systems, TechnoArk 3, Sierre, 3960, Switzerland
Henning Müller
Dept. Information Studies, University of Sheffield, Portobello Street 211, Sheffield, S1 4DP, United Kingdom
Paul Clough
, Computer Vision Lab/ETF-C 113.2, ETH Zürich, Zürich, 8092, Switzerland
Thomas Deselaers
Idiap Research Institute, rue Marconi 19, Martigny, 1920, Switzerland
Barbara Caputo

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

van de Sande, K.E.A., Gevers, T. (2010). University of Amsterdam at the Visual Concept Detection and Annotation Tasks. In: Müller, H., Clough, P., Deselaers, T., Caputo, B. (eds) ImageCLEF. The Information Retrieval Series, vol 32. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15181-1_18

Download citation

DOI: https://doi.org/10.1007/978-3-642-15181-1_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15180-4
Online ISBN: 978-3-642-15181-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics