Metric Learning for Large Scale Image Classification: Generalizing to New Classes at Near-Zero Cost

Mensink, Thomas; Verbeek, Jakob; Perronnin, Florent; Csurka, Gabriela

doi:10.1007/978-3-642-33709-3_35

Thomas Mensink^21,22,
Jakob Verbeek²¹,
Florent Perronnin²² &
…
Gabriela Csurka²²

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7573))

Included in the following conference series:

European Conference on Computer Vision

12k Accesses
100 Citations

Abstract

We are interested in large-scale image classification and especially in the setting where images corresponding to new or existing classes are continuously added to the training set. Our goal is to devise classifiers which can incorporate such images and classes on-the-fly at (near) zero cost. We cast this problem into one of learning a metric which is shared across all classes and explore k-nearest neighbor (k-NN) and nearest class mean (NCM) classifiers. We learn metrics on the ImageNet 2010 challenge data set, which contains more than 1.2M training images of 1K classes. Surprisingly, the NCM classifier compares favorably to the more flexible k-NN classifier, and has comparable performance to linear SVMs. We also study the generalization performance, among others by using the learned metric on the ImageNet-10K dataset, and we obtain competitive performance. Finally, we explore zero-shot classification, and show how the zero-shot model can be combined very effectively with small training datasets.

Download to read the full chapter text

Chapter PDF

Large Scale Metric Learning for Distance-Based Image Classification on Open Ended Data Sets

A Study on Metric-Based and Initialization-Based Methods for Few-Shot Image Classification

An Empirical Study and Analysis of Generalized Zero-Shot Learning for Object Recognition in the Wild

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: ImageNet: A large-scale hierarchical image database. In: CVPR (2009)
Google Scholar
Checkik, G., Sharma, V., Shalit, U., Bengio, S.: Large scale online learning of image similarity through ranking. Journal of Machine Learning Research 11, 1109–1135 (2010)
Google Scholar
Deng, J., Berg, A.C., Li, K., Fei-Fei, L.: What Does Classifying More Than 10,000 Image Categories Tell Us? In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part V. LNCS, vol. 6315, pp. 71–84. Springer, Heidelberg (2010)
Chapter Google Scholar
Vedaldi, A., Zisserman, A.: Efficient additive kernels via explicit feature maps. In: CVPR (2010)
Google Scholar
Rohrbach, M., Stark, M., Schiele, B.: Evaluating knowledge transfer and zero-shot learning in a large-scale setting. In: CVPR (2011)
Google Scholar
Jégou, H., Douze, M., Schmid, C.: Product quantization for nearest neighbor search. IEEE Trans. PAMI 33, 117–128 (2011)
Article Google Scholar
Weston, J., Bengio, S., Usunier, N.: WSABIE: Scaling up to large vocabulary image annotation. In: IJCAI (2011)
Google Scholar
Sánchez, J., Perronnin, F.: High-dimensional signature compression for large-scale image classification. In: CVPR (2011)
Google Scholar
Jégou, H., Perronnin, F., Douze, M., Sánchez, J., Pérez, P., Schmid, C.: Aggregating local image descriptors into compact codes. IEEE Trans. PAMI (to appear, 2012)
Google Scholar
Lin, Y., Lv, F., Zhu, S., Yang, M., Cour, T., Yu, K., Cao, L., Huang, T.: Large-scale image classification: Fast feature extraction and SVM training. In: CVPR (2011)
Google Scholar
Perronnin, F., Akata, Z., Harchaoui, Z., Schmid, C.: Towards good practice in large-scale learning for image classification. In: CVPR (2012)
Google Scholar
Guillaumin, M., Mensink, T., Verbeek, J., Schmid, C.: Tagprop: Discriminative metric learning in nearest neighbor models for image auto-annotation. In: ICCV (2009)
Google Scholar
Webb, A.R.: Statistical pattern recognition. Wiley, New-York (2002)
Book MATH Google Scholar
Veenman, C., Tax, D.: LESS: a model-based classifier for sparse subspaces. IEEE Trans. PAMI 27, 1496–1500 (2005)
Article Google Scholar
Zhou, X., Zhang, X., Yan, Z., Chang, S.-F., Hasegawa-Johnson, M., Huang, T.: SIFT-Bag kernel for video event analysis. In: ACM Multimedia (2008)
Google Scholar
Weinberger, K., Saul, L.: Distance metric learning for large margin nearest neighbor classification. Journal of Machine Learning Research 10, 207–244 (2009)
MATH Google Scholar
Bottou, L.: Large-scale machine learning with stochastic gradient descent. In: COMPSTAT (2010)
Google Scholar
Gray, R., Neuhoff, D.: Quantization. IEEE Trans. Information Theory 44, 2325–2383 (1998)
Article MATH MathSciNet Google Scholar
Perronnin, F., Sánchez, J., Mensink, T.: Improving the Fisher Kernel for Large-Scale Image Classification. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. LNCS, vol. 6314, pp. 143–156. Springer, Heidelberg (2010)
Chapter Google Scholar
Zhang, J., Marszałek, M., Lazebnik, S., Schmid, C.: Local features and kernels for classification of texture and object categories: a comprehensive study. IJCV 73, 213–238 (2007)
Article Google Scholar
Nowak, E., Jurie, F.: Learning visual similarity measures for comparing never seen objects. In: CVPR (2007)
Google Scholar
Chai, J., Liua, H., Chenb, B., Baoa, Z.: Large margin nearest local mean classifier. Signal Processing 90, 236–248 (2010)
Article MATH Google Scholar
Fei-Fei, L., Fergus, R., Perona, P.: One-shot learning of object categories. IEEE Trans. PAMI 28, 594–611 (2006)
Article Google Scholar
Lampert, C., Nickisch, H., Harmeling, S.: Learning to detect unseen object classes by between-class attribute transfer. In: CVPR (2009)
Google Scholar
Tommasi, T., Caputo, B.: The more you know, the less you learn: from knowledge transfer to one-shot learning of object categories. In: BMVC (2009)
Google Scholar
Larochelle, H., Erhan, D., Bengio, Y.: Zero-data learning of new tasks. In: AAAI Conference on Artificial Intelligence (2008)
Google Scholar
Bai, B., Weston, J., Grangier, D., Collobert, R., Qi, Y., Sadamasa, K., Chapelle, O., Weinberger, K.: Learning to rank with (a lot of) word features. Information Retrieval – Special Issue on Learning to Rank 13, 291–314 (2010)
Article Google Scholar
Gauvain, J.L., Lee, C.H.: Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains. IEEE Trans. Speech and Audio Proc. 2, 291–298 (1994)
Article Google Scholar

Download references

Author information

Authors and Affiliations

LEAR, INRIA Grenoble, 655 Avenue de l’Europe, 38330, Montbonnot, France
Thomas Mensink & Jakob Verbeek
TVPA, Xerox Research Centre Europe, 6 chemin de Maupertuis, 38240, Meylan, France
Thomas Mensink, Florent Perronnin & Gabriela Csurka

Authors

Thomas Mensink
View author publications
You can also search for this author in PubMed Google Scholar
Jakob Verbeek
View author publications
You can also search for this author in PubMed Google Scholar
Florent Perronnin
View author publications
You can also search for this author in PubMed Google Scholar
Gabriela Csurka
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Microsoft Research Ltd, CB3 0FB, Cambridge, UK
Andrew Fitzgibbon
Dept. of Computer Science, University of North Carolina, 27599, Chapel Hill, NC, USA
Svetlana Lazebnik
California Institute of Technology, 91125, Pasadena, CA, USA
Pietro Perona
Institute of Industrial Science, The University of Tokyo, 153-8505, Tokyo, Japan
Yoichi Sato
INRIA, 38330, Montbonnot, France
Cordelia Schmid

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mensink, T., Verbeek, J., Perronnin, F., Csurka, G. (2012). Metric Learning for Large Scale Image Classification: Generalizing to New Classes at Near-Zero Cost. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds) Computer Vision – ECCV 2012. ECCV 2012. Lecture Notes in Computer Science, vol 7573. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33709-3_35

Download citation

DOI: https://doi.org/10.1007/978-3-642-33709-3_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33708-6
Online ISBN: 978-3-642-33709-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Metric Learning for Large Scale Image Classification: Generalizing to New Classes at Near-Zero Cost

Abstract

Chapter PDF

Similar content being viewed by others

Large Scale Metric Learning for Distance-Based Image Classification on Open Ended Data Sets

A Study on Metric-Based and Initialization-Based Methods for Few-Shot Image Classification

An Empirical Study and Analysis of Generalized Zero-Shot Learning for Object Recognition in the Wild

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Metric Learning for Large Scale Image Classification: Generalizing to New Classes at Near-Zero Cost

Abstract

Chapter PDF

Similar content being viewed by others

Large Scale Metric Learning for Distance-Based Image Classification on Open Ended Data Sets

A Study on Metric-Based and Initialization-Based Methods for Few-Shot Image Classification

An Empirical Study and Analysis of Generalized Zero-Shot Learning for Object Recognition in the Wild

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation