Sparse Patch-Histograms for Object Classification in Cluttered Images

Deselaers, Thomas; Hegerath, Andre; Keysers, Daniel; Ney, Hermann

doi:10.1007/11861898_21

Sparse Patch-Histograms for Object Classification in Cluttered Images

Thomas Deselaers²⁰,
Andre Hegerath²⁰,
Daniel Keysers²⁰ &
…
Hermann Ney²⁰

Conference paper

2252 Accesses
20 Citations

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4174))

Abstract

We present a novel model for object recognition and detection that follows the widely adopted assumption that objects in images can be represented as a set of loosely coupled parts. In contrast to former models, the presented method can cope with an arbitrary number of object parts. Here, the object parts are modelled by image patches that are extracted at each position and then efficiently stored in a histogram. In addition to the patch appearance, the positions of the extracted patches are considered and provide a significant increase in the recognition performance. Additionally, a new and efficient histogram comparison method taking into account inter-bin similarities is proposed. The presented method is evaluated for the task of radiograph recognition where it achieves the best result published so far. Furthermore it yields very competitive results for the commonly used Caltech object detection tasks.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Opelt, A., Pinz, A., Fussenegger, M., Auer, P.: Generic object recognition with boosting 28(3), 416–431 (2006)
Google Scholar
Fergus, R., Perona, P., Zisserman, A.: A sparse object category model for efficient learning and exhaustive recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2005), pp. 380–389 (2005)
Google Scholar
Ulusoy, I., Bishop, C.M.: Generative versus discriminative methods for object recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2005), pp. 258–265 (2005)
Google Scholar
Marée, R., Geurts, P., Piater, J., Wehenkel, L.: Random subwindows for robust image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 34–40 (2005)
Google Scholar
Deselaers, T., Keysers, D., Ney, H.: Discriminative training for object recognition using image patches. In: IEEE Conference on Computer Vision and Pattern Recognition, San Diego, CA, vol. 2, pp. 157–162 (2005)
Google Scholar
Bosch, A., Zissermann, A., Muñoz, X.: Scene Classification Via pLSA. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3954, pp. 517–530. Springer, Heidelberg (2006)
Chapter Google Scholar
Shotton, J., Winn, J., Rother, C., Criminisi, A.: TextonBoost: Joint Appearance, Shape and Context Modeling for Multi-class Object Recognition and Segmentation. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3951, pp. 1–15. Springer, Heidelberg (2006)
Chapter Google Scholar
Fergus, R., Perona, P., Zissermann, A.: Object class recognition by unsupervised scale-invariant learning. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2003), Blacksburg, VG, pp. 264–271 (2003)
Google Scholar
Leibe, B., Schiele, B.: Scale-Invariant Object Categorization Using a Scale-Adaptive Mean-Shift Search. In: Rasmussen, C.E., Bülthoff, H.H., Schölkopf, B., Giese, M.A. (eds.) DAGM 2004. LNCS, vol. 3175, pp. 145–153. Springer, Heidelberg (2004)
Chapter Google Scholar
Linde, O., Lindberg, T.: Object recognition using composed recpetive field histograms of higher dimensionality. In: International Conference on Pattern Recognition, Cambridge, UK (2004)
Google Scholar
Deselaers, T., Keysers, D., Ney, H.: Improving a Discriminative Approach to Object Recognition Using Image Patches. In: Kropatsch, W.G., Sablatnig, R., Hanbury, A. (eds.) DAGM 2005. LNCS, vol. 3663, pp. 326–333. Springer, Heidelberg (2005)
Chapter Google Scholar
Puzicha, J., Rubner, Y., Tomasi, C., Buhmann, J.: Empirical evaluation of dissimilarity measures for color and texture. In: International Conference on Computer Vision, Corfu, Greece, vol. 2, pp. 1165–1173 (1999)
Google Scholar
Keysers, D., Gollan, C., Ney, H.: Local context in non-linear deformation models for handwritten character recognition. In: International Conference on Pattern Recognition, Cambridge, UK, vol. 4, pp. 511–514 (2004)
Google Scholar
Jeon, J., Manmatha, R.: Using maximum entropy for automatic image annotation. In: Enser, P.G.B., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A.F., Smeulders, A.W.M. (eds.) CIVR 2004. LNCS, vol. 3115, pp. 24–32. Springer, Heidelberg (2004)
Chapter Google Scholar
Darroch, J.N., Ratcliff, D.: Generalized iterative scaling for log-linear models. Annals of Mathematical Statistics 43(5), 1470–1480 (1972)
Article MATH MathSciNet Google Scholar
Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn. Morgan Kaufmann, San Francisco (2005)
MATH Google Scholar
Clough, P., Müller, H., Deselaers, T., Grubinger, M., Lehmann, T.M., Jensen, J., Hersh, W.R.: The CLEF 2005 cross–language image retrieval track. In: Peters, C., Gey, F.C., Gonzalo, J., Müller, H., Jones, G.J.F., Kluck, M., Magnini, B., de Rijke, M., Giampiccolo, D. (eds.) CLEF 2005. LNCS, vol. 4022, pp. 535–557. Springer, Heidelberg (2006)
Chapter Google Scholar
Zhang, W., Yu, B., Zelinsky, G.J., Samaras, D.: Object class recognition using multiple layer boosting with heterogeneous features. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2005), pp. 323–330 (2005)
Google Scholar
Deselaers, T., Keysers, D., Ney, H.: Classification error rate for quantitative evaluation of content-based image retrieval systems. In: International Conference on Pattern Recognition 2004, Cambridge, UK, vol. 2, pp. 505–508 (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

Human Language Technology and Pattern Recognition Group, RWTH Aachen University, Aachen, Germany
Thomas Deselaers, Andre Hegerath, Daniel Keysers & Hermann Ney

Authors

Thomas Deselaers
View author publications
You can also search for this author in PubMed Google Scholar
Andre Hegerath
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Keysers
View author publications
You can also search for this author in PubMed Google Scholar
Hermann Ney
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Norwegian Information Security Laboratory, Gjøvik University College, Norway
Katrin Franke
Fraunhofer FIRST (IDA), Berlin, Germany
Klaus-Robert Müller
Department of Security Technology, Fraunhofer Institute for Production Systems and Design Technology (IPK), Pascalstr. 8-9, 10587, Berlin, Germany
Bertram Nickolay
Department of Electronic Imaging Technology, Fraunhofer Institute for Information and Communication Technology, Heinrich Hertz Institute (HHI), Einsteinufer 37, 10587, Berlin, Germany
Ralf Schäfer

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Deselaers, T., Hegerath, A., Keysers, D., Ney, H. (2006). Sparse Patch-Histograms for Object Classification in Cluttered Images. In: Franke, K., Müller, KR., Nickolay, B., Schäfer, R. (eds) Pattern Recognition. DAGM 2006. Lecture Notes in Computer Science, vol 4174. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11861898_21

Download citation

DOI: https://doi.org/10.1007/11861898_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44412-1
Online ISBN: 978-3-540-44414-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics