Abstract
This chapter presents an approach for texture and object recognition that uses scale- or affine-invariant local image features in combination with a discriminative classifier. Textures are represented using a visual dictionary found by quantizing appearance-based descriptors of local features. Object classes are represented using a dictionary of composite semi-local parts, or groups of nearby features with stable and distinctive appearance and geometric layout. A discriminative maximum entropy framework is used to learn the posterior distribution of the class label given the occurrences of parts from the dictionary in the training set. Experiments on two texture and two object databases demonstrate the effectiveness of this framework for visual classification.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Agarwal, S., Roth, D.: Learning a Sparse Representation for Object Detection. In: Heyden, A., et al. (eds.) ECCV 2002. LNCS, vol. 2353, pp. 113–127. Springer, Heidelberg (2002)
Berg, A., Berg, T., Malik, J.: Shape Matching and Object Recognition Using Low-Distortion Correspondence. In: Proc. CVPR (2005)
Berger, A., Della Pietra, S., Della Pietra, V.: A Maximum Entropy Approach to Natural Language Processing. Computational Linguistics 22(1), 39–71 (1996)
Brodatz, P.: Textures: A Photographic Album for Artists and Designers. Dover, New York (1966)
Chen, S., Goodman, J.: An Empirical Study of Smoothing Techniques for Language Modeling. In: Proc. Conf. of the Association for Computational Linguistics, pp. 310–318 (1996)
Csurka, G., Bray, C., Dance, C., Fan, L.: Visual Categorization with Bags of Keypoints. In: ECCV Workshop on Statistical Learning in Computer Vision (2004)
Dorko, G., Schmid, C.: Selection of Scale-Invariant Parts for Object Class Recognition. In: Proc. ICCV, vol.I, pp. 634–640 (2003)
Fergus, R., Perona, P., Zisserman, A.: Object Class Recognition by Unsupervised Scale-Invariant Learning. In: Proc. CVPR 2003, vol. II, pp. 264–271 (2003)
Ferrari, V., Tuytelaars, T., Van Gool, L.: Simultaneous object recognition and segmentation by image exploration. In: Proc. ECCV (2004)
Jeon, J., Manmatha, R.: Using maximum entropy for automatic image annotation. In: Enser, P.G.B., et al. (eds.) CIVR 2004. LNCS, vol. 3115, pp. 24–32. Springer, Heidelberg (2004)
Jurie, F., Schmid, C.: Scale-invariant Shape Features for Recognition of Object Categories. In: Proc. CVPR (2004)
Keysers, D., Och, F., Ney, H.: Maximum Entropy and Gaussian Models for Image Object Recognition. In: Van Gool, L. (ed.) DAGM 2002. LNCS, vol. 2449, p. 498. Springer, Heidelberg (2002)
Lazebnik, S., Schmid, C., Ponce, J.: A Sparse Texture Representation Using Local Affine Regions. IEEE Trans. PAMI 27(8), 1265–1278 (2005)
Lazebnik, S., Schmid, C., Ponce, J.: A Maximum Entropy Framework for Part-Based Texture and Object Recognition. In: Proc. ICCV 2005 (to appear, 2005)
Lazebnik, S., Schmid, C., Ponce, J.: Semi-local Affine Parts for Object Recognition. In: Proc. BMVC 2004 (2004)
Lindeberg, T.: Feature Detection with Automatic Scale Selection. IJCV 30(2), 77–116 (1998)
Lowe, D.: Distinctive image features from scale-invariant keypoints. IJCV 60(2), 91–110 (2004)
Mahamud, S., Hebert, M., Lafferty, J.: Combining Simple Discriminators for Object Discrimination. In: Heyden, A., et al. (eds.) ECCV 2002. LNCS, vol. 2352, pp. 776–790. Springer, Heidelberg (2002)
McCallum, A., Nigam, K.: A Comparison of Event Models for Naive Bayes Text Classification. In: AAAI 1998 Workshop on Learning for Text Categorization, pp. 41–48 (1998)
Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. In: Proc. CVPR 2003, vol. 2, pp. 257–263 (2003)
Nigam, K., Lafferty, J., McCallum, A.: Using Maximum Entropy for Text Classification. In: IJCAI Workshop on Machine Learning for Information Filtering, pp. 61–67 (1999)
Rothganger, F., Lazebnik, S., Schmid, C., Ponce, J.: 3D object modeling and recognition using local affine-invariant image descriptors and multi-view spatial constraints. In: IJCV (to appear, 2005)
Sivic, J., Zisserman, A.: Video Google: A Text Retrieval Approach to Object Matching in Videos. In: Proc. ICCV 2003, pp. 1470–1477 (2003)
Sivic, J., Russell, B., Efros, A., Zisserman, A., Freeman, W.: Discovering objects and their location in images. In: Proc. ICCV 2005 (to appear, 2005)
Varma, M., Zisserman, A.: Texture Classification: Are Filter Banks Necessary? In: Proc. CVPR 2003, vol. 2, pp. 691–698 (2003)
Weber, M., Welling, M., Perona, P.: Unsupervised learning of models for recognition. In: Vernon, D. (ed.) ECCV 2000. LNCS, vol. 1842, pp. 18–32. Springer, Heidelberg (2000)
Willamowski, J., Arregui, D., Csurka, G., Dance, C.R., Fan, L.: Categorizing nine visual classes using local appearance descriptors. In: International Workshop on Learning for Adaptable Visual Systems (2004)
Zhu, S.C., Wu, Y.N., Mumford, D.: Filters, Random Fields, and Maximum Entropy (FRAME): Towards a Unified Theory for Texture Modeling. IJCV 27(2), 1–20 (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Lazebnik, S., Schmid, C., Ponce, J. (2006). A Discriminative Framework for Texture and Object Recognition Using Local Image Features. In: Ponce, J., Hebert, M., Schmid, C., Zisserman, A. (eds) Toward Category-Level Object Recognition. Lecture Notes in Computer Science, vol 4170. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11957959_22
Download citation
DOI: https://doi.org/10.1007/11957959_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68794-8
Online ISBN: 978-3-540-68795-5
eBook Packages: Computer ScienceComputer Science (R0)