A two-stage probabilistic approach for object recognition

Li, Stan Z.; Hornegger, Joachim

doi:10.1007/BFb0054776

Stan Z. Li¹ &
Joachim Hornegger²

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1407))

Included in the following conference series:

European Conference on Computer Vision

176 Accesses
4 Citations

Abstract

Assume that some objects are present in an image but can be seen only partially and are overlapping each other. To recognize the objects, we have to firstly separate the objects from one another, and then match them against the modeled objects using partial observation. This paper presents a probabilistic approach for solving this problem. Firstly, the task is formulated as a two-stage optimal estimation process. The first stage, matching, separates different objects and finds feature correspondences between the scene and each potential model object. The second stage, recognition, resolves inconsistencies among the results of matching to different objects and identifies object categories. Both the matching and recognition are formulated in terms of the maximum a posteriori (MAP) principle. Secondly, contextual constraints, which play an important role in solving the problem, are incorporated in the probabilistic formulation. Specifically, between-object constraints are encoded in the prior distribution modeled as a Markov random field, and within-object constraints are encoded in the likelihood distribution modeled as a Gaussian. They are combined into the posterior distribution which defines the MAP solution. Experimental results are presented for matching and recognizing jigsaw objects under partial occlusion, rotation, translation and scaling.

Download to read the full chapter text

Chapter PDF

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

N. Ayache and O. D. Faugeras. “HYPER: A new approach for the representation and positioning of two-dimensional objects”. IEEE Transactions on Pattern Analysis and Machine Intelligence, 8(1):44–54, January 1986.
Google Scholar
J. Besag. “On the statistical analysis of dirty pictures” (with discussions). Journal of the Royal Statistical Society, Series B, 48:259–302, 1986.
MATH MathSciNet Google Scholar
P. J. Besl and R. C. Jain. “Three-Dimensional object recognition”. Computing Surveys, 17(1):75–145, March 1985.
Article Google Scholar
R. Chellappa and A. Jain, editors. Markov Random Fields: Theory and Applications. Academic Press, 1993.
Google Scholar
P. R. Cooper. “Parallel structure recognition with uncertainty: Coupled segmentation and matching”. In Proceedings of IEEE International Conference on Computer Vision, pages 287–290, 1990.
Google Scholar
O. Faugeras. Three-Dimensional Computer Vision — A Geometric Viewpoint. MIT Press, Cambridge, MA, 1993.
Google Scholar
S. Geman and D. Geman. “Stochastic relaxation, Gibbs distribution and the Bayesian restoration of images”. IEEE Transactions on Pattern Analysis and Machine Intelligence, 6(6):721–741, November 1984.
Article MATH Google Scholar
W. E. L. Grimson. Object Recognition by Computer — The Role of Geometric Constraints. MIT Press, Cambridge, MA, 1990.
Google Scholar
J. Hornegger and H. Niemann. “Statistical learning, localization and identification of objects”. In Proceedings of IEEE International Conference on Computer Vision, pages 914–919, MIT, MA, 1995.
Google Scholar
R. A. Hummel and S. W. Zucker. “On the foundations of relaxation labeling process”. IEEE Transactions on Pattern Analysis and Machine Intelligence, 5(3):267–286, May 1983.
MATH Google Scholar
S. Kirkpatrick, C. D. Gellatt, and M. P. Vecchi. “Optimization by simulated annealing”. Science, 220:671–680, 1983.
MathSciNet Google Scholar
S. Z. Li. Markov Random Field Modeling in Computer Vision. Springer-Verlag, New York, 1995.
Google Scholar
S. Z. Li, H. Wang, K. L. Chan, and M. Petrou. “Minimization of MRP energy with relaxation labeling”. Journal of Mathematical Imaging and Vision, 7:149–161, 1997.
Article MathSciNet Google Scholar
K. V. Mardia and G. K. Kanji, editors. Statistics and Images: 1. Advances in Applied Statistics. Carfax, 1993.
Google Scholar
J. W. Modestino and J. Zhang. “A Markov random field model-based approach to image interpretation”. IEEE Transactions on Pattern Analysis and Machine Intelligence, 14(6):606–615, 1992.
Article Google Scholar
C. Peterson and B. Soderberg. “A new method for mapping optimization problems onto neural networks”. International Journal of Neural Systems, 1(1):3–22, 1989.
Article MATH Google Scholar
Ullman. High-Level Vision: Object Recognition and Visual Cognition. MIT Press, 1996.
Google Scholar

Download references

Author information

Authors and Affiliations

School of EEE, Nanyang Technological University, Nanyang Avenue, 639798, Singapore
Stan Z. Li
Robotics Laboratory, Stanford University, Gates Building 134, 94305-9010, Stanford, CA, USA
Joachim Hornegger

Authors

Stan Z. Li
View author publications
You can also search for this author in PubMed Google Scholar
Joachim Hornegger
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Hans Burkhardt Bernd Neumann

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, S.Z., Hornegger, J. (1998). A two-stage probabilistic approach for object recognition. In: Burkhardt, H., Neumann, B. (eds) Computer Vision — ECCV’98. ECCV 1998. Lecture Notes in Computer Science, vol 1407. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0054776

Download citation

DOI: https://doi.org/10.1007/BFb0054776
Published: 26 May 2006
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64613-6
Online ISBN: 978-3-540-69235-5
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics