Abstract
Assume that some objects are present in an image but can be seen only partially and are overlapping each other. To recognize the objects, we have to firstly separate the objects from one another, and then match them against the modeled objects using partial observation. This paper presents a probabilistic approach for solving this problem. Firstly, the task is formulated as a two-stage optimal estimation process. The first stage, matching, separates different objects and finds feature correspondences between the scene and each potential model object. The second stage, recognition, resolves inconsistencies among the results of matching to different objects and identifies object categories. Both the matching and recognition are formulated in terms of the maximum a posteriori (MAP) principle. Secondly, contextual constraints, which play an important role in solving the problem, are incorporated in the probabilistic formulation. Specifically, between-object constraints are encoded in the prior distribution modeled as a Markov random field, and within-object constraints are encoded in the likelihood distribution modeled as a Gaussian. They are combined into the posterior distribution which defines the MAP solution. Experimental results are presented for matching and recognizing jigsaw objects under partial occlusion, rotation, translation and scaling.
Chapter PDF
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
N. Ayache and O. D. Faugeras. “HYPER: A new approach for the representation and positioning of two-dimensional objects”. IEEE Transactions on Pattern Analysis and Machine Intelligence, 8(1):44–54, January 1986.
J. Besag. “On the statistical analysis of dirty pictures” (with discussions). Journal of the Royal Statistical Society, Series B, 48:259–302, 1986.
P. J. Besl and R. C. Jain. “Three-Dimensional object recognition”. Computing Surveys, 17(1):75–145, March 1985.
R. Chellappa and A. Jain, editors. Markov Random Fields: Theory and Applications. Academic Press, 1993.
P. R. Cooper. “Parallel structure recognition with uncertainty: Coupled segmentation and matching”. In Proceedings of IEEE International Conference on Computer Vision, pages 287–290, 1990.
O. Faugeras. Three-Dimensional Computer Vision — A Geometric Viewpoint. MIT Press, Cambridge, MA, 1993.
S. Geman and D. Geman. “Stochastic relaxation, Gibbs distribution and the Bayesian restoration of images”. IEEE Transactions on Pattern Analysis and Machine Intelligence, 6(6):721–741, November 1984.
W. E. L. Grimson. Object Recognition by Computer — The Role of Geometric Constraints. MIT Press, Cambridge, MA, 1990.
J. Hornegger and H. Niemann. “Statistical learning, localization and identification of objects”. In Proceedings of IEEE International Conference on Computer Vision, pages 914–919, MIT, MA, 1995.
R. A. Hummel and S. W. Zucker. “On the foundations of relaxation labeling process”. IEEE Transactions on Pattern Analysis and Machine Intelligence, 5(3):267–286, May 1983.
S. Kirkpatrick, C. D. Gellatt, and M. P. Vecchi. “Optimization by simulated annealing”. Science, 220:671–680, 1983.
S. Z. Li. Markov Random Field Modeling in Computer Vision. Springer-Verlag, New York, 1995.
S. Z. Li, H. Wang, K. L. Chan, and M. Petrou. “Minimization of MRP energy with relaxation labeling”. Journal of Mathematical Imaging and Vision, 7:149–161, 1997.
K. V. Mardia and G. K. Kanji, editors. Statistics and Images: 1. Advances in Applied Statistics. Carfax, 1993.
J. W. Modestino and J. Zhang. “A Markov random field model-based approach to image interpretation”. IEEE Transactions on Pattern Analysis and Machine Intelligence, 14(6):606–615, 1992.
C. Peterson and B. Soderberg. “A new method for mapping optimization problems onto neural networks”. International Journal of Neural Systems, 1(1):3–22, 1989.
Ullman. High-Level Vision: Object Recognition and Visual Cognition. MIT Press, 1996.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Li, S.Z., Hornegger, J. (1998). A two-stage probabilistic approach for object recognition. In: Burkhardt, H., Neumann, B. (eds) Computer Vision — ECCV’98. ECCV 1998. Lecture Notes in Computer Science, vol 1407. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0054776
Download citation
DOI: https://doi.org/10.1007/BFb0054776
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64613-6
Online ISBN: 978-3-540-69235-5
eBook Packages: Springer Book Archive