Abstract
This paper presents a visual particle filter for jointly tracking the position of a person and her head pose. The resulting information may be used to support automatic analysis of interactive behavior among people, both by enabling proxemics analysis and by providing dynamic information on focus of attention. A pose-sensitive visual likelihood is proposed that models the appearance of the target on a key-view basis and uses body part color histograms as descriptors. Quantitative evaluations of the method on the ‘CLEAR’07 CHIL head pose’ corpus are reported and discussed. The integration of multi-view sensing, the joint estimation of location and orientation, the use of generative imaging models, and the use of simple visual matching measures together make the system robust to low image resolution and significant color distortion.
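The abstract describes the approach only at a high level, so the following is a minimal sketch of a generic bootstrap particle filter over a joint state (x, y, pan), not the authors' implementation. Several pieces are assumptions made for illustration: the random-walk dynamics, the use of the Bhattacharyya coefficient as the "simple visual matching measure", the `sigma` parameter, and the hypothetical callback `render_hist`, which stands in for a generative appearance model that returns the color histogram expected under a given state.

```python
import math
import random


def bhattacharyya(p, q):
    """Bhattacharyya coefficient between two normalized histograms."""
    return sum(math.sqrt(a * b) for a, b in zip(p, q))


def likelihood(observed_hist, expected_hist, sigma=0.2):
    """Assumed likelihood: Gaussian in the Bhattacharyya distance."""
    d2 = max(0.0, 1.0 - bhattacharyya(observed_hist, expected_hist))
    return math.exp(-d2 / (2.0 * sigma ** 2))


def propagate(particle, rng, pos_std=0.05, pan_std=10.0):
    """Assumed dynamics: random walk on position and on pan angle (degrees)."""
    x, y, pan = particle
    return (
        x + rng.gauss(0.0, pos_std),
        y + rng.gauss(0.0, pos_std),
        (pan + rng.gauss(0.0, pan_std)) % 360.0,
    )


def step(particles, render_hist, observed_hist, rng):
    """One predict, weight, resample cycle.

    render_hist(particle) is a hypothetical stand-in for a generative
    appearance model; it returns the histogram expected for that state.
    Returns (resampled particles, predicted particles, normalized weights).
    """
    predicted = [propagate(p, rng) for p in particles]
    weights = [likelihood(observed_hist, render_hist(p)) for p in predicted]
    total = sum(weights)
    if total > 0.0:
        weights = [w / total for w in weights]
    else:
        # Degenerate case: fall back to uniform weights.
        weights = [1.0 / len(predicted)] * len(predicted)
    resampled = rng.choices(predicted, weights=weights, k=len(predicted))
    return resampled, predicted, weights


def estimate(particles, weights):
    """Weighted mean position and weighted circular mean of the pan angle."""
    x = sum(w * p[0] for p, w in zip(particles, weights))
    y = sum(w * p[1] for p, w in zip(particles, weights))
    s = sum(w * math.sin(math.radians(p[2])) for p, w in zip(particles, weights))
    c = sum(w * math.cos(math.radians(p[2])) for p, w in zip(particles, weights))
    pan = math.degrees(math.atan2(s, c)) % 360.0
    return x, y, pan
```

Because location and orientation share one state vector, a single likelihood evaluation scores both hypotheses at once, which is the sense in which the estimation is joint. The circular mean in `estimate` avoids averaging pan angles such as 350 and 10 degrees to 180.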
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lanz, O., Brunelli, R. (2008). Joint Bayesian Tracking of Head Location and Pose from Low-Resolution Video. In: Stiefelhagen, R., Bowers, R., Fiscus, J. (eds) Multimodal Technologies for Perception of Humans. CLEAR 2007, RT 2007. Lecture Notes in Computer Science, vol 4625. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68585-2_27
DOI: https://doi.org/10.1007/978-3-540-68585-2_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68584-5
Online ISBN: 978-3-540-68585-2
eBook Packages: Computer Science, Computer Science (R0)