Joint Bayesian Tracking of Head Location and Pose from Low-Resolution Video

Lanz, Oswald; Brunelli, Roberto

doi:10.1007/978-3-540-68585-2_27

Oswald Lanz¹ &
Roberto Brunelli¹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4625))

Included in the following conference series:

1228 Accesses
7 Citations

Abstract

This paper presents a visual particle filter for jointly tracking the position of a person and her head pose. The resulting information may be used to support automatic analysis of interactive people behavior, by supporting proxemics analysis and providing dynamic information on focus of attention. A pose-sensitive visual likelihood is proposed which models the appearance of the target on a key-view basis, and uses body part color histograms as descriptors. Quantitative evaluations of the method on the ‘CLEAR’07 CHIL head pose’ corpus are reported and discusssed. The integration of multi-view sensing, the joint estimation of location and orientation, the use of generative imaging models, and of simple visual matching measures, make the system robust to low image resolution and significant color distortion.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Ba, S.O., Odobez, J.M.: A probabilistic framework for joint head tracking and pose estimation. In: Proc. of ICPR 2004, vol. 4, pp. 264–267 (2004)
Google Scholar
Ba, S.O., Odobez, J.M.: A Rao-Blackwellized mixed state particle filter for head pose tracking. In: ACM-ICMI Workshop on Multi-modal Multi-party Meeting Processing (MMMP), Trento, Italy, pp. 9–16 (2005)
Google Scholar
Parker, K.: Speaking turns in small group interaction: A context-sensitive event sequence model. Journal of Personality and Social Psychology 54(6), 965–971 (1988)
Article Google Scholar
Stiefelhagen, R., Yang, J., Waibel, A.: Modeling Focus of Attention for Meeting Indexing based on Multiple Cues. IEEE Transactions on Neural Networks 13(4), 928–938 (2002)
Article Google Scholar
Gourier, N., Maisonnasse, J., Hall, D., Crowley, J.L.: Head Pose Estimation on Low Resolution Images. In: Stiefelhagen, R., Garofolo, J.S. (eds.) CLEAR 2006. LNCS, vol. 4122, Springer, Heidelberg (2007)
Chapter Google Scholar
Voit, M., Nickel, K., Stiefelhagen, R.: Multi-View Head Pose Estimation using Neural Networks. In: Proc. of the 2nd Canadian Conference on Computer and Robot Vision CRV 2005, pp. 347–352 (2005)
Google Scholar
Canton-Ferrer, C., Casas, J.R., Pardas, M.: Fusion of multiple viewpoint information towards 3d face robust orientation detection. In: IEEE International Conference on Image Processing, Genoa, Italy, vol. 2, pp. 366–369 (2005)
Google Scholar
Zhang, Z., Hu, Y., Liu, M., Huang, T.: Head Pose Estimation in Seminar Room using Multi View Face Detectors. In: Stiefelhagen, R., Garofolo, J.S. (eds.) CLEAR 2006. LNCS, vol. 4122. Springer, Heidelberg (2007)
Chapter Google Scholar
Hall, E.T.: The Hidden Dimension: Man’s Use of Space in Public and Private. Doubleday, Garden City
Google Scholar
Stiefelhagen, R., Garofolo, J.S. (eds.) Proceedings of CLEAR 2006 Workshop: Classification of Events, Activities and Relationships, Southampton, UK. CLEAR 2006. LNCS, vol. 4122. Springer, Heidelberg (2007), http://www.clearevaluation.org
Isard, M., MacCormick, J., BraMBLe,: A Bayesian Multiple-Blob Tracker. In: IEEE International Conference on Computer Vision and Pattern Recognition (2000)
Google Scholar
Comaniciu, D., Ramesh, V., Meer, P.: Real-Time Tracking of Non-Rigid Objects using Mean-Shift. In: IEEE International Conference on Computer Vision (2003)
Google Scholar
Birchfield, S.T., Rangarajan, S.: Spatiograms versus Histograms for Region-Based Tracking. In: IEEE International Conference on Computer Vision (2005)
Google Scholar
Doucet, A., de Freitas, N., Gordon, N.: Sequential Monte Carlo Methods in Practice. Springer, Heidelberg (2001)
MATH Google Scholar
OpenGL: The Industry’s Foundation for High Performance Graphics. [Online]: http://www.opengl.org/
Lanz, O., Chippendale, P., Brunelli, R.: An Appearance-based Particle Filter for Visual Tracking in Smart Rooms. In: CLEAR 2007 Workshop: Classification of Events, Activities and Relationships. LNCS, vol. 4625. Springer, Heidelberg (2008)
Google Scholar
Brunelli, R., Poggio, T.: Template Matching: Matched Spatial Filters and Beyond. Pattern Recognition 30(5), 751–768 (1997)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Bruno Kessler Foundation - irst, Via Sommarive 18, 38050, Povo di Trento, Italy
Oswald Lanz & Roberto Brunelli

Authors

Oswald Lanz
View author publications
You can also search for this author in PubMed Google Scholar
Roberto Brunelli
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Rainer Stiefelhagen Rachel Bowers Jonathan Fiscus

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lanz, O., Brunelli, R. (2008). Joint Bayesian Tracking of Head Location and Pose from Low-Resolution Video. In: Stiefelhagen, R., Bowers, R., Fiscus, J. (eds) Multimodal Technologies for Perception of Humans. RT CLEAR 2007 2007. Lecture Notes in Computer Science, vol 4625. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68585-2_27

Download citation

DOI: https://doi.org/10.1007/978-3-540-68585-2_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68584-5
Online ISBN: 978-3-540-68585-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics