Skip to main content

Joint Bayesian Tracking of Head Location and Pose from Low-Resolution Video

  • Conference paper
Multimodal Technologies for Perception of Humans (RT 2007, CLEAR 2007)

Abstract

This paper presents a visual particle filter for jointly tracking the position of a person and her head pose. The resulting information may be used to support automatic analysis of interactive people behavior, by supporting proxemics analysis and providing dynamic information on focus of attention. A pose-sensitive visual likelihood is proposed which models the appearance of the target on a key-view basis, and uses body part color histograms as descriptors. Quantitative evaluations of the method on the ‘CLEAR’07 CHIL head pose’ corpus are reported and discusssed. The integration of multi-view sensing, the joint estimation of location and orientation, the use of generative imaging models, and of simple visual matching measures, make the system robust to low image resolution and significant color distortion.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ba, S.O., Odobez, J.M.: A probabilistic framework for joint head tracking and pose estimation. In: Proc. of ICPR 2004, vol. 4, pp. 264–267 (2004)

    Google Scholar 

  2. Ba, S.O., Odobez, J.M.: A Rao-Blackwellized mixed state particle filter for head pose tracking. In: ACM-ICMI Workshop on Multi-modal Multi-party Meeting Processing (MMMP), Trento, Italy, pp. 9–16 (2005)

    Google Scholar 

  3. Parker, K.: Speaking turns in small group interaction: A context-sensitive event sequence model. Journal of Personality and Social Psychology 54(6), 965–971 (1988)

    Article  Google Scholar 

  4. Stiefelhagen, R., Yang, J., Waibel, A.: Modeling Focus of Attention for Meeting Indexing based on Multiple Cues. IEEE Transactions on Neural Networks 13(4), 928–938 (2002)

    Article  Google Scholar 

  5. Gourier, N., Maisonnasse, J., Hall, D., Crowley, J.L.: Head Pose Estimation on Low Resolution Images. In: Stiefelhagen, R., Garofolo, J.S. (eds.) CLEAR 2006. LNCS, vol. 4122, Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  6. Voit, M., Nickel, K., Stiefelhagen, R.: Multi-View Head Pose Estimation using Neural Networks. In: Proc. of the 2nd Canadian Conference on Computer and Robot Vision CRV 2005, pp. 347–352 (2005)

    Google Scholar 

  7. Canton-Ferrer, C., Casas, J.R., Pardas, M.: Fusion of multiple viewpoint information towards 3d face robust orientation detection. In: IEEE International Conference on Image Processing, Genoa, Italy, vol. 2, pp. 366–369 (2005)

    Google Scholar 

  8. Zhang, Z., Hu, Y., Liu, M., Huang, T.: Head Pose Estimation in Seminar Room using Multi View Face Detectors. In: Stiefelhagen, R., Garofolo, J.S. (eds.) CLEAR 2006. LNCS, vol. 4122. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  9. Hall, E.T.: The Hidden Dimension: Man’s Use of Space in Public and Private. Doubleday, Garden City

    Google Scholar 

  10. Stiefelhagen, R., Garofolo, J.S. (eds.) Proceedings of CLEAR 2006 Workshop: Classification of Events, Activities and Relationships, Southampton, UK. CLEAR 2006. LNCS, vol. 4122. Springer, Heidelberg (2007), http://www.clearevaluation.org

  11. Isard, M., MacCormick, J., BraMBLe,: A Bayesian Multiple-Blob Tracker. In: IEEE International Conference on Computer Vision and Pattern Recognition (2000)

    Google Scholar 

  12. Comaniciu, D., Ramesh, V., Meer, P.: Real-Time Tracking of Non-Rigid Objects using Mean-Shift. In: IEEE International Conference on Computer Vision (2003)

    Google Scholar 

  13. Birchfield, S.T., Rangarajan, S.: Spatiograms versus Histograms for Region-Based Tracking. In: IEEE International Conference on Computer Vision (2005)

    Google Scholar 

  14. Doucet, A., de Freitas, N., Gordon, N.: Sequential Monte Carlo Methods in Practice. Springer, Heidelberg (2001)

    MATH  Google Scholar 

  15. OpenGL: The Industry’s Foundation for High Performance Graphics. [Online]: http://www.opengl.org/

  16. Lanz, O., Chippendale, P., Brunelli, R.: An Appearance-based Particle Filter for Visual Tracking in Smart Rooms. In: CLEAR 2007 Workshop: Classification of Events, Activities and Relationships. LNCS, vol. 4625. Springer, Heidelberg (2008)

    Google Scholar 

  17. Brunelli, R., Poggio, T.: Template Matching: Matched Spatial Filters and Beyond. Pattern Recognition 30(5), 751–768 (1997)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Rainer Stiefelhagen Rachel Bowers Jonathan Fiscus

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lanz, O., Brunelli, R. (2008). Joint Bayesian Tracking of Head Location and Pose from Low-Resolution Video. In: Stiefelhagen, R., Bowers, R., Fiscus, J. (eds) Multimodal Technologies for Perception of Humans. RT CLEAR 2007 2007. Lecture Notes in Computer Science, vol 4625. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68585-2_27

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-68585-2_27

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-68584-5

  • Online ISBN: 978-3-540-68585-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics