Skip to main content
Log in

Vision-Based System for Human Detection and Tracking in Indoor Environment

  • Published:
International Journal of Social Robotics Aims and scope Submit manuscript

Abstract

In this paper, we propose a vision-based system for human detection and tracking in indoor environment using a static camera. The proposed method is based on object recognition in still images combined with methods using temporal information from the video. Doing that, we improve the performance of the overall system and reduce the task complexity. We first use background subtraction to limit the search space of the classifier. The segmentation is realized by modeling each background pixel by a single Gaussian model. As each connected component detected by the background subtraction potentially corresponds to one person, each blob is independently tracked. The tracking process is based on the analysis of connected components position and interest points tracking. In order to know the nature of various objects that could be present in the scene, we use multiple cascades of boosted classifiers based on Haar-like filters. We also present in this article a wide evaluation of this system based on a large set of videos.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Ballard DH, Brown CM (1982) Computer vision. Prentice Hall Professional Technical Reference

  2. Bay H, Ess A, Tuytelaars T, Van Gool L (2008) Surf: Speeded up robust features. Comput Vis Image Underst 110(3):346–359

    Article  Google Scholar 

  3. Belongie S, Malik J, Puzicha J (2001) Matching shapes. In: International conference on computer vision, pp 454–461

  4. Benezeth Y, Jodoin PM, Emile B, Laurent H, Rosenberger C (2008) Review and evaluation of commonly implemented background subtraction algorithms. In: International conference on pattern recognition, pp 1–4

  5. Bovik AC (2005) Handbook of image and video processing. Academic Press, Orlando

    Google Scholar 

  6. Collins R, Lipton A, Kanade T, Fujiyoshi H, Duggins D, Tsin Y, Tolliver D, Enomoto N, Hasegawa O (2000) A system for video surveillance and monitoring. Technical report, Robotics Institute, Carnegie Mellon University

  7. Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Conference on computer vision and pattern recognition, vol 1, pp 886–893

  8. Dalal N, Triggs B, Schmid C (2006) Human detection using oriented histograms of flow and appearance. In: European conference on computer vision, vol 2, pp 428–441

  9. Dollár P, Wojek C, Schiele B, Perona P (2009) Pedestrian detection: A benchmark. In: Computer vision and pattern recognition, pp 304–311

  10. Elgammal A, Harwood D, Davis L (2000) Non-parametric model for background subtraction. In: European conference on computer vision, pp 751–767

  11. Everingham M, Zisserman A, Williams C, Van Gool L, Allan M, Bishop C, Chapelle O, Dalal N, Deselaers T, Dorko G (2005) The 2005 pascal visual object classes challenge. In: First PASCAL challenge workshop

  12. Felzenszwalb P, McAllester D, Ramanan D (2008) A discriminatively trained, multiscale, deformable part model. In: Computer vision and pattern recognition

  13. Filliat D (2007) A visual bag of words method for interactive qualitative localization and mapping. In: International conference on robotics and automation, pp 3921–3926

  14. Gao W, Ai H, Lao S (2009) Adaptive contour features in oriented granular space for human detection and segmentation. In: Computer vision and pattern recognition

  15. Gavrila DM (1999) The visual analysis of human movement: a survey. Comput Vis Image Underst 73:82–98

    Article  MATH  Google Scholar 

  16. Gavrila DM (2007) A Bayesian, exemplar-based approach to hierarchical shape matching. Pattern Anal Mach Intell 29(8):1408–1421

    Article  Google Scholar 

  17. Gavrila DM, Giebel J, Munder S (2004) Vison-based pedestrian detection: The protector system. In: Intelligent vehicles symposium, pp 13–18

  18. Gu C, Lim JJ, Arbel”er P, Malik, J (2009) Recognition using regions. In: Computer vision and pattern recognition, pp 1030–1037

  19. Haritaoglu I, Harwood D, Davis LS (2000) W 4: real-time surveillance of people and their activities. Pattern Anal Mach Intell 22:809–830

    Article  Google Scholar 

  20. Hemery B, Laurent H, Rosenberger C, Emile B (2008) Evaluation protocol for localization metrics application to a comparative study. In: International conference on image and signal processing, pp 273–280

  21. Hemery B, Laurent H, Rosenberger C (2009) Evaluation metric for image understanding. In: International conference on image processing

  22. Johnsen S, Tews A (2009) Real-time object tracking and classification using a static camera. In: People and detection workshop of the international conference of robotics and automation

  23. Jones M, Snow D (2008) Pedestrian detection using boosted features over many frames. In: International conference on pattern recognition, pp 1–4

  24. Kahl W, Settanni R (1987) US patent 4703171: Lighting control system with infrared occupancy detector

  25. Lowe DG (1999) Object recognition from local scale-invariant features. In: International conference on computer vision, vol 2, pp 1150–1157

  26. Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60:91–110

    Article  Google Scholar 

  27. Lucas BD, Kanade T (1981) An iterative image registration technique with an application to stereo vision. In: International joint conference on artificial intelligence, pp 674–679

  28. Martin D, Fowlkes C, Tal D, Malik J (2001) A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. In: International conference on computer vision, vol 2, pp 416–423

  29. Oliver NM, Rosario B, Pentland AP (2000) A Bayesian computer vision system for modeling human interactions. Pattern Anal Mach Intell 22:831–843

    Article  Google Scholar 

  30. Oren M, Papageorgiou C, Sinha P, Osuna E, Poggio T (1997) Pedestrian detection using wavelet templates. In: Computer vision and pattern recognition, pp 193–199

  31. Papageorgiou C, Poggio T (2000) A trainable system for object detection. Int J Comput Vis 38:15–33

    Article  MATH  Google Scholar 

  32. Schapire RE (2002) The boosting approach to machine learning: An overview. In: Workshop on NEC

  33. Schiele B, Andriluka M, Majer N, Roth S, Wojek C (2009) Visual people detection: Different models, comparison and discussion. In: People detection and tracking workshop of the international conference on robotics and automation

  34. Stauffer C, Grimson WEL (1999) Adaptive background mixture models for real-time tracking. In: Computer vision and pattern recognition, vol 2

  35. Stauffer C, Eric W, Grimson L (2000) Learning patterns of activity using real-time tracking. In: Pattern analysis and machine intelligence, pp 747–757

  36. Tuzel O, Porikli FM, Meer P (2008) Pedestrian detection via classification on Riemannian manifolds. Pattern Anal Mach Intell 30(10):1713–1727

    Article  Google Scholar 

  37. Utsumi A, Tetsutani N (2002) Human detection using geometrical pixel value structures. In: International conference on automatic face and gesture recognition, pp 34–39

  38. Viola P, Jones M (2001) Rapid object detection using a boosted cascade of simple features. In: Computer vision and pattern recognition, vol 1, pp 511–518

  39. Viola P, Jones M, Snow D (2005) Detecting pedestrians using patterns of motion and appearance. Int J Comput Vis 63:153–161

    Article  Google Scholar 

  40. Wojek C, Walk S, Schiele B (2009) Multi-cue onboard pedestrian detection. In: Computer vision and pattern recognition, pp 794–801

  41. Wren C, Azarbayejani A, Darrell T, Pentland A (1997) Pfinder: Real-time tracking of the human body. In: Pattern analysis and machine intelligence

  42. Wu B, Nevatia R (2006) Tracking of multiple, partially occluded humans based on static body part detection. In: Computer vision and pattern recognition, vol 1, pp 951–958

  43. Wu B, Nevatia R (2007) Cluster boosted tree classifier for multi-view, multi-pose object detection. In: International conference on computer vision

  44. Zhao Q, Kang J, Tao H, Hua W (2006) Part-based human tracking in a multiple cues fusion framework. In: International conference on pattern recognition, vol 1, pp 450–455

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Y. Benezeth.

Additional information

This work was made possible with the financial support of the Regional Council of Le Centre, the French Industry Ministry within the CAPTHOM project of the Competitiveness Pole S2E2.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Benezeth, Y., Emile, B., Laurent, H. et al. Vision-Based System for Human Detection and Tracking in Indoor Environment. Int J of Soc Robotics 2, 41–52 (2010). https://doi.org/10.1007/s12369-009-0040-4

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12369-009-0040-4

Navigation