Abstract
In this paper, we propose a vision-based system for human detection and tracking in indoor environment using a static camera. The proposed method is based on object recognition in still images combined with methods using temporal information from the video. Doing that, we improve the performance of the overall system and reduce the task complexity. We first use background subtraction to limit the search space of the classifier. The segmentation is realized by modeling each background pixel by a single Gaussian model. As each connected component detected by the background subtraction potentially corresponds to one person, each blob is independently tracked. The tracking process is based on the analysis of connected components position and interest points tracking. In order to know the nature of various objects that could be present in the scene, we use multiple cascades of boosted classifiers based on Haar-like filters. We also present in this article a wide evaluation of this system based on a large set of videos.
Similar content being viewed by others
References
Ballard DH, Brown CM (1982) Computer vision. Prentice Hall Professional Technical Reference
Bay H, Ess A, Tuytelaars T, Van Gool L (2008) Surf: Speeded up robust features. Comput Vis Image Underst 110(3):346–359
Belongie S, Malik J, Puzicha J (2001) Matching shapes. In: International conference on computer vision, pp 454–461
Benezeth Y, Jodoin PM, Emile B, Laurent H, Rosenberger C (2008) Review and evaluation of commonly implemented background subtraction algorithms. In: International conference on pattern recognition, pp 1–4
Bovik AC (2005) Handbook of image and video processing. Academic Press, Orlando
Collins R, Lipton A, Kanade T, Fujiyoshi H, Duggins D, Tsin Y, Tolliver D, Enomoto N, Hasegawa O (2000) A system for video surveillance and monitoring. Technical report, Robotics Institute, Carnegie Mellon University
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Conference on computer vision and pattern recognition, vol 1, pp 886–893
Dalal N, Triggs B, Schmid C (2006) Human detection using oriented histograms of flow and appearance. In: European conference on computer vision, vol 2, pp 428–441
Dollár P, Wojek C, Schiele B, Perona P (2009) Pedestrian detection: A benchmark. In: Computer vision and pattern recognition, pp 304–311
Elgammal A, Harwood D, Davis L (2000) Non-parametric model for background subtraction. In: European conference on computer vision, pp 751–767
Everingham M, Zisserman A, Williams C, Van Gool L, Allan M, Bishop C, Chapelle O, Dalal N, Deselaers T, Dorko G (2005) The 2005 pascal visual object classes challenge. In: First PASCAL challenge workshop
Felzenszwalb P, McAllester D, Ramanan D (2008) A discriminatively trained, multiscale, deformable part model. In: Computer vision and pattern recognition
Filliat D (2007) A visual bag of words method for interactive qualitative localization and mapping. In: International conference on robotics and automation, pp 3921–3926
Gao W, Ai H, Lao S (2009) Adaptive contour features in oriented granular space for human detection and segmentation. In: Computer vision and pattern recognition
Gavrila DM (1999) The visual analysis of human movement: a survey. Comput Vis Image Underst 73:82–98
Gavrila DM (2007) A Bayesian, exemplar-based approach to hierarchical shape matching. Pattern Anal Mach Intell 29(8):1408–1421
Gavrila DM, Giebel J, Munder S (2004) Vison-based pedestrian detection: The protector system. In: Intelligent vehicles symposium, pp 13–18
Gu C, Lim JJ, Arbel”er P, Malik, J (2009) Recognition using regions. In: Computer vision and pattern recognition, pp 1030–1037
Haritaoglu I, Harwood D, Davis LS (2000) W 4: real-time surveillance of people and their activities. Pattern Anal Mach Intell 22:809–830
Hemery B, Laurent H, Rosenberger C, Emile B (2008) Evaluation protocol for localization metrics application to a comparative study. In: International conference on image and signal processing, pp 273–280
Hemery B, Laurent H, Rosenberger C (2009) Evaluation metric for image understanding. In: International conference on image processing
Johnsen S, Tews A (2009) Real-time object tracking and classification using a static camera. In: People and detection workshop of the international conference of robotics and automation
Jones M, Snow D (2008) Pedestrian detection using boosted features over many frames. In: International conference on pattern recognition, pp 1–4
Kahl W, Settanni R (1987) US patent 4703171: Lighting control system with infrared occupancy detector
Lowe DG (1999) Object recognition from local scale-invariant features. In: International conference on computer vision, vol 2, pp 1150–1157
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60:91–110
Lucas BD, Kanade T (1981) An iterative image registration technique with an application to stereo vision. In: International joint conference on artificial intelligence, pp 674–679
Martin D, Fowlkes C, Tal D, Malik J (2001) A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. In: International conference on computer vision, vol 2, pp 416–423
Oliver NM, Rosario B, Pentland AP (2000) A Bayesian computer vision system for modeling human interactions. Pattern Anal Mach Intell 22:831–843
Oren M, Papageorgiou C, Sinha P, Osuna E, Poggio T (1997) Pedestrian detection using wavelet templates. In: Computer vision and pattern recognition, pp 193–199
Papageorgiou C, Poggio T (2000) A trainable system for object detection. Int J Comput Vis 38:15–33
Schapire RE (2002) The boosting approach to machine learning: An overview. In: Workshop on NEC
Schiele B, Andriluka M, Majer N, Roth S, Wojek C (2009) Visual people detection: Different models, comparison and discussion. In: People detection and tracking workshop of the international conference on robotics and automation
Stauffer C, Grimson WEL (1999) Adaptive background mixture models for real-time tracking. In: Computer vision and pattern recognition, vol 2
Stauffer C, Eric W, Grimson L (2000) Learning patterns of activity using real-time tracking. In: Pattern analysis and machine intelligence, pp 747–757
Tuzel O, Porikli FM, Meer P (2008) Pedestrian detection via classification on Riemannian manifolds. Pattern Anal Mach Intell 30(10):1713–1727
Utsumi A, Tetsutani N (2002) Human detection using geometrical pixel value structures. In: International conference on automatic face and gesture recognition, pp 34–39
Viola P, Jones M (2001) Rapid object detection using a boosted cascade of simple features. In: Computer vision and pattern recognition, vol 1, pp 511–518
Viola P, Jones M, Snow D (2005) Detecting pedestrians using patterns of motion and appearance. Int J Comput Vis 63:153–161
Wojek C, Walk S, Schiele B (2009) Multi-cue onboard pedestrian detection. In: Computer vision and pattern recognition, pp 794–801
Wren C, Azarbayejani A, Darrell T, Pentland A (1997) Pfinder: Real-time tracking of the human body. In: Pattern analysis and machine intelligence
Wu B, Nevatia R (2006) Tracking of multiple, partially occluded humans based on static body part detection. In: Computer vision and pattern recognition, vol 1, pp 951–958
Wu B, Nevatia R (2007) Cluster boosted tree classifier for multi-view, multi-pose object detection. In: International conference on computer vision
Zhao Q, Kang J, Tao H, Hua W (2006) Part-based human tracking in a multiple cues fusion framework. In: International conference on pattern recognition, vol 1, pp 450–455
Author information
Authors and Affiliations
Corresponding author
Additional information
This work was made possible with the financial support of the Regional Council of Le Centre, the French Industry Ministry within the CAPTHOM project of the Competitiveness Pole S2E2.
Rights and permissions
About this article
Cite this article
Benezeth, Y., Emile, B., Laurent, H. et al. Vision-Based System for Human Detection and Tracking in Indoor Environment. Int J of Soc Robotics 2, 41–52 (2010). https://doi.org/10.1007/s12369-009-0040-4
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12369-009-0040-4