Combination of RGB-D Features for Head and Upper Body Orientation Classification

Fitte-Duval, Laurent; Mekonnen, Alhayat Ali; Lerasle, Frédéric

doi:10.1007/978-3-319-48680-2_52

Laurent Fitte-Duval¹⁸,
Alhayat Ali Mekonnen¹⁸ &
Frédéric Lerasle¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 10016))

Included in the following conference series:

International Conference on Advanced Concepts for Intelligent Vision Systems

2293 Accesses
2 Citations
3 Altmetric

Abstract

In Human-Robot Interaction (HRI), the intention of a person to interact with another agent (robot or human) can be inferred from his/her head and upper body orientation. Furthermore, additional information on the person’s overall intention and motion direction can be determined with the knowledge of both orientations. This work presents an exhaustive evaluation of various combinations of RGB and depth image features with different classifiers. These evaluations intend to highlight the best feature representation for the body part orientation to classify, i.e., the person’s head or upper body. Our experiments demonstrate that high classification performances can be achieved by combining only three families of RGB and depth features and using a multiclass SVM classifier.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
For extensive evaluation results, please refer to the supplemental material at http://homepages.laas.fr/aamekonn/acivs16/supplement.pdf.
2.
For extensive evaluation results, please refer to the supplemental material at http://homepages.laas.fr/aamekonn/acivs16/supplement.pdf.

References

Andriluka, M., Roth, S., Schiele, B.: Monocular 3d pose estimation and tracking by detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2010), pp. 623–630, June 2010
Google Scholar
Chen, C., Heili, A., Odobez, J.: Combined estimation of location and body pose in surveillance video. In: IEEE International Conference on Advanced Video and Signal-Based Surveillance (AVSS 2011), pp. 5–10 (2011)
Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2005), vol. 1, pp. 886–893, June 2005
Google Scholar
Dollar, P., Appel, R., Belongie, S., Perona, P.: Fast Feature Pyramids for Object Detection. IEEE Trans. Pattern Anal. Mach. Intell. 36(8), 1532–1545 (2014). 00127
Google Scholar
Fanelli, G., Gall, J., Van Gool, L.: Real time head pose estimation with random regression forests. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2011), pp. 617–624, June 2011
Google Scholar
Fitte-Duval, L., Mekonnen, A.A., Lerasle, F.: Upper body detection and feature set evaluation for body pose classification. In: International Conference on Computer Vision Theory and Applications (VISAPP 2015), pp. 439–446 (2015)
Google Scholar
Fumito, S., Daisuke, D., Ichiro, I., Hiroshi, M., Hironobu, F.: Estimation of human orientation using coaxial RGB-depth images. In: International Conference on Computer Vision Theory and Applications (VISAPP 2015), pp. 113–120 (2015)
Google Scholar
Ghiass, R.-S., Arandjelović, O., Laurendeau, D.: Highly accurate, fully automatic head pose estimation from a low quality consumer-level rgb-d sensor. In: Workshop on Computational Models of Social Interactions: Human-Computer-Media Communication, pp. 25–34 (2015)
Google Scholar
Hayashi, M., Yamamoto, T., Aoki, Y., Ohshima, K., Tanabiki, M.: Head and upper body pose estimation in team sport videos. In: IAPR Asian Conference on Pattern Recognition (ACPR 2013), pp. 754–759, November 2013
Google Scholar
Huang, C., Ding, X., Fang, C.: Head pose estimation based on random forests for multiclass classification. In: International Conference on Pattern Recognition (ICPR 2010), pp. 934–937, August 2010
Google Scholar
Huynh, T., Min, R., Dugelay, J.-L.: An efficient LBP-based descriptor for facial depth images applied to gender recognition using RGB-D face data. In: Park, J.-I., Kim, J. (eds.) ACCV 2012. LNCS, vol. 7728, pp. 133–145. Springer, Heidelberg (2013). doi:10.1007/978-3-642-37410-4_12
Chapter Google Scholar
Jafari, O.H., Mitzel, D., Leibe, B.: Real-time RGB-D based people detection and tracking for mobile robots and head-worn cameras. In: IEEE International Conference on Robotics and Automation (ICRA 2014), pp. 5636–5643, May 2014
Google Scholar
Liu, W., Zhang, Y., Tang, S., Tang, J., Hong, R., Li, J.: Accurate estimation of human body orientation from RGB-D sensors. IEEE Trans. Cybern. 43(5), 1442–1452 (2013)
Article Google Scholar
Maji, S., Bourdev, L., Malik, J.: Action recognition from a distributed representation of pose and appearance. In: 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3177–3184, June 2011. 00124
Google Scholar
Mollaret, C., Mekonnen, A.A., Ferrane, I., Pinquier, J., Lerasle, F.: Perceiving user’s intention-for-interaction: a probabilistic multimodal data fusion scheme. In: IEEE International Conference on Multimedia and Expo (ICME 2015), pp. 1–6, June 2015
Google Scholar
Murphy-Chutorian, E., Trivedi, M.M.: Head pose estimation computer vision: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 31(4), 607–626 (2009). 00859
Google Scholar
Ojala, T., Pietikäinen, M., Mäenpää, T.: Gray scale and rotation invariant texture classification with local binary patterns. In: Vernon, D. (ed.) ECCV 2000. LNCS, vol. 1842, pp. 404–420. Springer, Heidelberg (2000). doi:10.1007/3-540-45054-8_27
Chapter Google Scholar
Ojala, T., Pietikinen, M., Harwood, D.: A comparative study of texture measures with classification based on featured distributions. Pattern Recogn. 29(1), 51–59 (1996)
Article Google Scholar
Papazov, C., Marks, T.K., Jones, M.: Real-time 3d head pose and facial landmark estimation from depth images using triangular surface patch features. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2015), pp. 4722–4730, June 2015
Google Scholar
Siriteerakul, T.: Advance in head pose estimation from low resolution images: a review. Int. J. Comput. Sci. Issues 9(3), 442–449 (2012)
Google Scholar
Spinello, L., Arras, K.O.: People detection in RGB-D data. In: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2011), pp. 3838–3843, September 2011
Google Scholar
Tao, J., Klette, R.: Integrated pedestrian and direction classification using a random decision forest. In: IEEE International Conference on Computer Vision Workshops (ICCVW’13), pp. 230–237, December 2013
Google Scholar
Vapnik, V.N.: The Nature of Statistical Learning Theory. Springer, New York (1999)
MATH Google Scholar
Wu, S., Yu, S., Chen, W.: An attempt to pedestrian detection in depth images. In: Chinese Conference on Intelligent Visual Surveillance (IVS 2011), pp. 1–3 (2011)
Google Scholar

Download references

Acknowledgment

This work is funded by the ROMEO2 project (http://www.projetromeo.com/) in the framework of the Structuring Projects of Competitiveness Clusters (PSPC).

Author information

Authors and Affiliations

LAAS-CNRS, Université de Toulouse, CNRS, UPS, Toulouse, France
Laurent Fitte-Duval, Alhayat Ali Mekonnen & Frédéric Lerasle

Authors

Laurent Fitte-Duval
View author publications
You can also search for this author in PubMed Google Scholar
Alhayat Ali Mekonnen
View author publications
You can also search for this author in PubMed Google Scholar
Frédéric Lerasle
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Alhayat Ali Mekonnen .

Editor information

Editors and Affiliations

Université Paris-Sud 11 , Orsay, France
Jacques Blanc-Talon
University of Salento , Lecce, Lecce, Italy
Cosimo Distante
Ghent University , Gent, Belgium
Wilfried Philips
CSIRO ICT Centre , Sydney, New South Wales, Australia
Dan Popescu
University of Antwerp , Wilrijk, Belgium
Paul Scheunders

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Fitte-Duval, L., Mekonnen, A.A., Lerasle, F. (2016). Combination of RGB-D Features for Head and Upper Body Orientation Classification. In: Blanc-Talon, J., Distante, C., Philips, W., Popescu, D., Scheunders, P. (eds) Advanced Concepts for Intelligent Vision Systems. ACIVS 2016. Lecture Notes in Computer Science(), vol 10016. Springer, Cham. https://doi.org/10.1007/978-3-319-48680-2_52

Download citation

DOI: https://doi.org/10.1007/978-3-319-48680-2_52
Published: 21 October 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48679-6
Online ISBN: 978-3-319-48680-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics