Abstract
In Human-Robot Interaction (HRI), the intention of a person to interact with another agent (robot or human) can be inferred from his/her head and upper body orientation. Furthermore, additional information on the person’s overall intention and motion direction can be determined with the knowledge of both orientations. This work presents an exhaustive evaluation of various combinations of RGB and depth image features with different classifiers. These evaluations intend to highlight the best feature representation for the body part orientation to classify, i.e., the person’s head or upper body. Our experiments demonstrate that high classification performances can be achieved by combining only three families of RGB and depth features and using a multiclass SVM classifier.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
For extensive evaluation results, please refer to the supplemental material at http://homepages.laas.fr/aamekonn/acivs16/supplement.pdf.
- 2.
For extensive evaluation results, please refer to the supplemental material at http://homepages.laas.fr/aamekonn/acivs16/supplement.pdf.
References
Andriluka, M., Roth, S., Schiele, B.: Monocular 3d pose estimation and tracking by detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2010), pp. 623–630, June 2010
Chen, C., Heili, A., Odobez, J.: Combined estimation of location and body pose in surveillance video. In: IEEE International Conference on Advanced Video and Signal-Based Surveillance (AVSS 2011), pp. 5–10 (2011)
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2005), vol. 1, pp. 886–893, June 2005
Dollar, P., Appel, R., Belongie, S., Perona, P.: Fast Feature Pyramids for Object Detection. IEEE Trans. Pattern Anal. Mach. Intell. 36(8), 1532–1545 (2014). 00127
Fanelli, G., Gall, J., Van Gool, L.: Real time head pose estimation with random regression forests. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2011), pp. 617–624, June 2011
Fitte-Duval, L., Mekonnen, A.A., Lerasle, F.: Upper body detection and feature set evaluation for body pose classification. In: International Conference on Computer Vision Theory and Applications (VISAPP 2015), pp. 439–446 (2015)
Fumito, S., Daisuke, D., Ichiro, I., Hiroshi, M., Hironobu, F.: Estimation of human orientation using coaxial RGB-depth images. In: International Conference on Computer Vision Theory and Applications (VISAPP 2015), pp. 113–120 (2015)
Ghiass, R.-S., Arandjelović, O., Laurendeau, D.: Highly accurate, fully automatic head pose estimation from a low quality consumer-level rgb-d sensor. In: Workshop on Computational Models of Social Interactions: Human-Computer-Media Communication, pp. 25–34 (2015)
Hayashi, M., Yamamoto, T., Aoki, Y., Ohshima, K., Tanabiki, M.: Head and upper body pose estimation in team sport videos. In: IAPR Asian Conference on Pattern Recognition (ACPR 2013), pp. 754–759, November 2013
Huang, C., Ding, X., Fang, C.: Head pose estimation based on random forests for multiclass classification. In: International Conference on Pattern Recognition (ICPR 2010), pp. 934–937, August 2010
Huynh, T., Min, R., Dugelay, J.-L.: An efficient LBP-based descriptor for facial depth images applied to gender recognition using RGB-D face data. In: Park, J.-I., Kim, J. (eds.) ACCV 2012. LNCS, vol. 7728, pp. 133–145. Springer, Heidelberg (2013). doi:10.1007/978-3-642-37410-4_12
Jafari, O.H., Mitzel, D., Leibe, B.: Real-time RGB-D based people detection and tracking for mobile robots and head-worn cameras. In: IEEE International Conference on Robotics and Automation (ICRA 2014), pp. 5636–5643, May 2014
Liu, W., Zhang, Y., Tang, S., Tang, J., Hong, R., Li, J.: Accurate estimation of human body orientation from RGB-D sensors. IEEE Trans. Cybern. 43(5), 1442–1452 (2013)
Maji, S., Bourdev, L., Malik, J.: Action recognition from a distributed representation of pose and appearance. In: 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3177–3184, June 2011. 00124
Mollaret, C., Mekonnen, A.A., Ferrane, I., Pinquier, J., Lerasle, F.: Perceiving user’s intention-for-interaction: a probabilistic multimodal data fusion scheme. In: IEEE International Conference on Multimedia and Expo (ICME 2015), pp. 1–6, June 2015
Murphy-Chutorian, E., Trivedi, M.M.: Head pose estimation computer vision: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 31(4), 607–626 (2009). 00859
Ojala, T., Pietikäinen, M., Mäenpää, T.: Gray scale and rotation invariant texture classification with local binary patterns. In: Vernon, D. (ed.) ECCV 2000. LNCS, vol. 1842, pp. 404–420. Springer, Heidelberg (2000). doi:10.1007/3-540-45054-8_27
Ojala, T., Pietikinen, M., Harwood, D.: A comparative study of texture measures with classification based on featured distributions. Pattern Recogn. 29(1), 51–59 (1996)
Papazov, C., Marks, T.K., Jones, M.: Real-time 3d head pose and facial landmark estimation from depth images using triangular surface patch features. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2015), pp. 4722–4730, June 2015
Siriteerakul, T.: Advance in head pose estimation from low resolution images: a review. Int. J. Comput. Sci. Issues 9(3), 442–449 (2012)
Spinello, L., Arras, K.O.: People detection in RGB-D data. In: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2011), pp. 3838–3843, September 2011
Tao, J., Klette, R.: Integrated pedestrian and direction classification using a random decision forest. In: IEEE International Conference on Computer Vision Workshops (ICCVW’13), pp. 230–237, December 2013
Vapnik, V.N.: The Nature of Statistical Learning Theory. Springer, New York (1999)
Wu, S., Yu, S., Chen, W.: An attempt to pedestrian detection in depth images. In: Chinese Conference on Intelligent Visual Surveillance (IVS 2011), pp. 1–3 (2011)
Acknowledgment
This work is funded by the ROMEO2 project (http://www.projetromeo.com/) in the framework of the Structuring Projects of Competitiveness Clusters (PSPC).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Fitte-Duval, L., Mekonnen, A.A., Lerasle, F. (2016). Combination of RGB-D Features for Head and Upper Body Orientation Classification. In: Blanc-Talon, J., Distante, C., Philips, W., Popescu, D., Scheunders, P. (eds) Advanced Concepts for Intelligent Vision Systems. ACIVS 2016. Lecture Notes in Computer Science(), vol 10016. Springer, Cham. https://doi.org/10.1007/978-3-319-48680-2_52
Download citation
DOI: https://doi.org/10.1007/978-3-319-48680-2_52
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48679-6
Online ISBN: 978-3-319-48680-2
eBook Packages: Computer ScienceComputer Science (R0)