Skip to main content

Estimation of Pointing Poses on Monocular Images with Neural Techniques - An Experimental Comparison

  • Conference paper
  • 1882 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4669))

Abstract

Poses and gestures are an important part of the nonverbal inter-human communication. In the last years many different methods for estimating poses and gestures in the field of Human-Machine-Interfaces were developed. In this paper for the first time we present an experimental comparison of several re-implemented Neural Network based approaches for a demanding visual instruction task on a mobile system. For the comparison we used several Neural Networks (Neural Gas, SOM, LLM, PSOM and MLP) and a k-Nearest-Neighbourhood classificator on a common data set of images, which we recorded on our mobile robot Horos under real world conditions. For feature extraction we use Gaborjets and the features of a special histogram on the image. We also compare the results of the different approaches with the results of human subjects who estimated the target point of a pointing pose. The results obtained demonstrate that a cascade of MLPs is best suited to cope with the task and achieves results equal to human subjects.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Haasch, A., Hofemann, N., Fritsch, J., Sagerer, G.: A Multi-Modal Object Attention System for a Mobile Robot. In: Int. Conf. on Intelligent Robots and Systems, pp. 1499–1504 (2005)

    Google Scholar 

  2. Nickel, K., Stiefelhagen, R.: Recognition of 3D-Pointing Gestures for Human-Robot-Interaction. In: European Conference on Computer Vision, pp. 28–38 (2004)

    Google Scholar 

  3. Nölker, C., Ritter, H.: Illumination Independent Recognition of Deictic Arm Postures. In: Proceedings of the 24th Annual Conference of the IEEE Industrial Electronics Society, Aachen, pp. 2006–2011. IEEE Computer Society Press, Los Alamitos (1998)

    Google Scholar 

  4. Richarz, J., Martin, C., Scheidig, A., Gross, H-M.: There You Go! - Estimation Pointing Gestures in Monocular Images for Mobile Robot Instruction. In: Int. Symposium on Robot and Human Interactive Communication, pp. 546–551 (2006)

    Google Scholar 

  5. Takahashi, K., Tanigawa, T.: Remarks on Real-Time Human Posture Estimation from Silhouette Image using Neural Network. In: Proc. of the IEEE Int. Conf. on Systems, Man and Cybernetics: The Hague, pp. 370–375. IEEE Computer Society Press, Los Alamitos (2004)

    Google Scholar 

  6. Krüger, V., Sommer, G.: Gabor Wavelet Networks for Efficient Head Pose Estimation. Image and Vision Computing 20(9-10), 665–672 (2002)

    Article  Google Scholar 

  7. Stiefelhagen, R.: Estimating Head Pose with Neural Networks - Results on the Pointing04 ICPR Workshop Evaluation. In: Pointing04 ICPR, Cambridge, UK (2004)

    Google Scholar 

  8. Viola, P., Jones, M.: Rapid Object Detection using a Boosted Cascade of Simple Features. Proc. Conf. of Computer Vision and Patter Recognition 1, 511–518 (2001)

    Google Scholar 

  9. Gross, H.-M., Richarz, J., Mller, St., Scheidig, A., Martin, Chr.: Probabilistic Multi-modal People Tracker and Monocular Pointing Pose Estimator for Visual Instruction of Mobile Robot Assistants. In: Proc. IEEE World Congress on Computational Intelligence (WCCI 2006), pp. 8325–8333 (2006)

    Google Scholar 

  10. Martinetz, T., Schulten, K.: A Neural-Gas Network Learns Topologies. In: Proc. of the ICANN 1991. Helsinki, pp. 397–402 (1991)

    Google Scholar 

  11. Kohonen, T.: Self-Organized Formation of Topologically Correct Feature Maps. Biological Cybernetics 43, 59–69 (1982)

    Article  MATH  Google Scholar 

  12. Ritter, H.: Learning with the Self-Organizing Map. In: Kohonen, T., et al. (eds.) Artifical Neural Networks, pp. 379–384. Elsevier Science Publishers, Amsterdam (1991)

    Google Scholar 

  13. Walther, J.A., Ritter, H.: Rapid Learning with Parametrized Self-Organizing Maps. Neurocomputing 12, 131–153 (1996)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Joaquim Marques de Sá Luís A. Alexandre Włodzisław Duch Danilo Mandic

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Steege, FF., Martin, C., Groß, HM. (2007). Estimation of Pointing Poses on Monocular Images with Neural Techniques - An Experimental Comparison. In: de Sá, J.M., Alexandre, L.A., Duch, W., Mandic, D. (eds) Artificial Neural Networks – ICANN 2007. ICANN 2007. Lecture Notes in Computer Science, vol 4669. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74695-9_61

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-74695-9_61

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-74693-5

  • Online ISBN: 978-3-540-74695-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics