skip to main content
10.1145/1027933.1027967acmconferencesArticle/Chapter ViewAbstractPublication Pagesicmi-mlmiConference Proceedingsconference-collections
Article

A vision-based sign language recognition system using tied-mixture density HMM

Authors Info & Claims
Published:13 October 2004Publication History

ABSTRACT

In this paper, a vision-based medium vocabulary Chinese sign language recognition (SLR) system is presented. The proposed recognition system consists of two modules. In the first module, techniques of robust hands detection, background subtraction and pupils detection are efficiently combined to precisely extract the feature information with the aid of simple colored gloves in the unconstrained environment. Meanwhile, an effective and efficient hierarchical feature description scheme with different scale features to characterize sign language is proposed, where principal component analysis (PCA) is employed to characterize the finger features more elaborately. In the second part, a Tied-Mixture Density Hidden Markov Models (TMDHMM) framework for SLR is proposed, which can speed up the recognition without the significant loss of recognition accuracy compared with the continuous hidden Markov models (CHMM). Experimental results based on 439 frequently used Chinese sign language (CSL) words show that the proposed methods can work well for the medium vocabulary SLR in the environment without special constraints and the recognition accuracy is up to 92.5%.

References

  1. R. H. Liang, and M. Ouhyoung, A real-time continuous gesture recognition system for sign language, Proc. 3rd Int'l Conf. Automatic Face and Gesture Recognition, Nara, pp.558--565, 1998.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. M.W. Kadous, Machine recognition of Auslan signs using PowerGloves: towards large-lexicon recognition of sign language, Proc. Workshop on the Integration of Gesture in Language and Speech, pp. 165--174, 1996.]]Google ScholarGoogle Scholar
  3. S.S. Fels and G.E. Hinton, Glove-talk: A neural network interface between a data-glove and a speech synthesizer, IEEE Trans. Neural Networks, vol. 4, no. 1, pp. 2--8, 1993.]]Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. J.S. Kim, W. Jang, and Z. Bien, A dynamic gesture recognition system for the Korean sign language (KSL), IEEE Trans. Systems, Man, and Cybernetics, vol. 26, no. 2, pp. 354--359, 1996.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. W. Gao, J.Y. Ma, J.Q. Wu, and C.L. Wang, Sign language recognition based on HMM/ANN/DP, Int'l J. Pattern Recognition and Artificial Intelligence, vol. 14, no. 5, pp. 587--602, 2000.]]Google ScholarGoogle ScholarCross RefCross Ref
  6. H. Mastuo, S. Igi, S. Lu, Y. Nagashima, Y. Takata, and T. Teshima, The recognition algorithm with non-contact for Japanese sign language using morphological analysis, Proc. Int'l Gesture Workshop, pp. 273--284, 1997.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. T. Starner, J. Weaver, and A. Pentland, Real-time American sign language recognition using desk and wearable computer based video, IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 20, no. 12, pp. 1371--1375, 1998.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. C. Vogler and D. Metaxas, Adapting hidden Markov models for ASL recognition by using three-dimensional computer vision methods, Proc. IEEE Int'l Conf. Systems, Man and Cybernetics, Orlando, pp.156--161, 1997.]]Google ScholarGoogle ScholarCross RefCross Ref
  9. C. Vogler and D. Metaxas, Toward scalability in ASL recognition: breaking down signs into phonemes, Proc. Int'l Gesture Workshop, Gif-sur-Yvette, France, pp. 400--404, 1999.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. C. Vogler and D. Metaxas, A framework for recognizing the simultaneous aspects of American sign language, Computer Vision and Image Understanding, vol. 81, no. 3, pp. 358--384, 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. K. Grobel and M. Assan, Isolated sign language recognition using hidden Markov models, Proc. IEEE Int'l Conf. Systems, Man and Cybernetics, pp. 162--167, 1996.]]Google ScholarGoogle Scholar
  12. B. Bauer and H. Hienz, Relevant features for video-based continuous sign language recognition, Proc. 4th Int'l Conf. Automatic Face and Gesture Recognition, Grenoble, pp. 440--445, 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. L. R. Rabiner, A tutorial on hidden Markov models and selected applications in speech recognition, Proceedings of the IEEE, vol. 77, no. 2, pp. 257--285, 1989.]] Google ScholarGoogle ScholarCross RefCross Ref
  14. V. V. Digalakis, P. Monaco and H. Murveit, Genones: generalized mixture tying in continuous hidden Markov model-based speech recognizers. IEEE Trans. Speech and Audio Processing, vol 4, no. 4, pp. 281--289, 1996.]]Google ScholarGoogle ScholarCross RefCross Ref
  15. J. Miao, W. Gao et al, Gravity-Center Template Based Human Face Feature Detection, Proc. 3rd Int'l Conf. Multimodal Interface, Beijing, pp. 207--214, 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. S. Jabri, Z. Duric, H. Wechsler, and A. Rosenfeld, Detection and location of people in video images using adaptive fusion of color and edge information, Proc. Int'l Conf. Pattern Recognition, Barcelona, Spain, vol. 4, pp. 627--630, 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. C. Wang and W. Gao, The application of HMMs consisting of different number states in CSL recognition, Journal of Computer Research and Development, vol. 38, no. 1, pp.111--115, 2001 (In Chinese).]]Google ScholarGoogle Scholar

Index Terms

  1. A vision-based sign language recognition system using tied-mixture density HMM

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          ICMI '04: Proceedings of the 6th international conference on Multimodal interfaces
          October 2004
          368 pages
          ISBN:1581139950
          DOI:10.1145/1027933

          Copyright © 2004 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 13 October 2004

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • Article

          Acceptance Rates

          Overall Acceptance Rate453of1,080submissions,42%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader