ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
advertisementadvertisement
Speech Communication
Volume 36, Issues 3-4, March 2002, Pages 305-315
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Article
Purchase PDF (176 K)

 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/S0167-6393(00)00089-3    How to Cite or Link Using DOI (Opens New Window)
Copyright © 2002 Elsevier Science B.V. All rights reserved.

Speaker clustering for speech recognition using vocal tract parameters

Masaki NaitoCorresponding Author Contact Information, a, Li Dengb and Yoshinori Sagisakaa

a ATR Interpreting Telecommunications Research Laboratory, 2-2 Hikaridai, Seika-cho, Soraku-gun, Kyoto 619-0288, Japan b Department of Electrical and Computer Engineering, University of Waterloo, Waterloo, Ont., Canada N2L 3G1

Received 4 April 2000;
revised 14 August 2000;
accepted 29 September 2000
Available online 7 January 2002.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

We propose speaker clustering methods for speech recognition based on vocal tract (VT) size related articulatory parameters associated with individual speakers. Two parameters characterizing gross VT dimensions are first derived from the formant frequencies of two vowels and are then used to cluster speakers. The resulting speaker clusters are significantly different from speaker clusters obtained by conventional acoustic criteria. Then phoneme recognition experiments are carried out by using speaker-clustered HMMs (SC-HMMs) trained for each cluster. The proposed method requires a small amount of speech data for speaker clustering and for selecting the most suitable SC-HMM for a target speaker, but gives higher recognition rates than conventional speaker clustering methods based on acoustic criteria.

Author Keywords: Vocal tract parameters; Speaker-clustering; Speech recognition

Article Outline

1. Introduction
2. Speaker clustering methods
2.1. Vocal tract parameters
2.2. Estimation of VT parameters
2.3. Speaker-clustering algorithm
3. Speaker cluster selection
3.1. Speaker cluster selection based on the VT parameters
3.2. Speaker cluster selection based on the maximum likelihood criterion
4. Experiments
4.1. Conditions
4.2. Speaker clustering
4.3. Distribution of VT parameters
4.4. Results of speaker clustering
4.5. Speech recognition experiments using speaker cluster selection based on VT parameters
4.6. Speech recognition experiments using speaker cluster selection based on likelihood
5. Conclusions
Acknowledgements
References










Speech Communication
Volume 36, Issues 3-4, March 2002, Pages 305-315
 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.