ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
advertisementadvertisement
Speech Communication
Volume 49, Issue 9, September 2007, Pages 743-759
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Article
Purchase PDF (675 K)

 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/j.specom.2007.05.001    How to Cite or Link Using DOI (Opens New Window)
Copyright © 2007 Elsevier B.V. All rights reserved.

Improving the intelligibility of dysarthric speech

Alexander B. Kaina, Corresponding Author Contact Information, E-mail The Corresponding Author, E-mail The Corresponding Author, John-Paul Hosoma, Xiaochuan Niua, Jan P.H. van Santena, Melanie Fried-Okenb and Janice Staehelyb

aCenter for Spoken Language Understanding, OGI School of Science & Engineering, Oregon Health & Science University, Portland, OR, USA bDepartments of Neurology and Otolaryngology, Oregon Institute on Disability and Development, Oregon Health & Science University, Portland, OR, USA

Received 29 July 2006; 
revised 18 April 2007. 
Available online 16 May 2007.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

Dysarthria is a speech motor disorder usually resulting in a substantive decrease in speech intelligibility by the general population. In this study, we have significantly improved the intelligibility of dysarthric vowels of one speaker from 48% to 54%, as evaluated by a vowel identification task using 64 CVC stimuli judged by 24 listeners. Improvement was obtained by transforming the vowels of a speaker with dysarthria to more closely match the vowel space of a non-dysarthric (target) speaker. The optimal mapping feature set, from a list of 21 candidate feature sets, proved to be one utilizing vowel duration and F1–F3 stable points, which were calculated using shape-constrained isotonic regression. The choice of speaker-specific or speaker-independent vowel formant targets appeared to be insignificant. Comparisons with “oracle” conditions were performed in order to evaluate the analysis/re-synthesis system independently of the transformation function.

Keywords: Speech processing; Speech transformation; Speech modification; Intelligibility; Dysarthria

Article Outline

1. Introduction
2. Preliminary experiments
3. Method
3.1. Overview
3.2. Database design and recording
3.2.1. CVC Database
3.2.2. Vowel-target database
3.3. Analysis
3.4. Transformation
3.4.1. Input and output features
3.4.2. Training
3.4.3. Objective evaluation
3.5. Synthesis
3.5.1. Feature modification and generation
3.5.2. Formant synthesis
4. Evaluation
4.1. Stimuli
4.2. Test administration
4.3. Results and discussion
5. Conclusion
Acknowledgements
References







Speech Communication
Volume 49, Issue 9, September 2007, Pages 743-759
 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.