ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
advertisementadvertisement
Computer Speech & Language
Volume 20, Issue 4, October 2006, Pages 420-440
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Article
Purchase PDF (512 K)

 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/j.csl.2005.05.003    How to Cite or Link Using DOI (Opens New Window)
Copyright © 2005 Elsevier Ltd All rights reserved.

Combining language models in the input interface of a spoken dialogue system

R. López-CózarCorresponding Author Contact Information, a, E-mail The Corresponding Author and Z. Callejasa, E-mail The Corresponding Author

aDepartment Languages and Computer Systems, Computer Science Faculty, Granada University, 18071 Granada, Spain

Received 4 February 2004; 
revised 9 December 2004; 
accepted 24 May 2005. 
Available online 22 June 2005.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

This paper presents a new technique to enhance the performance of the input interface of spoken dialogue systems based on a procedure that combines during speech recognition the advantages of using prompt-dependent language models with those of using a language model independent of the prompts generated by the dialogue system. The technique proposes to create a new speech recognizer, termed contextual speech recognizer, that uses a prompt-independent language model to allow recognizing any kind of sentence permitted in the application domain, and at the same time, uses contextual information (in the form of prompt-dependent language models) to take into account that some sentences are more likely to be uttered than others at a particular moment of the dialogue. The experiments show the technique allows enhancing clearly the performance of the input interface of a previously developed dialogue system based exclusively on prompt-dependent language models. But most important, in comparison with a standard speech recognizer that uses just one prompt-independent language model without contextual information, the proposed recognizer allows increasing the word accuracy and sentence understanding rates by 4.09% and 4.19% absolute, respectively. These scores are slightly better than those obtained using linear interpolation of the prompt-independent and prompt-dependent language models used in the experiments.

Article Outline

1. Introduction
1.1. Prompt-dependent and prompt-independent language models
2. The proposed technique: contextual speech recognizer
2.1. Previous related work
2.2. Word-networks
2.3. Word-class bigrams
2.4. Procedure to analyze word-networks
3. Experiments
3.1. Description of the corpora used in the experiments
3.2. Experimental results
3.2.1. Initial input interface for in-context and out-of-context analysis
3.2.2. Contextual speech recognizer for in-context and out-of-context analysis
3.2.2.1. Determination of the best value for the probability increment parameter
3.2.2.2. In-context and out-of-context results for the best value of the probability increment
3.2.2.3. Linear interpolation of prompt-independent and prompt-dependent language models for sentence analysis
3.3. Limitations of the technique proposed in this paper
4. Conclusions and future work
Acknowledgements
References







Computer Speech & Language
Volume 20, Issue 4, October 2006, Pages 420-440
 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.