ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
advertisementadvertisement
Computer Speech & Language
Volume 20, Issue 4, October 2006, Pages 589-608
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Article
Purchase PDF (235 K)

 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/j.csl.2005.10.001    How to Cite or Link Using DOI (Opens New Window)
Copyright © 2005 Elsevier Ltd All rights reserved.

Morphology-based language modeling for conversational Arabic speech recognition

Katrin Kirchhoffa, Corresponding Author Contact Information, E-mail The Corresponding Author, Dimitra Vergyrib, E-mail The Corresponding Author, Jeff Bilmesa, E-mail The Corresponding Author, Kevin Duha, E-mail The Corresponding Author and Andreas Stolckeb, E-mail The Corresponding Author

aDepartment of Electrical Engineering, University of Washington, Box 352500, Seattle, WA 98195-2500, USA bSRI International, Menlo Park, CA 94720, USA

Received 26 May 2004; 
revised 19 March 2005; 
accepted 5 October 2005. 
Available online 2 November 2005.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

Language modeling for large-vocabulary conversational Arabic speech recognition is faced with the problem of the complex morphology of Arabic, which increases the perplexity and out-of-vocabulary rate. This problem is compounded by the enormous dialectal variability and differences between spoken and written language. In this paper, we investigate improvements in Arabic language modeling by developing various morphology-based language models. We present four different approaches to morphology-based language modeling, including a novel technique called factored language models. Experimental results are presented for both rescoring and first-pass recognition experiments.

Article Outline

1. Introduction
2. Linguistic properties of Arabic
3. Data
4. Morphology-based language models
4.1. Previous work
4.2. Models for Arabic
4.2.1. Particle model
4.2.2. Class-based models
4.2.3. Stream models
4.2.4. Factored language models
5. Recognition system
6. Rescoring experiments
7. Automatic parameter search in factored language models
7.1. Parameter search using GAs
7.2. GA experiments and parameter search results
8. First-pass recognition with factored language models
9. Conclusions
Acknowledgements
Appendix A
References









Computer Speech & Language
Volume 20, Issue 4, October 2006, Pages 589-608
 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.