Abstract
Speech recognition technology has been around for several decades now, and a considerable amount of applications have been developed around this technology. However, the current state of the art of speech recognition systems still generate errors in the recognizer’s output. Techniques to automatically detect and even correct speech transcription errors have emerged. Due to the complexity of the problem, these error detection approaches have failed to ensure both a high recall and a precision ratio. The goal of this paper is to present an approach that combines several error detection techniques to ensure a better classification rate. Experimental results have proven that such an approach can indeed improve on the current state of the art of automatic error detection in speech transcription.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Shi, Y.: An investigation of linguistic information for speech recognition error detection, Ph.D. thesis, University of Maryland (2008)
Patwardhan, S., Pedersen, T.: Using wordnet-based context vectors to estimate the semantic relatedness of concepts. In: EACL 2006 Workshop Making Sense of Sense - Bringing Computational Linguistics and Psycholinguistics Together, pp. 1–8 (April 2006)
Cox, S., Dasmahapatra, S.: High-level approaches to confidence estimation in speech recognition. IEEE Transactions on Speech and Audio Processing 10(7), 460–471 (2002)
Inkpen, D., Desilets, A.: Semantic similarity for detecting recognition errors in automatic speech transcripts. In: HLT/EMNLP, pp. 49–56 (2005)
Voll, K.D.: A Methodology of error detection: Improving speech recognition in radiology, Ph.D. thesis, Simon Fraser University (2006)
Abida, K., Karray, F., Abida, W.: cROVER: Improving ROVER using automatic error detection. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP (May 2011)
Abida, K., Karray, F., Abida, W.: mproving rover using latent semantic indexing-based error detection. In: IEEE International Conference on Multimedia and Expo, ICME (July 2011)
Fiscus, J., et al.: 1997 english broadcast news speech (HUB4). LDC, Philadelphia (1998)
Brants, T., Franz, A.: Web 1T 5-gram version 1. LDC, Philadelphia (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Abida, K., Abida, W., Karray, F. (2011). Combination of Error Detection Techniques in Automatic Speech Transcription. In: Kamel, M., Karray, F., Gueaieb, W., Khamis, A. (eds) Autonomous and Intelligent Systems. AIS 2011. Lecture Notes in Computer Science(), vol 6752. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21538-4_23
Download citation
DOI: https://doi.org/10.1007/978-3-642-21538-4_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21537-7
Online ISBN: 978-3-642-21538-4
eBook Packages: Computer ScienceComputer Science (R0)