Combination of Error Detection Techniques in Automatic Speech Transcription

Abida, Kacem; Abida, Wafa; Karray, Fakhri

doi:10.1007/978-3-642-21538-4_23

Combination of Error Detection Techniques in Automatic Speech Transcription

Kacem Abida²²,
Wafa Abida²³ &
Fakhri Karray²²

Conference paper

2150 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6752))

Abstract

Speech recognition technology has been around for several decades now, and a considerable amount of applications have been developed around this technology. However, the current state of the art of speech recognition systems still generate errors in the recognizer’s output. Techniques to automatically detect and even correct speech transcription errors have emerged. Due to the complexity of the problem, these error detection approaches have failed to ensure both a high recall and a precision ratio. The goal of this paper is to present an approach that combines several error detection techniques to ensure a better classification rate. Experimental results have proven that such an approach can indeed improve on the current state of the art of automatic error detection in speech transcription.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Shi, Y.: An investigation of linguistic information for speech recognition error detection, Ph.D. thesis, University of Maryland (2008)
Google Scholar
Patwardhan, S., Pedersen, T.: Using wordnet-based context vectors to estimate the semantic relatedness of concepts. In: EACL 2006 Workshop Making Sense of Sense - Bringing Computational Linguistics and Psycholinguistics Together, pp. 1–8 (April 2006)
Google Scholar
Cox, S., Dasmahapatra, S.: High-level approaches to confidence estimation in speech recognition. IEEE Transactions on Speech and Audio Processing 10(7), 460–471 (2002)
Article Google Scholar
Inkpen, D., Desilets, A.: Semantic similarity for detecting recognition errors in automatic speech transcripts. In: HLT/EMNLP, pp. 49–56 (2005)
Google Scholar
Voll, K.D.: A Methodology of error detection: Improving speech recognition in radiology, Ph.D. thesis, Simon Fraser University (2006)
Google Scholar
Abida, K., Karray, F., Abida, W.: cROVER: Improving ROVER using automatic error detection. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP (May 2011)
Google Scholar
Abida, K., Karray, F., Abida, W.: mproving rover using latent semantic indexing-based error detection. In: IEEE International Conference on Multimedia and Expo, ICME (July 2011)
Google Scholar
Fiscus, J., et al.: 1997 english broadcast news speech (HUB4). LDC, Philadelphia (1998)
Google Scholar
Brants, T., Franz, A.: Web 1T 5-gram version 1. LDC, Philadelphia (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

Electrical and Computer Engineering Dept., University of Waterloo, Ontario, Canada
Kacem Abida & Fakhri Karray
Voice Enabling Systems Technology Inc., Waterloo, Ontario, Canada
Wafa Abida

Authors

Kacem Abida
View author publications
You can also search for this author in PubMed Google Scholar
Wafa Abida
View author publications
You can also search for this author in PubMed Google Scholar
Fakhri Karray
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Electrical and Computer Engineering, University of Waterloo, N2L 3G1, Waterloo, ON, Canada
Mohamed Kamel & Fakhri Karray &
School of Information Technology and Engineering (SITE), University of Orrawa, 800 King Edward Avenue, K1N 6N5, Ottawa, ON, Canada
Wail Gueaieb
Faculty of Engineering and Material Science, Department of Mechatronics Engineering, German University in Egypt, Main Entrance El Tagamoa El Khames, New Cairo City, Egypt
Alaa Khamis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Abida, K., Abida, W., Karray, F. (2011). Combination of Error Detection Techniques in Automatic Speech Transcription. In: Kamel, M., Karray, F., Gueaieb, W., Khamis, A. (eds) Autonomous and Intelligent Systems. AIS 2011. Lecture Notes in Computer Science(), vol 6752. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21538-4_23

Download citation

DOI: https://doi.org/10.1007/978-3-642-21538-4_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21537-7
Online ISBN: 978-3-642-21538-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics