ACM Home Page
Please provide us with feedback. Feedback
Challenges in adopting speech recognition
Full text html formatHtml (27 KB), pdf formatPdf (160 KB)
Source Communications of the ACM archive
Volume 47 ,  Issue 1  (January 2004) table of contents
Multimodal interfaces that flex, adapt, and persist
SPECIAL ISSUE: Multimodal interfaces that flex, adapt, and persist table of contents
Pages: 69 - 75  
Year of Publication: 2004
ISSN:0001-0782
Authors
Li Deng  Microsoft Research, Redmond, WA
Xuedong Huang  Microsoft .NET Speech Technologies Group, Redmond, WA
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 28,   Downloads (12 Months): 228,   Citation Count: 6
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues   peer to peer  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/962081.962108
What is a DOI?

ABSTRACT

Although progress has been impressive, there are still several hurdles that speech recognition technology must clear before ubiquitous adoption can be realized. R&D in spontaneous and free-flowing speech style is critical to its success.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
DARPA's EARS Conference (Boston, MA, May 21--22, 2003).
 
2
DARPA's EARS Kickoff Meeting (Vienna, VA, May 9--10, 2002).
 
3
Datamonitor. Voice Automation---Past, Present, and Future. White Paper (July 2003).
 
4
Deng, L., and O'Shaughnessy, D. Speech Processing---A Dynamic and Optimization-Oriented Approach. Marcel Dekker, NY, 2003.
 
5
Deng, L. Wang, K., Acero, A., Hon, H., Droppo, J., Boulis, C., Wang, Y., Jacoby, D., Mahajan, M., Chelba, C., and Huang, X.D. Distributed speech processing in MiPad's multimodal user interface. IEEE Transactions on Speech and Audio 10 (2002), 605--619.
 
6
Furui, S. Recent progress in spontaneous speech recognition and understanding. In Proceedings of the IEEE Workshop on Multimedia Signal Processing (Dec. 2002).
 
7
Hirsch, H., and Pearce, D. The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy conditions. ISCA ITRW Workshop on Automatic Speech Recognition (Paris, 2000).
 
8
 
9
Neti, C., Iyengar, G., Potamianos, G., Senior, A., and Maison, B. Perceptual interfaces for information interaction: Joint processing of audio and visual information for human-computer interaction. In the ICSLP Proceedings 1. (Beijing, 2000), 11--14.
 
10
Oviatt, S. Breaking the robustness barrier: Recent progress on the design of robust multimodal systems. Advances in Computers. M. Zelkowitz, Ed. Academic Press, 2002, 305--341.
 
11
Zhang, Y. et al. Air- and bone-conductive integrated microphones for robust speech detection and enhancement. In Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding. (St. Thomas, U.S. Virgin Islands, Dec, 2003.)



Peer to Peer - Readers of this Article have also read: