Challenges in adopting speech recognition
Source
Communications of the ACM
Volume 47, Issue 1 (January 2004)
SPECIAL ISSUE: Multimodal interfaces that flex, adapt, and persist
Pages: 69-75
Year of Publication: 2004
ISSN: 0001-0782
Authors
Li Deng, Microsoft Research, Redmond, WA
Xuedong Huang, Microsoft .NET Speech Technologies Group, Redmond, WA
ABSTRACT
Although progress has been impressive, there are still several hurdles that speech recognition technology must clear before ubiquitous adoption can be realized. R&D in spontaneous and free-flowing speech style is critical to its success.
REFERENCES
1. DARPA's EARS Conference (Boston, MA, May 21-22, 2003).
2. DARPA's EARS Kickoff Meeting (Vienna, VA, May 9-10, 2002).
3. Datamonitor. Voice Automation: Past, Present, and Future. White Paper (July 2003).
4. Deng, L., and O'Shaughnessy, D. Speech Processing: A Dynamic and Optimization-Oriented Approach. Marcel Dekker, NY, 2003.
5. Deng, L., Wang, K., Acero, A., Hon, H., Droppo, J., Boulis, C., Wang, Y., Jacoby, D., Mahajan, M., Chelba, C., and Huang, X.D. Distributed speech processing in MiPad's multimodal user interface. IEEE Transactions on Speech and Audio 10 (2002), 605-619.
6. Furui, S. Recent progress in spontaneous speech recognition and understanding. In Proceedings of the IEEE Workshop on Multimedia Signal Processing (Dec. 2002).
7. Hirsch, H., and Pearce, D. The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy conditions. ISCA ITRW Workshop on Automatic Speech Recognition (Paris, 2000).
8.
9. Neti, C., Iyengar, G., Potamianos, G., Senior, A., and Maison, B. Perceptual interfaces for information interaction: Joint processing of audio and visual information for human-computer interaction. In the ICSLP Proceedings 1 (Beijing, 2000), 11-14.
10. Oviatt, S. Breaking the robustness barrier: Recent progress on the design of robust multimodal systems. Advances in Computers. M. Zelkowitz, Ed. Academic Press, 2002, 305-341.
11. Zhang, Y. et al. Air- and bone-conductive integrated microphones for robust speech detection and enhancement. In Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (St. Thomas, U.S. Virgin Islands, Dec. 2003).
CITED BY 6
Lee Hoi Leong, Shinsuke Kobayashi, Noboru Koshizuka, Ken Sakamura, CASIS: a context-aware speech interface system, Proceedings of the 10th international conference on Intelligent user interfaces, January 10-13, 2005, San Diego, California, USA
Maja Pantic, Alex Pentland, Anton Nijholt, Thomas Huang, Human computing and machine understanding of human behavior: a survey, Proceedings of the 8th international conference on Multimodal interfaces, November 02-04, 2006, Banff, Alberta, Canada