ABSTRACT
The Ambient Spotlight is a prototype system based on personal meeting capture using a laptop and a portable microphone array. The system automatically recognises and structures the meeting content using automatic speech recognition, topic segmentation and extractive summarisation. The recognised speech in the meeting is used to construct queries to automatically link meeting segments to other relevant material, both multimodal and textual. The interface to the system is constructed around a standard calendar interface, and it is integrated with the laptop's standard indexing, search and retrieval.
- P. Garner, J. Dines, T. Hain, A. El Hannani, M. Karafiat, D. Korchagin, M. Lincoln, V. Wan, and L. Zhang. Real-time ASR from meetings. In Proc. Interspeech, 2009.Google Scholar
- P.-Y. Hsueh, J. D. Moore, and S. Renals. Automatic segmentation of multiparty dialogue. In Proc. EACL06, 2006.Google Scholar
- J. Kilgour, J. Carletta, and S. Renals. The Ambient Spotlight: Queryless desktop search from meeting speech. In Proc SSCS - ACM Multimedia Workshop on Searching Spontaneous Conversational Speech, 2010. Google ScholarDigital Library
- A. Popescu-Belis, P. Poller, J. Kilgour, E. Boertjes, J. Carletta, S. Castronovo, M. Fapso, M. Flynn, A. Nanchen, T. Wilson, J. de Wit, and M. Yazdani. A multimedia retrieval system using speech input. In Proc. ACM ICMI-MLMI, pages 223--224, 2009. Google ScholarDigital Library
- S. Renals. Recognition and understanding of meetings. In Proc. NAACL/HLT, 2010. Google ScholarDigital Library
- A. Waibel and R. Stiefelhagen. Computers in the Human Interaction Loop. Springer, 2009. Google ScholarDigital Library
Index Terms
- The Ambient Spotlight: personal multimodal search without query
Recommendations
The ambient spotlight: queryless desktop search from meeting speech
SSCS '10: Proceedings of the 2010 international workshop on Searching spontaneous conversational speechIt has recently become possible to record any small meeting using a laptop equipped with a plug-and-play USB microphone array. We show the potential for such recordings in a personal aid that allows project managers to record their meetings and, when ...
On the perception of "segmental intonation": F0 context effects on sibilant identification in German
In normal modally voiced utterances, voiceless fricatives like [s], [ź], [f], and [x] vary such that their aperiodic pitch impressions mirror the pitch level of the adjacent F0 contour. For instance, if the F0 contour creates a high or low pitch context,...
Psycho-acoustics inspired automatic speech recognition
AbstractUnderstanding the human spoken language recognition process is still a far scientific goal. Nowadays, commercial automatic speech recognisers (ASRs) achieve high performance at recognising clean speech, but their approaches are poorly ...
Highlights- We propose a novel Automatic Speech Recognizer inspired by psycho-acoustic studies.
Comments