Abstract
This study investigates the influence of the speech quality of Embodied Conversational Agents (ECAs) on users’ perception, behavior and emotions. Twenty-four subjects interacted in a Wizard of Oz (WOZ) setup with two ECAs in two scenarios of a virtual theater partner application. In both scenarios, each ECA had three different speech qualities: natural, high-quality synthetic and low-quality synthetic. Eye gaze data show that subjects’ visual attention was not influenced by ECA’s speech quality, but by their look. On the other hand, subjects’ self-report of emotions and verbal descriptions of their perceptions were influenced by ECAs’ speech quality. Finally, Galvanic Skin Response data were neither influenced by ECAs’ look, nor by their speech quality. These results stress the importance of the correct matching of the auditory and visual modalities of ECAs and give methodological insights for the assessment of user’s perception, behavior and emotions when interacting with virtual characters.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Campbell, N.: Specifying Affect and Emotion for Expressive Speech Synthesis. In: Gelbukh, A. (ed.) CICLing 2004. LNCS, vol. 2945, pp. 395–406. Springer, Heidelberg (2004)
Cassell, J., Sullivan, J., Prevost, S., Churchill, E. (eds.): Embodied conversational agents. MIT Press, Cambridge (2000)
Chateau, N., Mersiol, M.: AMUSE: A tool for evaluating affective interfaces. In: Proc. CHI 2005, Workshop Innovative approaches to evaluating affective interfaces (2005)
Darves, C., Oviatt, S., Coulston, R.: The Impact of Auditory Embodiment on Animated Character Design. In: Proceedings of the International Joint Conference on Autonomous Agents and Multi-Agent Systems, Embodied Agents Workshop (2002)
Desmet, P.M.A., Hekkert, P., Jacobs, J.J.: When a car makes you smile: Development and application of an instrument to measure product emotions. In: Hoch, S.J., Meyer, R.J. (eds.) Advances in Consumer Research, vol. 27, pp. 111–117 (2000)
Höök, K., Sengers, P., Andersson, G.: Sense and Sensibility: Evaluation and Interactive Art Computer Human Interaction, Fort Lauderdale (2003)
Jones, M.R., Yee, W.: Attending to auditory events: the role of temporal organization. In: McAdams, S., Bigand, E. (eds.) Thinking in sound: The cognitive psychology of human audition, pp. 69–112. Oxford University Press, New York (1993)
Lockhart, R.: Interrelations between amplitude, latency, rise time and the edelberg recovery measure of the galvanic skin response. Psychophysiology 9(4), 437–442 (1967)
Mersiol, M., Chateau, N., Maffiolo, V.: Talking Heads: Which Matching between Faces and Synthetic Voices? In: Proc. Fourth IEEE International Conference on Multimodal Interfaces, pp. 69–74 (2002)
Narayanan, S., Alwan, A.: Text to Speech Synthesis: New Paradigms and Advances. Prentice Hall PTR, New Jersey (2004)
Nijholt, A.: Towards multi-modal emotion display in embodied agents. In: Proceedings of the 5th Biannual Conference on Artificial Neural Networks and Expert Systems, Dunedin, New Zealand (2001)
Pelé, D., Breton, G., Panaget, F., Loyson, S.: Let’s find a restaurant with Nestor. A 3D embodied conversational agent on the Web. In: AAMAS Workshop on embodied conversational characters as individuals, Australia (2003)
Polkowsky, M., Lewis, J.: Expanding the MOS: Development and psychometric evaluation of the MOS-R and the MOS-X. International Journal of Speech Technology 6, 161–182 (2003)
Rayner, K.: Eye movements in reading and information processing: 20 years of research. Psychologica Bulletin 124, 372–422 (1998)
Ruttkay, Z., Doorman, C., Noot, H.: Evaluating ECAs - What and how? In: Proc. of the AAMAS 2002 Workshop on ’Embodied conversational agents - let’s specify and evaluate them!’, Bologna, Italy (2002)
Ruttkay, Z., Pelachaud, C. (eds.): From Brows to Trust: Evaluating Embodied Conversational Agents. Kluwer Academic Publishers, Dordrecht (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chateau, N., Maffiolo, V., Pican, N., Mersiol, M. (2005). The Effect of Embodied Conversational Agents’ Speech Quality on Users’ Attention and Emotion. In: Tao, J., Tan, T., Picard, R.W. (eds) Affective Computing and Intelligent Interaction. ACII 2005. Lecture Notes in Computer Science, vol 3784. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11573548_84
Download citation
DOI: https://doi.org/10.1007/11573548_84
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29621-8
Online ISBN: 978-3-540-32273-3
eBook Packages: Computer ScienceComputer Science (R0)