Skip to main content

Prosody Enhances Cognitive Infocommunication: Materials from the HuComTech Corpus

  • Chapter
  • First Online:

Part of the book series: Intelligent Systems Reference Library ((ISRL,volume 105))

Abstract

The multimodal HuComTech corpus aims at annotating, studying and publishing data related to a wide spectrum of markers of human behavior in human-human spoken dialogues. By doing so the final goal is to both understand human cognitive behavior in conversational settings and contribute to the enhancement of human-machine interaction systems. One of the main issues still leaving wide spaces for further development is related to speech prosody, the understanding of its association with possible cognitive processes for the expression of emotions as well as the online production of speech utterances. Since the latter often results in incomplete structures, the study of the relation between grammatical incompleteness and prosody can both contribute to a better understanding of human cognition and the enhancement of cognitive infocommunication systems. The data and analyses presented in this paper are intended to serve both these purposes. Two different approaches will be presented as methods of data exploration: the study of static temporal alignments within the ELAN annotation tool, and the discovery of dynamic temporal patterns using the Theme framework.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    https://hdl.handle.net/1839/00-0000-0000-001A-E17C-1@view

References

  1. Sagisaka Y, Campbell N, Higuchi N (eds) (1996) Computing prosody: computational models for processing spontaneous speech. Springer, New York

    Google Scholar 

  2. Rajeswari KC, Uma Maheswari P (2012) Prosody modeling techniques for text-to-speech synthesis systems—A survey. Int J Comput Appl (0975–8887) 39(16):8

    Google Scholar 

  3. Teixeira JP (2012) Prosody generation model for TTS systems: segmental durations and F0 contours with fujisaki model. LAP LAMBERT Academic Publishing

    Google Scholar 

  4. Chaloupka Z, Hork P (2012) Prosody modelling for TTS systems using statistical methods. In: Cognitive behavioural systems, COST 2102 International training school, Dresden, Germany, February 21–26, 2011. Revised Selected Papers, Springer, Heidelberg, pp 174–183

    Google Scholar 

  5. Roy BC, Frank MC, Roy D (2012) Relating activity contexts to early word learning in dense longitudinal data. In: Proceedings of the 34th annual meeting of the cognitive science society. Sapporo, 2012

    Google Scholar 

  6. Baranyi P, Csapo A (2012) Definition and synergies of cognitive infocommunications. Acta Polytech Hung 9(1):67–83

    Google Scholar 

  7. Sallai G (2012) Defining infocommunications and related terms. Acta Polytech Hung 9(6):5–15

    Google Scholar 

  8. Baranyi P, Csapo A, Varlaki P (2014) An overview of research trends in coginfocom. In: IEEE International conference on intelligent engineering systems, Tihany, pp 181–186

    Google Scholar 

  9. Hunyadi L (2011) Multimodal human-computer interaction technologies. Theoretical modeling and application in speech processing, Argumentum 7, pp 240–260

    Google Scholar 

  10. Ekman P, Friesen W (1978) Facial action coding system: a technique for the measurement of facial movement. Consulting Psychologists Press, Palo Alto

    Google Scholar 

  11. Hunyadi L, Incompleteness and fragmentation in spoken language syntax and its relation to prosody and gesturing: cognitive processes versus possible formal cues. Knowledge-based information systems in practice. Springer (to appear)

    Google Scholar 

  12. Szekrnyes I (2014) Annotation and interpretation of prosodic data in the HuComTech corpus for multimodal user interfaces. J Multimodal User Interfaces 8(2):143–150

    Article  Google Scholar 

  13. Magnusson MS (1996) Hidden real-time patterns in intra- and inter-individual behavior: description and detection. Eur J Psychol Assess 12(2):112–123

    Article  Google Scholar 

  14. Ladd DR (1996) Intonational phonology. Cambridge University Press, Cambridge

    Google Scholar 

  15. Edlund J, Heldner M, Hirschberg J (2009) Pause and gap length in face-to-face interaction. In: Proceedings of Interspeech 2009, Brighton

    Google Scholar 

  16. Hunyadi L (2010) Cognitive grouping and recursion in prosody. In: Hulst H van der (ed) Recursion and human language, de Guyter, Berlin & New York, pp 343–370

    Google Scholar 

  17. Hunyadi L (2002) Hungarian sentence prosody and universal grammar. Peter Lang, New York

    Google Scholar 

  18. Abuczki A (2011) A multimodal analysis of the sequential organization of verbal and nonverbal interaction. Argumentum 7:261–279

    Google Scholar 

  19. Hunyadi L (2010) Cognitive grouping and recursion in prosody. In: Hulst, Harry van der (ed) Recursion and human language. Studies in Generative Grammar [SGG] 104. de Gruyter Mouton, pp 343–370

    Google Scholar 

  20. Szekrenyes I (2015) ProsoTool, a method for automatic annotation of fundamental frequency. In: Cognitive Infocommunications (CogInfoCom), 2015 6th IEEE International Conference on, 19–21 Oct. 2015, Györ, IEEE 2015, pp 291–296

    Google Scholar 

Download references

Acknowledgments

Research presented in this paper was partly supported by project TÁMOP 4.2.2-C/11/1/KONV-2012 and OTKA NK116402. The work was made possible by the generous support of NeDiMAH (Network for Digital Methods in the Arts and Humanities), a cross-European project of the European Science Foundation.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Laszlo Hunyadi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Hunyadi, L., Szekrényes, I., Kiss, H. (2016). Prosody Enhances Cognitive Infocommunication: Materials from the HuComTech Corpus. In: Esposito, A., Jain, L. (eds) Toward Robotic Socially Believable Behaving Systems - Volume I . Intelligent Systems Reference Library, vol 105. Springer, Cham. https://doi.org/10.1007/978-3-319-31056-5_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-31056-5_10

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-31055-8

  • Online ISBN: 978-3-319-31056-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics