Vocal indicators of psychoactive drug effects

doi:10.1016/0167-6393(84)90019-0

Speech Communication

Volume 3, Issue 3, December 1984, Pages 245-252

https://doi.org/10.1016/0167-6393(84)90019-0 Get rights and content

Abstract

Vocal indicators of psychoactive drug effects were investigated in two experiments with normal subjects. In the first study antidepressants were administered, and in the second antihypertonics. Speech samples were collected at different stages during medication. Based on the known muscle-relaxing effect of these drugs, a decrease in average fundamental frequency (F0) and an increase in the proportion of spectral energy below 500 Hz (LFR) were expected. LFR showed the expected effect in both experiments. F0, on the other hand, only decreased in the first experiment, while in the second it showed a slight tendency to increase. The suitability of these two measures as indicators of emotional state is discussed.

Zusammenfassung

In zwei Experimenten wurde an normalen Versuchpersonen die Wirkung psychoaktiver Medikamente untersucht. In der ersten Studie wurden Antidepressiva, in der zweiten Studie Antihypertonika verabreicht. Zu unterschiedlichen Zeitpunkten in bezug auf die Einnahme wurden Sprechproben erhoben. Ausghend von der bereits bekannten muskelentspannenden Wirkung der untersuchten Präparate wurde ein Absinken der mittleren Grundfrequenz (F0) und ein Anstieg des Anteils spektraler Energie unterhalb von 500 Hz (LFR) hypostasiert. Der LFR zeigt den erwarteten Effekt in beiden Experimenten. Das Absinken der F0 war nur im ersten Experiment zu beobachten, im zweiten Experiment trat stattdessen eine leichte Tendenz zu einem Anstieg auf. Die Brauchbarkeit der beiden Maße als stimmliche Indikatoren emotionaler Zustände wird diskutiert.

Résumé

Deux expériences ont été realisées en vue d'étudier avec des sujets normaux les effets de substances psychoactives. Dans la première étude, la substance administrée était antidépressive, dans la seconde, antihypertonique. Des échantillons de parole ont été recuellis à différentes étapes du traitement. A partir de l'effet connu de ces subtances sur le relâchement musculaire, on peut prédire une diminution globale de la fréquence fondamentale (F0), et, à l'inverse, une augmentation de la proportion d'énergie spectrale au-dessus de 500 Hz (LFR). Le LFR montre l'effet attendu dans les deux expériences. La F0, par contre, décroît dans la première expérience mais tend à augmenter légèrement dans la deuxième expérience. La validité de ces deux mesures comme indices de l'état émotionnel est discutée.

References (14)

F.J. Tolkmitt et al.
Vocal indicators of psychiatric treatment effects in depressives and schizophrenics
J. Commun. Disorders
(1982)
D. Goerlitz
Ergebnisse und Probleme der ausdruckspsychologischen Sprechstimmforschung
(1972)
J. Laver
Individual features in voice quality
E. Gellhorn et al.
Autonomic nervous systems in psychiatric disorder
R.B. Malmo
On Emotions, Needs, and our Archaic Brain
K.R. Scherer
Nonlinguistic vocal indicators of emotion and psychopathology
E. Eriksoo et al.
Chemistry and pharmacology of a new potential antidepressant
Arzneim.-Forsch. (Drug Res.)
(1979)

There are more references available in the full text version of this article.

Cited by (11)

Vocal communication of emotion: A review of research paradigms
2003, Speech Communication
The current state of research on emotion effects on voice and speech is reviewed and issues for future research efforts are discussed. In particular, it is suggested to use the Brunswikian lens model as a base for research on the vocal communication of emotion. This approach allows one to model the complete process, including both encoding (expression), transmission, and decoding (impression) of vocal emotion communication. Special emphasis is placed on the conceptualization and operationalization of the major elements of the model (i.e., the speaker’s emotional state, the listener’s attribution, and the mediating acoustic cues). In addition, the advantages and disadvantages of research paradigms for the induction or observation of emotional expression in voice and speech and the experimental manipulation of vocal cues are discussed, using pertinent examples drawn from past and present research.
Der Aufsatz gibt einen umfassenden Überblick über den Forschungsstand zum Thema der Beeinflussung von Stimme und Sprechweise durch Emotionen des Sprechers. Allgemein wird vorgeschlagen, die Forschung zur vokalen Kommunikation der Emotionen am Brunswik’schen Linsenmodell zu orientieren. Dieser Ansatz erlaubt den gesamten Kommunikationsprozess zu modellieren, von der Enkodierung (Ausdruck), über die Transmission (Übertragung), bis zur Dekodierung (Eindruck). Besondere Aufmerksamkeit gilt den Problemen der Konzeptualisierung und Operationalisierung der zentralen Elemente des Modells (z.B., dem Emotionszustand des Sprechers, den Inferenzprozessen des Hörers, und den zugrundeliegenden vokalen Hinweisreizen). Anhand ausgewählter Beispiele empirischer Untersuchungen werden die Vor- und Nachteile verschiedener Forschungsparadigmen zur Induktion und Beobachtung des emotionalen Stimmausdrucks sowie zur experimentellen Manipulation vokaler Hinweisreize diskutiert.
L’état actuel de la recherche sur l’effet des émotions d’un locuteur sur la voix et la parole est décrit et des approches prometteuses pour le futur identifiées. En particulier, le modèle de perception de Brunswik (dit “de la lentille” est proposé) comme paradigme pour la recherche sur la communication vocale des émotions. Ce modèle permet la modélisation du processus complet, de l’encodage (expression) par la transmission au décodage (impression). La conceptualisation et l’opérationalization des éléments centraux du modèle (l’état émotionnel du locuteur, l’inférence de cet état par l’auditeur, et les indices auditifs) sont discuté en détail. De plus, en analysant des exemples de la recherche dans le domaine, les avantages et désavantages de différentes méthodes pour l’induction et l’observation de l’expression émotionnelle dans la voix et la parole et pour la manipulation expérimentale de différents indices vocaux sont évoqués.
Time- and spectrum-related variabilities in stressed speech under laboratory and real conditions
1996, Speech Communication
Stress induced by various types of situation leads to vocal signal modifications. Previous studies have indicated that stressed speech is associated with a higher fundamental frequency and noticeable changes in vowel spectrum. This paper presents pitch- and spectral-based analyses of stressed speech corpora drawn from both artificial and real situations. The laboratory corpus is obtained by means of the Stroop test, the real-case corpus is extracted from the Cockpit Voice Recording of a crashed aeroplane. Analyses relative to pitch are presented and an index of microprosodic variation, μ, is introduced. Spectrum-related indicators of stress are issued from a cumulative histogram of sound level and from statistical analyses of formant frequencies. Distances to the F1-F2-F3 centre are also investigated. All these variations, throughout the two different situations, show the direct link between some new vocal parameters and stress appearances. The results confirm the validity of laboratory experiments on stress, but emphazise quantitative as well as qualitative differences between the situations and the speakers involved.
Der durch verschiedene Situationstypen hervorgerufene Streβ führt zu Veränderungen des Stimmsignals. Vorhergehende Untersuchungen haben gezeigt, daβ die gestresste Stimme durch eine höhere Grundfrequenz und Schwankungen im Vokalspektrum gekennzeichnet ist. Der folgende Text stellt die gemeinsamen Analysen dieser beiden Parameter anhand des in realen und künstlichen Situationen gestressten Stimmkörpers dar. Der Stimmkörper des Labors ist der des Strooptestes, und der, der realen Situation ist der Auszug einer Cockpitgesprächsaufnahme eines verunglückten Flugzeuges. Die Grundfrequenz ist makroskopisch untersucht und ein Index μ für die mikroprosodischen Veränderungen eingeführt worden. Die Ludikatonen des Streβspektrums sind von einem Histogramm abgeleitet, das den Geräuschpegel und statistische Analysen der Formantfrequenzen verbindet. Die Abstände F1-F2-F3 im Zentrum sind ebenfalls untersucht worden. All diese Abweichungen zeigen in Bezug auf die zwei Situationen, daβ ein direkter Zusammenhang zwischen den neuen Parametern des Stimmsignals und dem Erscheinen des Streβ besteht. Die Ergebnisse bestätigen die Gültigkeit der Laborexperimente, aber zeigen auch quantitative und qualitative Unterschiede zwischen den Situationen und den betreffenden Sprechern auf.
Le stress provoqué par divers types de situations conduit à des modifications du signal vocal. Des études précédentes ont indiqué que la parole stressée est caractérisée par une fréquence fondamentale plus élevée et des altérations du spectre des voyelles. Cet article présente les analyses conjointes de ces deux paramètres à partir de corpus de parole stressée obtenus à la fois dans une situation réelle et dans une situation artificielle. Le corpus de laboratoire est celui du test de Stroop et le corpus de la situation réelle est extráit d'un enregistreur des conversations d'un avion accidenté. La fréquence fondamentale est étudiée macroscopiquement et un index μ de la variation microprosodique est introduit. Les indicateurs spectraux du stress résultent d'un histogramme cumulé du niveau sonore et d'analyses statistiques des fréquences des formants. Les distances par rapport au centre F1-F2-F3 sont aussi étudiées. Toutes ces variations, à travers les deux situations, montrent un lien direct entre certains nouveaux paramètres du signal vocal et les apparitions du stress. Les résultats confirment la validité des expérimentations de laboratoire, mais mettent également en évidence des différences quantitatives aussi bien que qualitatives entre les situations et les locuteurs concernés.
Speaking behavior and voice sound characteristics in depressive patients during recovery
1993, Journal of Psychiatric Research
Based on a sample of 30 depressive patients, we have investigated the time course of recovery from depression in so far as this time course was assessable through changes in psychopathology syndrome scores and through changes in speaking behavior and voice sound characteristics. Specifically, our study design provided 6 repeated assessments over 2 weeks and at a fixed time in the morning each Monday, Wednesday and Friday, plus a final assesment at the patients' releases from hospital. Thus, we were able to determine the degree to which single- parameter approaches to speaking behavior and voice sound characteristics reflect the individual time course of recovery from depression. In this context, we could rely upon a calibration sample with repeated assessments on 192 healthy volunteers which yielded all necessary information concerning reproductibility and sensitivity of speech parameters.
Our analysis revealed several prominent features of speaking behavior and voice sound characteristics to be closely related to the time course of recovery from depression. In particular, the parameters “F0-amplitude”, “F0-6db-bandwidth” and “F0-contour” which assess important characteristics of a speaker's voice timbre, as well as the parameters “energy” and “dynamics” which assess a speaker's mean loudness and the variation of loudness over time, displayed consistently high correlations with depression syndromes. Moreover, the results of single-case analysis turned out to be in remarkable accordance with those of the cross-sectional one: in almost two-thirds of patients there existed a significant relationship over time between the global depression scores and major speech parameters.
As to the remaining one-third of patients who did not fit the picture of high correlations between psychopathology and speech parameters, we found an overproportionally large number of non-improvers characterized by irregular patterns of slight improvement with subsequent deterioration, or of deterioration followed by slight improvement. In other words, one-third of patients displayed time courses of depression whose psychopathology is difficult to assess through standard exploration techniques. Accordingly, it is not clear whether the observed lack of correlation in these patients is due to insufficient data or to an actual disordance between the time development of psychopathology and that of speech parameters.
Vocal Assessment of Affective Disorders
2013, Depression and Expressive Behavior
The effect of use of drugs on speaker's fundamental frequency and formants
2012, 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012
Foundations of Voice Studies: An Interdisciplinary Approach to Voice Production and Perception
2011, Foundations of Voice Studies: An Interdisciplinary Approach to Voice Production and Perception

View all citing articles on Scopus

View full text

Vocal indicators of psychoactive drug effects

Abstract

Zusammenfassung

Résumé

J. Commun. Disorders

Ergebnisse und Probleme der ausdruckspsychologischen Sprechstimmforschung

Individual features in voice quality

Autonomic nervous systems in psychiatric disorder

On Emotions, Needs, and our Archaic Brain

Nonlinguistic vocal indicators of emotion and psychopathology

Chemistry and pharmacology of a new potential antidepressant

Arzneim.-Forsch. (Drug Res.)