A Comparison of Recordings of Sentences and Spontaneous Speech: Perceptual and Acoustic Measures in Preschool Children’s Voices
Introduction
In a clinical setting, problems related to voice function are routinely assessed by perceptual evaluations of voice quality.1 The most common material for this assessment is a standardized recording, consisting of reading a text aloud, naming pictures or repeating sentences, and sustaining vowels depending on the patient’s population. The recordings are often carried out in a sound-treated booth aiming at high-quality recordings. Based on these recordings, a perceptual assessment of voice quality along different perceptual parameters is carried out.2, 3, 4 The result of the perceptual evaluation together with laryngeal status makes up the basis for decisions regarding intervention. Also, improvement in voice quality is one of the primary benchmarks against which treatment outcome is evaluated, commonly assessed by an evaluation along perceptual and acoustic parameters after completed intervention.
A dysfunctional voice may be a serious social and psychological problem for adults5 and children.6, 7 Many habits, including vocal habits, are probably established during childhood. Thus, undesirable vocal habits may originate during early childhood and continue into adult life.8, 9 This would point to the importance of voice research focusing on child’s voice and the treatment and prevention of voice disorders in children. This research also needs to include vocal behavior and vocal demands in children’s everyday life.
Recently, the importance of in situ recordings of natural vocal behavior in everyday life situations has been pointed out.10, 11, 12, 13 In a study of preschool teachers’ voices, Södersten et al14 compared mean fundamental frequency (F0) in a controlled recording to the F0 in the spontaneous speech during work. They found that the mean F0 was higher in the work-related recording compared with the controlled condition, indicating that a controlled recording may not reflect mean F0 in spontaneous speech under natural conditions. There are few studies of children’s voice use in a natural setting. In a recent study of mean F0 in children’s and teachers’ voices in a preschool setting, the results showed a significant difference for both children and adults between the recordings of sentences compared with real work/play situations.15 The findings support the conclusion that controlled setups are not suitable to evaluate F0 values in a natural setting. These findings have also been supported by two studies of preschool-aged children, a case study of a 5-year-old boy16 and a study comparing F0 in children at play compared with structured situations.17 The results indicate that studio recordings in a clinical setting need to be complemented by recordings in real-life situations and environments to correctly assess habitual F0 in children.
The question asked in the present study is, does this difference in vocal behavior regarding F0 also apply to other aspects of voice? Thus, children’s voice quality, mean F0, and perturbation in a controlled recording were compared with sentences obtained during regular activities at the day-care center (DCC).
Section snippets
Subjects
Recordings from eleven 5-year-old children in a previous study on environmental factors contributing to voice problems in children were selected.18 The children had no history of hearing or speech problems, or frequent ear, nose, and throat infections. No initial survey of voice quality was made. The children attended three DCCs in a city with approx. 135,000 inhabitants at the time of the data collection, situated 200 km south of Stockholm in Sweden. An informed consent form was signed by the
Results
Interrater agreement for the perceptual evaluation was calculated using a Spearman’s rho correlation. The agreement between judges was satisfactory. For the controlled sentences, the agreement varied between rho = 0.81 and 0.89 for the different parameters with the highest agreement for the parameter hyperfunction. For the spontaneous speech sentences, the agreement was somewhat higher varying between rho = 0.90 and 0.95 with the highest value for the parameter hoarseness. The perceptual evaluation
Discussion
In the present study, the relationship between acoustic measures and a perceptual evaluation of controlled recordings of repeated sentences, and sentences selected from spontaneous speech were investigated. The data were obtained from recordings of 11 children on the same day and in the same environment. Selected samples were chosen to be as similar as possible. Thus, sections with shouting or obviously elevated F0 were disregarded to avoid clear differences in the compared samples. Comparisons
Conclusion
The evaluation of voice quality, F0, and perturbation in standard sentences and sentences selected from spontaneous speech was compared. A total of 62 samples from 11 children were analyzed. Data showed a correlation between the standard sentences and sentences selected from spontaneous speech for the voice quality parameter hoarseness only. F0 was significantly higher in spontaneous speech. For boys, there was a correlation across speech tasks for the parameters breathiness and perturbation
Acknowledgment
Valuable comments regarding the statistical analyses were gratefully received from Örjan Dahlström, PhD, Linköping University.
References (29)
- et al.
Test-retest study of the GRBAS scale: influence of experience and professional background on perceptual rating of voice quality
J Voice
(1997) - et al.
Assessing outcomes for dysphonic patients
J Voice
(1998) - et al.
Effects of family therapy on children’s voices
J Voice
(2005) - et al.
Pediatric Voice Handicap Index (pVHI): a new tool for evaluating pediatric dysphonia
Int J Pediatr Otorhinolaryngol
(2007) - et al.
A longitudinal study of the prevalence of voice disorders in children from a rural school division
J Commun Disord
(1989) - et al.
Cancellation of simulated environmental noise as a tool for measuring vocal performance during noise exposure
J Voice
(2002) - et al.
Vocal behaviour and vocal loading factors for pre-school teacher at work studied with binaural DAT recordings
J Voice
(2002) - et al.
Mean F0 values obtained through standard phrase pronunciation compared with values obtained from the normal work environment: a study on teacher and child voices performed in a preschool environment
J Voice
(2010) A comparison of a child’s fundamental frequencies in structured elicited vocalizations versus unstructured natural vocalizations: a case study
Int J Pediatr Otorhinolaryngol
(2009)- et al.
Investigation of habitual pitch during freeplay activities for preschool-aged children
Int J Pediatr Otorhinolaryngol
(2009)
Child voice and noise: a pilot study of the effect of a day at the day-care on ten children’s voice quality according to perceptual evaluation
J Voice
Relations between voice range profiles and physiological and perceptual voice characteristics in ten-year-old children
J Voice
Vibratory characteristics of the vocal folds in young adult and geriatric women
J Voice
Loud speech in realistic environmental noise: phonetogram data, perceptual voice quality, subjective ratings, and gender differences in healthy speakers
J Voice
Cited by (7)
Preschool children's taste acceptance of highly concentrated fluoride compounds: Effects on nonverbal behavior
2013, Journal of Clinical Pediatric DentistryWithin-talker and within-session stability of acoustic characteristics of conversational and clear speaking stylesa)
2024, Journal of the Acoustical Society of AmericaFeature Fusion and Ablation Analysis in Gender Identification of Preschool Children from Spontaneous Speech
2023, Circuits, Systems, and Signal ProcessingVocal Characteristics across English-Northern Sotho Bilingual Speakers: A Comparative Study
2023, Folia Phoniatrica et LogopaedicaVocal characteristics of 5-year-old children: proposed normative values based on a French-speaking population<sup>†</sup>
2020, Logopedics Phoniatrics Vocology