Abstract
The temporal structure of speech has been shown to be highly variable. Speaking rate, stress, and other factors influence the duration of individual speech sounds. The highly elastic nature of speech would seem to pose a problem for the listener, especially with respect to the perception of temporal speech cues such as voice-onset time (VOT) and quantity: How does the listener disentangle those temporal changes whicqh are linguistically significant from those which are extrinsic to the linguistic message? This paper reports data on the behavior of two Icelandic speech cues at different speaking rates. The results show that manipulations of rate have the effect of slightly blurring the distinction between unaspirated and aspirated stops. Despite great changes in the absolute durations of vowels and consonants, the two categories of syllables-V:C and VC:-are nonetheless kept totally distinct. In two perceptual experiments, it is shown that while the ratio of vowel to rhyme duration is the primary cue to quantity and remains invariant at different rates, no such ratio can be defined for VOT. These results imply that quantity is the only one of these two speech cues that is selfnormalizing for rate. Models of rate-dependent speech processing need to address this difference.
Article PDF
Similar content being viewed by others
References
Bannert, R. (1979). The effect of sentence accent on quantity. In E. Fischer-Jørgensen, J. Rischel, & N. Thorsen (Eds.),Proceedings of the Ninth International Congress of Phonetic Sciences (Vol. 2, pp. 253–259). Copenhagen: Institute of Phonetics.
Diehl, R. L., &Walsh, M. A. (1989). An auditory basis for the stimulus-length effect in the perception of stops and glides.Journal of the Acoustical Society of America,85, 2154–2164.
Einarsson, S. (1927).Beiträge zur Phonetik der Isländischen Sprache [Contributions to the phonetics of the Icelandic language]. Oslo: A. W. Brøgger.
Fowler, C. A. (1990). Sound-producing objects as objects of perception: Rate normalization and nonspeech perception.Journal of the Acoustical Society of America,88, 1236–1249.
Ganong, W. F. (1980). Phonetic categorization in auditory word perception.Journal of Experimental Psychology: Human Perception & Performance,6, 110–125.
Green, K. P., Stevens, E. B., &Kuhl, P. K. (1994). Talker continuity and the use of rate information during phonetic perception.Perception & Psychophysics,55, 249–260.
Klatt, D. H. (1980). Software for a cascade/parallel formant synthesizer.Journal of the Acoustical Society of America,67, 971–995.
Klatt, D. H., &Klatt, L. C. (1990). Analysis, synthesis, and perception of voice quality among female and male talkers.Journal of the Acoustical Society of America,87, 820–857.
Lehiste, I. (1970).Suprasegmentals. Cambridge, MA: MIT Press.
Lisker, L. (1974). On time and timing in speech. In T. A. Sebeok (Ed.),Current trends in linguistics (Vol. 12, pp. 2387–2418). The Hague: Mouton.
Magen, H. S., &Blumstein, S. E. (1993). Effects of speaking rate on the vowel length distinction in Korean.Journal of Phonetics,21, 387–409.
Massaro, D. W., &Cohen, M. M. (1983). Consonant/vowel ratio: An improbable cue in speech.Perception & Psychophysics,33, 501–505.
Miller, J. L. (1981). Effects of speaking rates on segmental distinctions. In P. D. Eimas & J. L. Miller (Eds.),Perspectives on the study of speech (pp. 39–74). Hillsdale, NJ: Erlbaum.
Miller, J. L. (1987). Rate-dependent processing in speech perception. In A. W. Ellis (Ed.),Progress in the psychology of language (Vol. 3, pp. 119–157). Hove, U.K.: Erlbaum.
Miller, J. L., &Baer, T. (1983). Some effects of speaking rate on the production of /b/ and /w/.Journal of the Acoustical Society of America,73, 1751–1755.
Miller, J. L., Green, K. P., &Reeves, A. (1986). Speaking rate and segments: A look at the relation between speech production and speech perception for the voicing contrast.Phonetica,43, 106–115.
Miller, J. L., &Liberman, A. M. (1979). Some effects of lateroccurring information on the perception of stop consonant and semivowel.Perception & Psychophysics,25, 457–465.
Miller, J. L., &Volaitis, L. E. (1989). Effect of speaking rate on the perceptual structure of a phonetic category.Perception & Psychophysics,46, 505–512.
Nooteboom, S. G. (1979). Complex control of simple decisions in the perception of vowel length. In E. Fischer-Jørgensen, J. Rischel, & N. Thorsen (Eds.),Proceedings of the Ninth International Congress of Phonetic Sciences (Vol. 2, pp. 298–304). Copenhagen: Institute of Phonetics.
Pind, J. (1982).The perception of quantity and preaspiration in Icelandic. Unpublished doctoral thesis, University of Sussex, Brighton.
Pind, J. (1986). The perception of quantity in Icelandic.Phonetica,43, 116–139.
Pind, J.(in press). Constancy and normalization in the perception of voice offset time as a cue for preaspiration.Acta Psychologica.
Pisoni, D. B., Carrell, T. D., &Gans, S. J. (1983). Perception of the duration of rapid spectrum changes in speech and nonspeech signals.Perception & Psychophysics,34, 314–322.
Port, R. F., &Dalby, J. (1982). Consonant/vowel ratio as a cue for voicing in English.Perception & Psychophysics,32, 141–152.
Summerfield, Q. (1981). On articulatory rate and perceptual constancy in phonetic perception.Journal of Experimental Psychology: Human Perception & Performance,7, 1074–1095.
Volaitis, L. E., &Miller, J. L. (1992). Phonetic prototypes: Influence of place of articulation and speaking rate on the internal structure of voicing categories.Journal of the Acoustical Society of America,92, 723–735.
Author information
Authors and Affiliations
Corresponding author
Additional information
This research was supported by the Icelandic Science Foundation and by the Research Foundation of the University of Iceland.
The paper has benefited considerably from discussions with Harlan Lane, Joanne Miller, Kenneth Stevens, and from comments on an earlier draft by Joanne Miller, Robert Port, and two anonymous reviewers.
Rights and permissions
About this article
Cite this article
Pind, J. Speaking rate, voice-onset time, and quantity: The search for higher-order invariants for two Icelandic speech cues. Perception & Psychophysics 57, 291–304 (1995). https://doi.org/10.3758/BF03213055
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.3758/BF03213055