ABSTRACT
In this paper an architecture and an implementation for a linguistically based prosodic analyser is presented. The implementation is designed to handle typical prosodic input in the form of parallel input channels, and processes each input channel independently in a data-directed, phonologically motivated configuration of partly parallel, partly cascaded feature modules and module clusters, each implemented as finite transducers, producing intonationally relevant categories as output. The design criteria included maximal restriction of computational power (the system could be compiled into one massive finite transducer); relevance to computational linguistic formalisms with a view to developing an integrated model mapping prosodic structures on to textual structures; relatability to speech recognition algorithms, and to phonological theories. It was implemented in an object oriented environment with parallel processing simulation (CheOPS), and a linguistically interesting surface language (BATLAN).
- Berwick, R. C. S. Pilato, 1987, "Learning syntax by automata induction." Machine Learning 2, 9--35. Google ScholarDigital Library
- Bolc, L. & M. Maksymienko, 1981. Komputerowy system przetwarzania tekstow fonematycznych. U Warsaw Press.Google Scholar
- Braun, G. & Jin, F., 1987. Akzentwahrehmung and Akzenterkennung. "Prosodische Kohääsion" Project Report U Bielefeld.Google Scholar
- Chomyszyn, J., 1986. "A phonemic transcription program for Polish." Int. J. Man-Machine Studies 25, 271--293. Google ScholarDigital Library
- Church, K. W., 1980. Memory limitations in natural language processing. Master's thesis, M.I.T.Google Scholar
- Church, K. W., 1983. Phrase Structure Parsing. A method for taking advantage of allophonic constraints. Ph.D. thesis, M.I.T.Google Scholar
- Eikmeyer, H. J., 1986. "CheOPS: an object-oriented system in PROLOG." User Manual Bielefeld.Google Scholar
- Eikmeyer, H. J. & Gibbon, D., 1983. "BATNET: ein ATN-System in einer Nicht-LISP-Umgebung." Sprache und Datenverarbeitung 7, 26--35.Google Scholar
- Gibbon, D., 1981. "A new look at intonation syntax and semantics". In: A. James, P. Westney, eds., New Linguistic Impulses in Foreign Language Teaching. Tübingen: Narr.Google Scholar
- Gibbon, D., 1987. "Finite state processing of tone systems." In: Proc. 3rd Conf. European Chapter of ACL, Copenhagen, 1--3 April 1987, 291--298. Google ScholarDigital Library
- 't Hart, J. & Collier, R., 1975. "Integrating different levcls of intonation analysis." J. Phonetics 3, 235--255.Google Scholar
- Kay, M., 1987. "Nonconcatenative Finite-State Morphology." Proc. 3rd Conf. European Chapter of ACL, Copenhagen, 1--3 April 1987, 2--10. Google ScholarDigital Library
- Kay, M. & Kaplan, R., 1981. "Phonological rules and finite-state transducers." Paper at Annual Meeting of ACL, 28.2.1981, NYC. (Cited by Koskenniemi).Google Scholar
- Levinson, S. E., 1986. "Continuously variable duration hidden Markov models for automatic speech recognition." Computer Speech and Language 1, 29--45. Google ScholarDigital Library
- Pierrehumbert, J., 1980. The Phonology and Phonetics of English Intonation. Ph.D. thesis, M.I.T.Google Scholar
- Pignataro, V., 1987. Ein Sprachgenerierungsmodell mit Topik und Fokus. "Prosodische Kohäsion" Project Report, U Bielefeld.Google Scholar
- The PSI/PHI architecture for prosodic parsing
Recommendations
Incorporated speech overlapped factor (Φ) into speech clarity index (Ψ): method to improve dysarthric speech severity evaluation
i-CREATe '07: Proceedings of the 1st international convention on Rehabilitation engineering & assistive technology: in conjunction with 1st Tan Tock Seng Hospital Neurorehabilitation MeetingDysarthria is a name given to a group of speech disorders. Our research focuses on developing an automated system for dysarthric speech assessment. Being different from traditional speech assessments which normally rely on human-perceptual analyses, ...
Prosodic word boundary detection from Bengali continuous speech
AbstractDetection of word boundaries in continuous speech is a tedious process due to the absence of a definite pause or silence in the word boundary position. Thus, continuous speech recognition is a very challenging task. However, the prosodic word ...
Comments