Copyright © 1995 Published by Elsevier Science Ltd.
Contributed article
Speed invariant speech recognition using variable velocity delay lines
Received 10 December 1993;
References and further reading may be available for this article. To view references and further reading you must purchase this article.
Abstract
A neural network model for speech recognition is proposed, based on neurophysiologicalfindings of the auditory system. The first stage of the system is a feature-extracting module that is a model of the auditory pathway between the cochlea and the auditory cortex. The feature-extracting module extracts constant frequency (CF), FM-ascending (FM-A), and FM-descending (FM-D) components. The second stage is a recognition module that is able to perform time-distortion invariant recognition without ignoring information concerning the relative lengths of each feature. This module consists of a main block and two subblocks. The recognition results are obtained from the main block The two subblocks are used for monitoring the speed of the input pattern. Each block is a neocognitron-like network for which the first layer consists of variable-velocity delay lines. The propagation velocities of the delay lines of the upper and lower blocks are faster and slower, respectively, than that of the main block. The propagation velocities of these delay lines are controlled in such a way that the duration of the feature on the delay line of the main block is the same as the duration of a similar feature of a training pattern. This velocity control is accomplished by comparing the outputs of the two subblocks. The propagation velocities of these three delay lines are variable but the ratio of velocities is kept constant. The computer-simulated system was trained using several Japanese words. After the training was completed, the system recognized each of the words correctly without being affected by their spoken speeds.
Author Keywords: Speech recognition; Auditory pathway; Variable velocity delay line; Velocity control; Speed invariant; Neocognitron; Unsupervised learning







E-mail Article
Add to my Quick Links

Cited By in Scopus (4)





