ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
advertisementadvertisement
Neural Networks
Volume 8, Issue 2, 1995, Pages 167-177
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Purchase PDF (1069 K)

 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/0893-6080(94)00069-X    How to Cite or Link Using DOI (Opens New Window)
Copyright © 1995 Published by Elsevier Science Ltd.

Contributed article

Speed invariant speech recognition using variable velocity delay lines

K. YamauchiCorresponding Author Contact Information, M. Fukuda and K. Fukushima

Faculty of Engineering Science, Osaka University, Japan

Received 10 December 1993; 
accepted 21 June 1994. ;
Available online 20 April 2000.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

A neural network model for speech recognition is proposed, based on neurophysiologicalfindings of the auditory system. The first stage of the system is a feature-extracting module that is a model of the auditory pathway between the cochlea and the auditory cortex. The feature-extracting module extracts constant frequency (CF), FM-ascending (FM-A), and FM-descending (FM-D) components. The second stage is a recognition module that is able to perform time-distortion invariant recognition without ignoring information concerning the relative lengths of each feature. This module consists of a main block and two subblocks. The recognition results are obtained from the main block The two subblocks are used for monitoring the speed of the input pattern. Each block is a neocognitron-like network for which the first layer consists of variable-velocity delay lines. The propagation velocities of the delay lines of the upper and lower blocks are faster and slower, respectively, than that of the main block. The propagation velocities of these delay lines are controlled in such a way that the duration of the feature on the delay line of the main block is the same as the duration of a similar feature of a training pattern. This velocity control is accomplished by comparing the outputs of the two subblocks. The propagation velocities of these three delay lines are variable but the ratio of velocities is kept constant. The computer-simulated system was trained using several Japanese words. After the training was completed, the system recognized each of the words correctly without being affected by their spoken speeds.

Author Keywords: Speech recognition; Auditory pathway; Variable velocity delay line; Velocity control; Speed invariant; Neocognitron; Unsupervised learning

Article Outline

• References

Neural Networks
Volume 8, Issue 2, 1995, Pages 167-177
 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.