Neural Networks
Volume 3, Issue 6, 1990, Pages 671-692
 
doi:10.1016/0893-6080(90)90056-Q    

Copyright © 1990 Published by Elsevier Ltd.

Original contribution

A stochastic reinforcement learning algorithm for learning real-valued functions


Vijaykumar Gullapalli

Department of Computer and Information Science, University of Massachusetts, USA


Received 26 September 1989; accepted 31 January 1990. Available online 5 March 2003.

Abstract

Most of the research in reinforcement learning has been on problems with discrete action spaces. However, many control problems require the application of continuous control signals. In this paper, we present a stochastic reinforcement learning algorithm for learning functions with continuous outputs using a connectionist network. We define stochastic units that compute their real-valued outputs as a function of random activations generated using the normal distribution. Learning takes place by using our algorithm to adjust the two parameters of the normal distribution so as to increase the probability of producing the optimal real value for each input pattern. The performance of the algorithm is studied by using it to learn tasks of varying levels of difficulty. Further, as an example of a potential application, we present a network incorporating these stochastic real-valued units that learns to perform an underconstrained positioning task using a simulated 3 degree-of-freedom robot arm.

Keywords: Neural networks; Associative reinforcement learning; Learning algorithm; Stochastic automata; Real-valued functions; Shaping; Robotics; Neurocontrol
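
The abstract describes units that draw a real-valued activation from a normal distribution and learn by adjusting that distribution's two parameters. The abstract does not give the update equations, so the following Python sketch is only an illustration under stated assumptions: the class name `StochasticUnit`, the learning rates, the rule tying the standard deviation to a predicted reinforcement, the reward function, and the identity output function are all hypothetical choices made here, not the paper's definitions.

```python
import random


class StochasticUnit:
    """Illustrative unit whose output is drawn from a normal distribution.

    All names and update rules are assumptions made for this sketch.
    """

    def __init__(self, n_inputs, lr_mean=0.1, lr_pred=0.1, min_sigma=0.01, seed=0):
        self.theta = [0.0] * n_inputs  # weights that set the mean
        self.phi = [0.0] * n_inputs    # weights that predict the reinforcement
        self.lr_mean = lr_mean
        self.lr_pred = lr_pred
        self.min_sigma = min_sigma     # floor that keeps the division in learn() safe
        self.rng = random.Random(seed)

    @staticmethod
    def _dot(w, x):
        return sum(wi * xi for wi, xi in zip(w, x))

    def parameters(self, x):
        """Return (mean, standard deviation, predicted reinforcement) for input x."""
        mu = self._dot(self.theta, x)
        r_hat = self._dot(self.phi, x)
        # Assumed schedule: explore less as the predicted reinforcement nears 1.
        sigma = max(1.0 - r_hat, self.min_sigma)
        return mu, sigma, r_hat

    def act(self, x):
        """Sample a real-valued output (identity output function assumed)."""
        mu, sigma, _ = self.parameters(x)
        return self.rng.gauss(mu, sigma)

    def learn(self, x, action, reward):
        """Shift the mean toward actions that did better than predicted."""
        mu, sigma, r_hat = self.parameters(x)
        surprise = reward - r_hat          # better or worse than expected
        direction = (action - mu) / sigma  # normalized exploration offset
        for i, xi in enumerate(x):
            self.theta[i] += self.lr_mean * surprise * direction * xi
            self.phi[i] += self.lr_pred * surprise * xi


def train_on_single_target(target=0.7, trials=5000, seed=0):
    """Train one unit on one input pattern with a hypothetical reward in [0, 1]."""
    unit = StochasticUnit(n_inputs=1, seed=seed)
    x = [1.0]
    for _ in range(trials):
        action = unit.act(x)
        reward = max(0.0, 1.0 - abs(action - target))
        unit.learn(x, action, reward)
    return unit.parameters(x)


if __name__ == "__main__":
    mu, sigma, r_hat = train_on_single_target()
    print(f"mean={mu:.3f} sigma={sigma:.3f} predicted reward={r_hat:.3f}")
```

In this sketch the standard deviation shrinks as the predicted reinforcement rises, so the unit explores widely at first and narrows its output distribution once rewards become reliably high. That is one plausible way to "increase the probability of producing the optimal real value", offered as an assumption rather than as the published algorithm.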


Requests for reprints should be sent to Vijaykumar Gullapalli, COINS Dept., University of Massachusetts, Amherst, MA 01003.
