ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
advertisementadvertisement
Neurocomputing
Volume 9, Issue 3, December 1995, Pages 271-292
Control and Robotics, Part III
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Purchase PDF (1628 K)

 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/0925-2312(95)00035-X    How to Cite or Link Using DOI (Opens New Window)
Copyright © 1995 Published by Elsevier Science B.V.

Paper

Direct associative reinforcement learning methods for dynamic systems control

Vijaykumar GullapalliE-mail The Corresponding Author

Dept. of Mechanical and Aerospace Engineering, Princeton University, Princeton, NJ 08544, USA

Received 17 May 1994; 
accepted 9 March 1995. ;
Available online 21 April 2000.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

Most problems in learning to control dynamic systems involve learning under uncertainty, noise, and the lack of explicit instructional information about how to perform a task. Under these circumstances, techniques developed by artificial intelligence researchers for ‘learning from examples’, including the ‘supervised learning’ techniques studied by neural network researchers, are impractical because of the difficulty of obtaining training information (the ‘examples’) in the form of situation-action training pairs. A useful alternative in such situations is a learning technique that can discover appropriate actions in various situations through a search process that is guided by evaluative performance feedback. Reinforcement learning methods developed by neural network researchers are examples of such techniques. This paper focuses on direct reinforcement learning techniques and discusses their role in learning control by relating them to similar adaptive control methods. Several examples are also presented to illustrate the power and utility of direct reinforcement learning techniques for learning control.

Author Keywords: Reinforcement learning; Learning control; Direct methods; Peg-in-hole insertion; Inverse kinematics; Pole balancing

Article Outline

• References

Neurocomputing
Volume 9, Issue 3, December 1995, Pages 271-292
Control and Robotics, Part III
 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.