Title: Reinforcement learning for an ART-based fuzzy adaptive learning control network
Authors: Lin, CJ
Lin, CT
交大名義發表
電控工程研究所
National Chiao Tung University
Institute of Electrical and Control Engineering
Issue Date: 1-May-1996
Abstract: This paper proposes a reinforcement fuzzy adaptive learning control network (RFALCON) for solving various reinforcement learning problems, The proposed RFALCON is constructed by integrating two fuzzy adaptive learning control networks (FALCON's), each of which is a connectionist model with a feedforward multilayer network developed for the realization of a fuzzy controller, One FALCON performs as a critic network (fuzzy predictor), and the other as an action network (fuzzy controller), Using the temporal difference prediction method, the critic network can predict the external reinforcement signal and provide a more informative internal reinforcement signal to the action network, The action network performs a stochastic exploratory algorithm to adapt itself according to the internal reinforcement signal, An ART-based reinforcement structure/parameter-learning algorithm is developed for constructing the RFALCON dynamically, During the learning process, both structure learning and parameter learning are performed simultaneously in the two FALCON's, The proposed RFALCON can construct a fuzzy control system dynamically and automatically through a reward/penalty signal (i.e., a ''good'' or ''bad'' signal), It is best applied to the learning environment, where obtaining exact training data is expensive, The proposed RFALCON has two important features, First, it reduces the combinatorial demands placed by the standard methods for adaptive Linearization of a system, Second, the RFALCON is a highly autonomous system, Initially, there are no hidden nodes (i.e., no membership functions or Fuzzy rules), They are created and begin to grow as learning proceeds, The RFALCON can also dynamically partition the input-output spaces, tune activation (membership) functions, and find proper network connection types (fuzzy rules), Computer simulations have been conducted to illustrate the performance and applicability of the proposed learning scheme.
URI: http://dx.doi.org/10.1109/72.501728
http://hdl.handle.net/11536/1299
ISSN: 1045-9227
DOI: 10.1109/72.501728
Journal: IEEE TRANSACTIONS ON NEURAL NETWORKS
Volume: 7
Issue: 3
Begin Page: 709
End Page: 731
Appears in Collections:Articles


Files in This Item:

  1. A1996UL25900015.pdf

If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.