Reinforcement learning for an ART-based fuzzy adaptive learning control network

doi:10.1109/72.501728

標題:	Reinforcement learning for an ART-based fuzzy adaptive learning control network
作者:	Lin, CJ Lin, CT 交大名義發表電控工程研究所 National Chiao Tung University Institute of Electrical and Control Engineering
公開日期:	1-May-1996
摘要:	This paper proposes a reinforcement fuzzy adaptive learning control network (RFALCON) for solving various reinforcement learning problems, The proposed RFALCON is constructed by integrating two fuzzy adaptive learning control networks (FALCON's), each of which is a connectionist model with a feedforward multilayer network developed for the realization of a fuzzy controller, One FALCON performs as a critic network (fuzzy predictor), and the other as an action network (fuzzy controller), Using the temporal difference prediction method, the critic network can predict the external reinforcement signal and provide a more informative internal reinforcement signal to the action network, The action network performs a stochastic exploratory algorithm to adapt itself according to the internal reinforcement signal, An ART-based reinforcement structure/parameter-learning algorithm is developed for constructing the RFALCON dynamically, During the learning process, both structure learning and parameter learning are performed simultaneously in the two FALCON's, The proposed RFALCON can construct a fuzzy control system dynamically and automatically through a reward/penalty signal (i.e., a ''good'' or ''bad'' signal), It is best applied to the learning environment, where obtaining exact training data is expensive, The proposed RFALCON has two important features, First, it reduces the combinatorial demands placed by the standard methods for adaptive Linearization of a system, Second, the RFALCON is a highly autonomous system, Initially, there are no hidden nodes (i.e., no membership functions or Fuzzy rules), They are created and begin to grow as learning proceeds, The RFALCON can also dynamically partition the input-output spaces, tune activation (membership) functions, and find proper network connection types (fuzzy rules), Computer simulations have been conducted to illustrate the performance and applicability of the proposed learning scheme.
URI:	http://dx.doi.org/10.1109/72.501728 http://hdl.handle.net/11536/1299
ISSN:	1045-9227
DOI:	10.1109/72.501728
期刊:	IEEE TRANSACTIONS ON NEURAL NETWORKS
Volume:	7
Issue:	3
起始頁:	709
結束頁:	731
Appears in Collections:	Articles

Files in This Item:

A1996UL25900015.pdf

If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.