標題: | Reinforcement learning for an ART-based fuzzy adaptive learning control network |
作者: | Lin, CJ Lin, CT 交大名義發表 電控工程研究所 National Chiao Tung University Institute of Electrical and Control Engineering |
公開日期: | 1-May-1996 |
摘要: | This paper proposes a reinforcement fuzzy adaptive learning control network (RFALCON) for solving various reinforcement learning problems, The proposed RFALCON is constructed by integrating two fuzzy adaptive learning control networks (FALCON's), each of which is a connectionist model with a feedforward multilayer network developed for the realization of a fuzzy controller, One FALCON performs as a critic network (fuzzy predictor), and the other as an action network (fuzzy controller), Using the temporal difference prediction method, the critic network can predict the external reinforcement signal and provide a more informative internal reinforcement signal to the action network, The action network performs a stochastic exploratory algorithm to adapt itself according to the internal reinforcement signal, An ART-based reinforcement structure/parameter-learning algorithm is developed for constructing the RFALCON dynamically, During the learning process, both structure learning and parameter learning are performed simultaneously in the two FALCON's, The proposed RFALCON can construct a fuzzy control system dynamically and automatically through a reward/penalty signal (i.e., a ''good'' or ''bad'' signal), It is best applied to the learning environment, where obtaining exact training data is expensive, The proposed RFALCON has two important features, First, it reduces the combinatorial demands placed by the standard methods for adaptive Linearization of a system, Second, the RFALCON is a highly autonomous system, Initially, there are no hidden nodes (i.e., no membership functions or Fuzzy rules), They are created and begin to grow as learning proceeds, The RFALCON can also dynamically partition the input-output spaces, tune activation (membership) functions, and find proper network connection types (fuzzy rules), Computer simulations have been conducted to illustrate the performance and applicability of the proposed learning scheme. |
URI: | http://dx.doi.org/10.1109/72.501728 http://hdl.handle.net/11536/1299 |
ISSN: | 1045-9227 |
DOI: | 10.1109/72.501728 |
期刊: | IEEE TRANSACTIONS ON NEURAL NETWORKS |
Volume: | 7 |
Issue: | 3 |
起始頁: | 709 |
結束頁: | 731 |
Appears in Collections: | Articles |
Files in This Item:
If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.