標題: | GA-based fuzzy reinforcement learning for control of a magnetic bearing system |
作者: | Lin, CT Jou, CP 電控工程研究所 Institute of Electrical and Control Engineering |
關鍵字: | action network;active magnetic bearing;adaptive heuristic critic;critic network |
公開日期: | 1-Apr-2000 |
摘要: | This paper proposes a TD (temporal difference) and GA (genetic algorithm)-based reinforcement (TDGAR) learning method and applies it to the control of a real magnetic bearing system. The TDGAR learning scheme is a new hybrid GA, which integrates the TD prediction method and the GA to perform the reinforcement learning task, The TDGAR learning system is composed of two integrated feedforward networks. One neural network acts as a critic network to guide the learning of the other network (the action network) which determines the outputs (actions) of the TDGAR learning system, The action network can be a normal neural network or a neural fuzzy network, Using the TD prediction method, the critic network can predict the external reinforcement signal and provide a more informative internal reinforcement signal to the action network. The action network uses the GA to adapt itself according to the internal reinforcement signal, The key concept of the TDGAR learning scheme is to formulate the internal reinforcement signal as the fitness function for the GA such that the GA can evaluate the candidate solutions (chromosomes) regularly, even during periods without external feedback from the environment. This enables the GA to proceed to new generations regularly without waiting for the arrival of the external reinforcement signal. This can usually accelerate the GA learning since a reinforcement signal may only be available at a time long after a sequence of actions has occurred in the reinforcement learning problem, The proposed TDGAR learning system has been used to control an active magnetic bearing (AMB) system in practice. A systematic design procedure is developed to achieve successful integration of all the subsystems including magnetic suspension, mechanical structure, and controller training. The results show that the TDGAR learning scheme can successfully find a neural controller or a neural fuzzy controller for a self-designed magnetic bearing system. |
URI: | http://dx.doi.org/10.1109/3477.836376 http://hdl.handle.net/11536/30611 |
ISSN: | 1083-4419 |
DOI: | 10.1109/3477.836376 |
期刊: | IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS |
Volume: | 30 |
Issue: | 2 |
起始頁: | 276 |
結束頁: | 289 |
Appears in Collections: | Articles |
Files in This Item:
If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.