Title: | A reinforcement neuro-fuzzy combiner for multiobjective control |
Authors: | Lin, CT; Chung, IF (Institute of Electrical and Control Engineering) |
Keywords: | critic information;mutual credit assignment;priori knowledge;reinforcement learning;soft stitch |
Issue Date: | 1-Dec-1999 |
Abstract: | This paper proposes a neuro-fuzzy combiner (NFC) with reinforcement learning capability for solving multiobjective control problems. The proposed NFC can combine n existing low-level controllers in a hierarchical way to form a multiobjective fuzzy controller. It is assumed that each low-level (fuzzy or nonfuzzy) controller has been well designed to serve a particular objective. The role of the NFC is to fuse the n actions decided by the n low-level controllers and determine a proper action to apply to the environment (plant) at each time step. Hence, the NFC can combine low-level controllers and achieve multiple objectives (goals) at once. The NFC acts like a switch that chooses a proper action from the actions of the low-level controllers according to feedback information from the environment. In fact, the NFC is a soft switch; it allows more than one low-level action to be active, each to a different degree, through fuzzy combination at each time step. An NFC can be designed by trial and error if enough a priori knowledge is available, or it can be obtained by supervised learning if precise input/output training data are available. In the more practical case where no instructive teaching information is available, the NFC can learn by itself using the proposed reinforcement learning scheme. Equipped with reinforcement learning capability, the NFC can learn to achieve the desired multiple objectives simultaneously from rough reinforcement feedback from the environment, which contains only critic information such as "success (good)" or "failure (bad)" for each desired objective. Computer simulations have been conducted to illustrate the performance and applicability of the proposed architecture and learning scheme. |
URI: | http://dx.doi.org/10.1109/3477.809028 http://hdl.handle.net/11536/30918 |
ISSN: | 1083-4419 |
DOI: | 10.1109/3477.809028 |
Journal: | IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS |
Volume: | 29 |
Issue: | 6 |
Start page: | 726 |
End page: | 744 |
Appears in Collections: | Journal Articles |
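
The abstract contrasts a switch that picks one low-level action with a soft switch that lets several actions be active to different degrees. The sketch below illustrates only that distinction. It is not the NFC from the paper: the function names `hard_switch` and `soft_switch`, the scalar actions, and the normalized weighted average used as the combination rule are all assumptions made for illustration, and the activation degrees are simply given as inputs rather than learned.

```python
from typing import Sequence


def hard_switch(actions: Sequence[float], degrees: Sequence[float]) -> float:
    """Return the action of the controller with the highest activation degree."""
    if len(actions) != len(degrees) or not actions:
        raise ValueError("actions and degrees must be non-empty and equal in length")
    best = max(range(len(actions)), key=lambda i: degrees[i])
    return actions[best]


def soft_switch(actions: Sequence[float], degrees: Sequence[float]) -> float:
    """Blend all actions, weighting each by its normalized activation degree.

    Assumption: a weighted average stands in for the fuzzy combination;
    the paper's actual combination rule is not given in the abstract.
    """
    if len(actions) != len(degrees) or not actions:
        raise ValueError("actions and degrees must be non-empty and equal in length")
    if any(d < 0 for d in degrees):
        raise ValueError("activation degrees must be non-negative")
    total = sum(degrees)
    if total == 0:
        raise ValueError("at least one activation degree must be positive")
    return sum(a * d for a, d in zip(actions, degrees)) / total


if __name__ == "__main__":
    # Three hypothetical low-level controllers proposing scalar actions.
    actions = [1.0, -1.0, 0.5]
    degrees = [0.5, 0.25, 0.25]
    print("hard switch:", hard_switch(actions, degrees))
    print("soft switch:", soft_switch(actions, degrees))
```

With a hard switch, only the most strongly activated controller influences the plant; with the soft switch, every controller with a nonzero degree contributes in proportion to that degree.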