完整後設資料紀錄
DC 欄位語言
dc.contributor.authorPi, Chen-Huanen_US
dc.contributor.authorHu, Kai-Chunen_US
dc.contributor.authorCheng, Stoneen_US
dc.contributor.authorWu, I-Chenen_US
dc.date.accessioned2020-03-02T03:23:32Z-
dc.date.available2020-03-02T03:23:32Z-
dc.date.issued2020-02-01en_US
dc.identifier.issn0967-0661en_US
dc.identifier.urihttp://dx.doi.org/10.1016/j.conengprac.2019.104222en_US
dc.identifier.urihttp://hdl.handle.net/11536/153802-
dc.description.abstractThis paper proposes a low-level quadrotor control algorithm using neural networks with model-free reinforcement learning, then explores the algorithm's capabilities on quadrotor hover and tracking tasks. We provide a new point of view by examining the well-known policy gradient algorithm from reinforcement learning, then relaxing its requirements to improve training efficiency. Without requiring expert demonstrations, the improved algorithm is then applied to train a quadrotor controller with its output directly mapped to four actuators in a simulator, which is a technique used to control any linear or nonlinear system under unknown dynamic parameters and disturbances. We show two experimental tasks both in simulation and real-world quadrotors to verify our method and demonstrate performance: 1) hovering at a fixed position, and 2) tracking along a specific trajectory.en_US
dc.language.isoen_USen_US
dc.subjectReinforcement learningen_US
dc.subjectPolicy gradienten_US
dc.subjectQuadrotoren_US
dc.titleLow-level autonomous control and tracking of quadrotor using reinforcement learningen_US
dc.typeArticleen_US
dc.identifier.doi10.1016/j.conengprac.2019.104222en_US
dc.identifier.journalCONTROL ENGINEERING PRACTICEen_US
dc.citation.volume95en_US
dc.citation.spage0en_US
dc.citation.epage0en_US
dc.contributor.department機械工程學系zh_TW
dc.contributor.department應用數學系zh_TW
dc.contributor.department資訊工程學系zh_TW
dc.contributor.departmentDepartment of Mechanical Engineeringen_US
dc.contributor.departmentDepartment of Applied Mathematicsen_US
dc.contributor.departmentDepartment of Computer Scienceen_US
dc.identifier.wosnumberWOS:000510526900020en_US
dc.citation.woscount0en_US
顯示於類別:期刊論文