完整後設資料紀錄
DC 欄位語言
dc.contributor.authorHsueh, Chu-Hsuanen_US
dc.contributor.authorWu, I-Chenen_US
dc.contributor.authorChen, Jr-Changen_US
dc.contributor.authorHsu, Tsan-shengen_US
dc.date.accessioned2019-04-02T06:04:32Z-
dc.date.available2019-04-02T06:04:32Z-
dc.date.issued2018-01-01en_US
dc.identifier.issn2376-6816en_US
dc.identifier.urihttp://dx.doi.org/10.1109/TAAI.2018.00034en_US
dc.identifier.urihttp://hdl.handle.net/11536/151040-
dc.description.abstractThe AlphaZero algorithm, developed by DeepMind, achieved superhuman levels of play in the games of chess, shogi, and Go, by learning without domain-specific knowledge except game rules. This paper investigates whether the algorithm can also learn theoretical values and optimal plays for non-deterministic games. Since the theoretical values of such games are expected win rates, not a simple win, loss, or draw, it is worthy investigating the ability of the AlphaZero algorithm to approximate expected win rates of positions. This paper also studies how the algorithm is influenced by a set of hyper-parameters. The tested non-deterministic game is a reduced and solved version of Chinese dark chess (CDC), called 2x4 CDC. The experiments show that the AlphaZero algorithm converges nearly to the theoretical values and the optimal plays in many of the settings of the hyper-parameters. To our knowledge, this is the first research paper that applies the AlphaZero algorithm to non-deterministic games.en_US
dc.language.isoen_USen_US
dc.subjectAlphaZeroen_US
dc.subjectnon-deterministic gameen_US
dc.subjectChinese dark chessen_US
dc.subjecttheoretical valueen_US
dc.titleAlphaZero for a Non-deterministic Gameen_US
dc.typeProceedings Paperen_US
dc.identifier.doi10.1109/TAAI.2018.00034en_US
dc.identifier.journal2018 CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI)en_US
dc.citation.spage116en_US
dc.citation.epage121en_US
dc.contributor.department資訊工程學系zh_TW
dc.contributor.departmentDepartment of Computer Scienceen_US
dc.identifier.wosnumberWOS:000458676200025en_US
dc.citation.woscount0en_US
顯示於類別:會議論文