AlphaZero for a Non-deterministic Game

doi:10.1109/TAAI.2018.00034

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.author	Hsueh, Chu-Hsuan	en_US
dc.contributor.author	Wu, I-Chen	en_US
dc.contributor.author	Chen, Jr-Chang	en_US
dc.contributor.author	Hsu, Tsan-sheng	en_US
dc.date.accessioned	2019-04-02T06:04:32Z	-
dc.date.available	2019-04-02T06:04:32Z	-
dc.date.issued	2018-01-01	en_US
dc.identifier.issn	2376-6816	en_US
dc.identifier.uri	http://dx.doi.org/10.1109/TAAI.2018.00034	en_US
dc.identifier.uri	http://hdl.handle.net/11536/151040	-
dc.description.abstract	The AlphaZero algorithm, developed by DeepMind, achieved superhuman levels of play in the games of chess, shogi, and Go, by learning without domain-specific knowledge except game rules. This paper investigates whether the algorithm can also learn theoretical values and optimal plays for non-deterministic games. Since the theoretical values of such games are expected win rates, not a simple win, loss, or draw, it is worthy investigating the ability of the AlphaZero algorithm to approximate expected win rates of positions. This paper also studies how the algorithm is influenced by a set of hyper-parameters. The tested non-deterministic game is a reduced and solved version of Chinese dark chess (CDC), called 2x4 CDC. The experiments show that the AlphaZero algorithm converges nearly to the theoretical values and the optimal plays in many of the settings of the hyper-parameters. To our knowledge, this is the first research paper that applies the AlphaZero algorithm to non-deterministic games.	en_US
dc.language.iso	en_US	en_US
dc.subject	AlphaZero	en_US
dc.subject	non-deterministic game	en_US
dc.subject	Chinese dark chess	en_US
dc.subject	theoretical value	en_US
dc.title	AlphaZero for a Non-deterministic Game	en_US
dc.type	Proceedings Paper	en_US
dc.identifier.doi	10.1109/TAAI.2018.00034	en_US
dc.identifier.journal	2018 CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI)	en_US
dc.citation.spage	116	en_US
dc.citation.epage	121	en_US
dc.contributor.department	資訊工程學系	zh_TW
dc.contributor.department	Department of Computer Science	en_US
dc.identifier.wosnumber	WOS:000458676200025	en_US
dc.citation.woscount	0	en_US
顯示於類別：	會議論文