標題: | On Strength Adjustment for MCTS-Based Programs |
作者: | Wu, I-Chen Wu, Ti-Rong Liu, An-Jen Guei, Hung Wei, Tinghan 資訊工程學系 Department of Computer Science |
公開日期: | 1-一月-2019 |
摘要: | This paper proposes an approach to strength adjustment for MCTS-based game-playing programs. In this approach, we use a softmax policy with a strength index to choose moves. Most importantly, we filter low quality moves by excluding those that have a lower simulation count than a pre-defined threshold ratio of the maximum simulation count. We perform a theoretical analysis, reaching the result that the adjusted policy is guaranteed to choose moves exceeding a lower bound in strength by using a threshold ratio. The approach is applied to the Go program ELF OpenGo. The experiment results show that is highly correlated to the empirical strength; namely, given a threshold ratio 0.1, z is linearly related to the Elo rating with regression error 47.95 Elo where -2 <= z <= 2. Meanwhile, the covered strength range is about 800 Elo ratings in the interval of in. With the ease of strength adjustment using, we present two methods to adjust strength and predict opponents' strengths dynamically. To our knowledge, this result is state-of-the-art in terms of the range of strengths in Elo rating while maintaining a controllable relationship between the strength and a strength index. |
URI: | http://hdl.handle.net/11536/152977 |
ISBN: | 978-1-57735-809-1 |
期刊: | THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE |
起始頁: | 1222 |
結束頁: | 1229 |
顯示於類別: | 會議論文 |