完整後設資料紀錄
DC 欄位語言
dc.contributor.authorLiu, An-Jenen_US
dc.contributor.authorWu, Ti-Rongen_US
dc.contributor.authorWu, I-Chenen_US
dc.contributor.authorGuei, Hungen_US
dc.contributor.authorWei, Ting-Hanen_US
dc.date.accessioned2020-10-05T01:59:46Z-
dc.date.available2020-10-05T01:59:46Z-
dc.date.issued2020-08-01en_US
dc.identifier.issn1556-603Xen_US
dc.identifier.urihttp://dx.doi.org/10.1109/MCI.2020.2998315en_US
dc.identifier.urihttp://hdl.handle.net/11536/154901-
dc.description.abstractThis paper proposes an approach to strength adjustment and assessment for Monte-Carlo tree search based game-playing programs. We modify an existing softmax policy with a strength index to choose moves. The most important modification is a mechanism which filters low-quality moves by excluding those that have a lower simulation count than a pre-defined threshold ratio of the maximum simulation count. Through theoretical analysis, we show that the adjusted policy is guaranteed to choose moves exceeding a lower bound in strength by using a threshold ratio. Experimental results show that the strength index is highly correlated to the empirical strength. With an index value between ?2, we can cover a strength range of about 800 Elo ratings. The strength adjustment and assessment methods were also tested in real-world scenarios with human players, ranging from professionals (strongest) to kyu rank amateurs (weakest). For amateur levels, we tested our mechanism on two popular Go online platforms - Fox Weiqi and Tygem. The result shows that our method can adjust program strength to different ranks stably. In terms of strength assessment, we proposed a new dynamic strength adjustment method, then used it to evaluate human professionals, predicting reliably their playing strengths within 15 games. Lastly, we collected survey responses asking players about strength perception, entertainment, and general comments for different aspects of analysis. To our best knowledge, this result is state-ofthe- art in terms of the range of strengths in Elo rating while maintaining a controllable relationship between the strength and a strength index.en_US
dc.language.isoen_USen_US
dc.titleStrength Adjustment and Assessment for MCTS-Based Programs [Research Frontier]en_US
dc.typeArticleen_US
dc.identifier.doi10.1109/MCI.2020.2998315en_US
dc.identifier.journalIEEE COMPUTATIONAL INTELLIGENCE MAGAZINEen_US
dc.citation.volume15en_US
dc.citation.issue3en_US
dc.citation.spage60en_US
dc.citation.epage73en_US
dc.contributor.department資訊工程學系zh_TW
dc.contributor.departmentDepartment of Computer Scienceen_US
dc.identifier.wosnumberWOS:000549316900005en_US
dc.citation.woscount0en_US
顯示於類別:期刊論文