[공지사항]을 빙자한 안부와 근황 
Show more

Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Bridging Local and Global Knowledge via Transformer in Board Games

Created by
  • Haebom

Author

Yan-Ru Ju, Tai-Lin Wu, Chung-Chin Shih, Ti-Rong Wu

Outline

Although AlphaZero has achieved superhuman performance in board games, it struggles in scenarios that require a comprehensive understanding of the entire board (e.g., long-term pattern recognition in Go). In this paper, we propose ResTNet, which alternates residual blocks and Transformer blocks to bridge local and global knowledge. ResTNet improves the winning rate in several board games (9x9 Go: 54.6% → 60.8%, 19x19 Go: 53.6% → 60.9%, 19x19 hex: 50.4% → 58.0%) and effectively handles long-term patterns such as circle and ladder patterns in 19x19 Go. It reduces the mean squared error of circle pattern recognition from 2.58 to 1.07, reduces the attack probability against adversarial programs from 70.44% to 23.91%, and improves the accuracy of ladder pattern recognition from 59.15% to 80.01%. Visualization of attention maps provides insight into AlphaZero's decision-making process by capturing important game concepts in both Go and Hex. ResTNet presents a promising approach to local and global knowledge integration, paving the way for more effective AlphaZero-based algorithms for board games. The code can be found at https://rlg.iis.sinica.edu.tw/papers/restnet .

Takeaways, Limitations

Takeaways:
ResTNet presents a novel architecture that overcomes AlphaZero's limitations and improves its performance in board games.
We present a method to effectively integrate local and global information.
It helps us understand AlphaZero's decision-making process by improving its long-term pattern recognition capabilities.
We experimentally verified performance improvements in various board games.
Limitations:
Further research is needed to determine whether the proposed method is applicable to all types of board games.
It is possible that the performance gains of ResTNet may be biased towards certain games or patterns.
Further validation of the generalization ability to more complex and diverse patterns is needed.
👍