Skip to content

Latest commit

 

History

History
58 lines (43 loc) · 2.09 KB

File metadata and controls

58 lines (43 loc) · 2.09 KB

Ideas

  • Rubik's Cube
  • 2048
  • Minesweeper
  • Sudoku

Papers

Lots of these reference related work and implementations.

  • McAleer, S., Agostinelli, F., Shmakov, A., & Baldi, P. (2018). Solving the Rubik's Cube Without Human Knowledge. arXiv preprint arXiv:1805.07470.

  • McAleer, S., Agostinelli, F., Shmakov, A., & Baldi, P. (2018, September). Solving the Rubik's Cube with Approximate Policy Iteration. In International Conference on Learning Representations.

  • Agostinelli, F., McAleer, S., Shmakov, A., & Baldi, P. (2019). Solving the Rubik’s cube with deep reinforcement learning and search. Nature Machine Intelligence, 1(8), 356-363.

  • Nakov, P., & Wei, Z. (2003). Minesweeper,# Minesweeper. Unpublished Manuscript.

  • Castillo, L. P., & Wrobel, S. (2003, August). Learning minesweeper with multirelational learning. In International Joint Conference on Artificial Intelligence (Vol. 18, pp. 533-540). Lawrence Erlbaum Associates Ltd.

  • Mehta, A. (2021). Reinforcement Learning For Constraint Satisfaction Game Agents (15-Puzzle, Minesweeper, 2048, and Sudoku). arXiv preprint arXiv:2102.06019.

  • Wu IC., Yeh KH., Liang CC., Chang CC., Chiang H. (2014) Multi-Stage Temporal Difference Learning for 2048. In: Cheng SM., Day MY. (eds) Technologies and Applications of Artificial Intelligence. TAAI 2014. Lecture Notes in Computer Science, vol 8916. Springer, Cham. https://doi.org/10.1007/978-3-319-13987-6_34

  • Dedieu, A., & Amar, J. (2017). Deep reinforcement learning for 2048. In Conference on Neural Information Processing Systems (NIPS), 31st, Long Beach, CA, USA.

  • Matsuzaki, K. (2017, July). Developing a 2048 player with backward temporal coherence learning and restart. In Advances in Computer Games (pp. 176-187). Springer, Cham.

  • Goga, A. Reinforcement learning in 2048 game.

Links