Ideas

Rubik's Cube
2048
Minesweeper
Sudoku

Papers

Lots of these reference related work and implementations.

McAleer, S., Agostinelli, F., Shmakov, A., & Baldi, P. (2018). Solving the Rubik's Cube Without Human Knowledge. arXiv preprint arXiv:1805.07470.
McAleer, S., Agostinelli, F., Shmakov, A., & Baldi, P. (2018, September). Solving the Rubik's Cube with Approximate Policy Iteration. In International Conference on Learning Representations.
Agostinelli, F., McAleer, S., Shmakov, A., & Baldi, P. (2019). Solving the Rubik’s cube with deep reinforcement learning and search. Nature Machine Intelligence, 1(8), 356-363.
Nakov, P., & Wei, Z. (2003). Minesweeper,# Minesweeper. Unpublished Manuscript.
Castillo, L. P., & Wrobel, S. (2003, August). Learning minesweeper with multirelational learning. In International Joint Conference on Artificial Intelligence (Vol. 18, pp. 533-540). Lawrence Erlbaum Associates Ltd.
Mehta, A. (2021). Reinforcement Learning For Constraint Satisfaction Game Agents (15-Puzzle, Minesweeper, 2048, and Sudoku). arXiv preprint arXiv:2102.06019.
Wu IC., Yeh KH., Liang CC., Chang CC., Chiang H. (2014) Multi-Stage Temporal Difference Learning for 2048. In: Cheng SM., Day MY. (eds) Technologies and Applications of Artificial Intelligence. TAAI 2014. Lecture Notes in Computer Science, vol 8916. Springer, Cham. https://doi.org/10.1007/978-3-319-13987-6_34
Dedieu, A., & Amar, J. (2017). Deep reinforcement learning for 2048. In Conference on Neural Information Processing Systems (NIPS), 31st, Long Beach, CA, USA.
Matsuzaki, K. (2017, July). Developing a 2048 player with backward temporal coherence learning and restart. In Advances in Computer Games (pp. 176-187). Springer, Cham.
Goga, A. Reinforcement learning in 2048 game.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ideas

Papers

Links

FilesExpand file tree

notes.md

Latest commit

History

notes.md

File metadata and controls

Ideas

Papers

Links