Silver, David (7 december 2018). A general reinforcement learning algorithm that masters chess, shogi, and go through self-play. Science362 (6419): 1140–1144. PMID30523106. DOI: 10.1126/science.aar6404.
Schrittwieser, Julian (2020). Mastering Atari, Go, chess and shogi by planning with a learned model. Nature588 (7839): 604–609. PMID33361790. DOI: 10.1038/s41586-020-03051-4.
nih.gov
ncbi.nlm.nih.gov
Silver, David (7 december 2018). A general reinforcement learning algorithm that masters chess, shogi, and go through self-play. Science362 (6419): 1140–1144. PMID30523106. DOI: 10.1126/science.aar6404.
Schrittwieser, Julian (2020). Mastering Atari, Go, chess and shogi by planning with a learned model. Nature588 (7839): 604–609. PMID33361790. DOI: 10.1038/s41586-020-03051-4.