Reference analysis of the French Wikipedia article "Apprentissage par renforcement profond" (deep reinforcement learning)

arxiv.org

Francois-Lavet, Henderson, Islam and Bellemare, "An Introduction to Deep Reinforcement Learning", Foundations and Trends in Machine Learning, vol. 11, nos. 3–4, 2018, pp. 219–354 (ISSN 1935-8237, DOI 10.1561/2200000071, Bibcode 2018arXiv181112560F, arXiv 1811.12560).

Schrittwieser, Antonoglou, Hubert and Simonyan, "Mastering Atari, Go, chess and shogi by planning with a learned model", Nature, vol. 588, no. 7839, 23 December 2020, pp. 604–609 (PMID 33361790, DOI 10.1038/s41586-020-03051-4, Bibcode 2020Natur.588..604S, arXiv 1911.08265, read online)

Levine, Finn, Darrell and Abbeel, "End-to-end training of deep visuomotor policies", JMLR, vol. 17, January 2016 (arXiv 1504.00702, read online)

OpenAI, "Solving Rubik's Cube with a Robot Hand" (2019) (arXiv 1910.07113, read online)

John Schulman, Sergey Levine, Philipp Moritz, Michael Jordan and Pieter Abbeel, "Trust Region Policy Optimization" (2015) (arXiv 1502.05477, read online)
—International Conference on Machine Learning (ICML)

John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford and Oleg Klimov, "Proximal Policy Optimization Algorithms" (2017) (arXiv 1707.06347, read online)

Timothy Lillicrap, Jonathan Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver and Daan Wierstra, "Continuous control with deep reinforcement learning" (2016) (arXiv 1509.02971, read online)
—International Conference on Learning Representations (ICLR)

Volodymyr Mnih, Adria Puigdomenech Badia, Mehdi Mirza, Alex Graves, Tim Harley, Timothy Lillicrap, David Silver and Koray Kavukcuoglu, "Asynchronous Methods for Deep Reinforcement Learning" (2016) (arXiv 1602.01783, read online)
—International Conference on Machine Learning (ICML)

Tuomas Haarnoja, Aurick Zhou, Sergey Levine and Pieter Abbeel, "Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor" (2018) (arXiv 1801.01290, read online)
—International Conference on Machine Learning (ICML)

Patrik Reizinger and Márton Szemenyei, "Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning", IEEE, 2019 (arXiv 1910.10840)

"Maximum Entropy Deep Inverse Reinforcement Learning", on arXiv, 2015

Marcin Andrychowicz, Filip Wolski, Alex Ray, Jonas Schneider, Rachel Fong, Peter Welinder, Bob McGrew, Josh Tobin and Pieter Abbeel, "Hindsight Experience Replay" (2018) (arXiv 1707.01495, read online)
—Advances in Neural Information Processing Systems (NeurIPS)

Charles Packer, Katelyn Gao, Jernej Kos, Philipp Krähenbühl, Vladlen Koltun et al., "Assessing Generalization in Deep Reinforcement Learning", 15 March 2019.

doi.org

dx.doi.org

Tesauro, "Temporal Difference Learning and TD-Gammon", Communications of the ACM, vol. 38, no. 3, March 1995, pp. 58–68 (DOI 10.1145/203330.203343, read online [archived 9 February 2010], accessed 10 March 2017)

Mnih et al., "Human-level control through deep reinforcement learning", Nature, vol. 518, no. 7540, 2015, pp. 529–533 (PMID 25719670, DOI 10.1038/nature14236, Bibcode 2015Natur.518..529M)

Silver, Huang, Maddison and Guez, "Mastering the game of Go with deep neural networks and tree search", Nature, vol. 529, no. 7587, 28 January 2016, pp. 484–489 (ISSN 0028-0836, PMID 26819042, DOI 10.1038/nature16961, Bibcode 2016Natur.529..484S)

Bellemare, Candido, Castro and Gong, "Autonomous navigation of stratospheric balloons using reinforcement learning", Nature, vol. 588, no. 7836, 2 December 2020, pp. 77–82 (PMID 33268863, DOI 10.1038/s41586-020-2939-8, Bibcode 2020Natur.588...77B, read online)

Williams, "Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning", Machine Learning, vol. 8, nos. 3–4, 1992, pp. 229–256 (DOI 10.1007/BF00992696)

harvard.edu

ui.adsabs.harvard.edu

Mnih et al., "Human-level control through deep reinforcement learning", Nature, vol. 518, no. 7540, 2015, pp. 529–533 (PMID 25719670, DOI 10.1038/nature14236, Bibcode 2015Natur.518..529M)

issn.org

portal.issn.org

nature.com

nih.gov

ncbi.nlm.nih.gov

Mnih et al., "Human-level control through deep reinforcement learning", Nature, vol. 518, no. 7540, 2015, pp. 529–533 (PMID 25719670, DOI 10.1038/nature14236, Bibcode 2015Natur.518..529M)

oita-u.ac.jp

shws.cc.oita-u.ac.jp

Katsunari Shibata and Yoichi Okabe, "Reinforcement Learning When Visual Sensory Signals are Directly Given as Inputs" (1997) (read online)
—International Conference on Neural Networks (ICNN) 1997

Katsunari Shibata and Masaru Iida, "Acquisition of Box Pushing by Direct-Vision-Based Reinforcement Learning" (2003) (read online)
—SICE Annual Conference 2003

Hiroki Utsunomiya and Katsunari Shibata, "Contextual Behavior and Internal Representations Acquired by Reinforcement Learning with a Recurrent Neural Network in a Continuous State and Action Space Task" (2008) (read online)
—International Conference on Neural Information Processing (ICONIP) '08

Katsunari Shibata and Tomohiko Kawano, "Learning of Action Generation from Raw Camera Images in a Real-World-like Environment by Simple Coupling of Reinforcement Learning and a Neural Network" (2008) (read online)
—International Conference on Neural Information Processing (ICONIP) '08

Apprentissage par renforcement profond (French Wikipedia)

arxiv.org

athenasc.com

bkgm.com

deepmind.com

doi.org

dx.doi.org

harvard.edu

ui.adsabs.harvard.edu

issn.org

portal.issn.org

jmlr.org

mlr.press

proceedings.mlr.press

nature.com

nih.gov

ncbi.nlm.nih.gov

oita-u.ac.jp

shws.cc.oita-u.ac.jp

openai.com

toronto.edu

cs.toronto.edu

web.archive.org

youtube.com