强化学习 (Chinese Wikipedia)

Analysis of the information sources cited in the references of the Chinese-language Wikipedia article "强化学习" (Reinforcement learning).

Refs  Website              Rank (global / Chinese)
4     web.archive.org      1st place
3     doi.org              2nd place / 23rd place
2     worldcat.org         5th place / 12th place
2     springer.com         274th place / 320th place
1     ieee.org             652nd place / 712th place
1     jair.org             low place
1     semanticscholar.org  11th place / 332nd place
1     arxiv.org            69th place / 254th place
1     tokic.com            low place

arxiv.org

Kaelbling, L. P.; Littman, M. L.; Moore, A. W. Reinforcement Learning: A Survey. Journal of Artificial Intelligence Research. 1996-05-01, 4: 237-285 [2025-03-15]. ISSN 1076-9757. S2CID 1708582. arXiv:cs/9605103. doi:10.1613/jair.301. (Archived from the original on 2025-05-04).

doi.org

Hu, Junyan; Niu, Hanlin; Carrasco, Joaquin; Lennox, Barry; Arvin, Farshad. Voronoi-Based Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning. IEEE Transactions on Vehicular Technology. 2020-12, 69 (12): 14413-14423. ISSN 1939-9359. doi:10.1109/TVT.2020.3034800. (Archived from the original on 2021-08-13).
Kaelbling, L. P.; Littman, M. L.; Moore, A. W. Reinforcement Learning: A Survey. Journal of Artificial Intelligence Research. 1996-05-01, 4: 237-285 [2025-03-15]. ISSN 1076-9757. S2CID 1708582. arXiv:cs/9605103. doi:10.1613/jair.301. (Archived from the original on 2025-05-04).
van Otterlo, Martijn; Wiering, Marco, Wiering, Marco; van Otterlo, Martijn (eds.), Reinforcement Learning and Markov Decision Processes 12, Springer Berlin Heidelberg: 3–42, 2012 [2025-03-15], ISBN 978-3-642-27644-6, doi:10.1007/978-3-642-27645-3_1

ieee.org

ieeexplore.ieee.org

Hu, Junyan; Niu, Hanlin; Carrasco, Joaquin; Lennox, Barry; Arvin, Farshad. Voronoi-Based Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning. IEEE Transactions on Vehicular Technology. 2020-12, 69 (12): 14413-14423. ISSN 1939-9359. doi:10.1109/TVT.2020.3034800. (Archived from the original on 2021-08-13).

jair.org

Kaelbling, L. P.; Littman, M. L.; Moore, A. W. Reinforcement Learning: A Survey. Journal of Artificial Intelligence Research. 1996-05-01, 4: 237-285 [2025-03-15]. ISSN 1076-9757. S2CID 1708582. arXiv:cs/9605103. doi:10.1613/jair.301. (Archived from the original on 2025-05-04).

semanticscholar.org

api.semanticscholar.org

Kaelbling, L. P.; Littman, M. L.; Moore, A. W. Reinforcement Learning: A Survey. Journal of Artificial Intelligence Research. 1996-05-01, 4: 237-285 [2025-03-15]. ISSN 1076-9757. S2CID 1708582. arXiv:cs/9605103. doi:10.1613/jair.301. (Archived from the original on 2025-05-04).

springer.com

link.springer.com

van Otterlo, Martijn; Wiering, Marco, Wiering, Marco; van Otterlo, Martijn (eds.), Reinforcement Learning and Markov Decision Processes 12, Springer Berlin Heidelberg: 3–42, 2012 [2025-03-15], ISBN 978-3-642-27644-6, doi:10.1007/978-3-642-27645-3_1

springer.com

Gosavi, Abhijit. Simulation-based Optimization: Parametric Optimization Techniques and Reinforcement. Springer. 2003 [2015-08-19]. ISBN 1-4020-7454-9. (Archived from the original on 2012-06-15).

tokic.com

Tokic, Michel; Palm, Günther, Value-Difference Based Exploration: Adaptive Control Between Epsilon-Greedy and Softmax, KI 2011: Advances in Artificial Intelligence (PDF), Lecture Notes in Computer Science 7006, Springer: 335–346, 2011 [2018-09-03], ISBN 978-3-642-24455-1, (Archived from the original (PDF) on 2018-11-23)

web.archive.org

Hu, Junyan; Niu, Hanlin; Carrasco, Joaquin; Lennox, Barry; Arvin, Farshad. Voronoi-Based Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning. IEEE Transactions on Vehicular Technology. 2020-12, 69 (12): 14413-14423. ISSN 1939-9359. doi:10.1109/TVT.2020.3034800. (Archived from the original on 2021-08-13).
Kaelbling, L. P.; Littman, M. L.; Moore, A. W. Reinforcement Learning: A Survey. Journal of Artificial Intelligence Research. 1996-05-01, 4: 237-285 [2025-03-15]. ISSN 1076-9757. S2CID 1708582. arXiv:cs/9605103. doi:10.1613/jair.301. (Archived from the original on 2025-05-04).
Gosavi, Abhijit. Simulation-based Optimization: Parametric Optimization Techniques and Reinforcement. Springer. 2003 [2015-08-19]. ISBN 1-4020-7454-9. (Archived from the original on 2012-06-15).
Tokic, Michel; Palm, Günther, Value-Difference Based Exploration: Adaptive Control Between Epsilon-Greedy and Softmax, KI 2011: Advances in Artificial Intelligence (PDF), Lecture Notes in Computer Science 7006, Springer: 335–346, 2011 [2018-09-03], ISBN 978-3-642-24455-1, (Archived from the original (PDF) on 2018-11-23)

worldcat.org

Hu, Junyan; Niu, Hanlin; Carrasco, Joaquin; Lennox, Barry; Arvin, Farshad. Voronoi-Based Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning. IEEE Transactions on Vehicular Technology. 2020-12, 69 (12): 14413-14423. ISSN 1939-9359. doi:10.1109/TVT.2020.3034800. (Archived from the original on 2021-08-13).
Kaelbling, L. P.; Littman, M. L.; Moore, A. W. Reinforcement Learning: A Survey. Journal of Artificial Intelligence Research. 1996-05-01, 4: 237-285 [2025-03-15]. ISSN 1076-9757. S2CID 1708582. arXiv:cs/9605103. doi:10.1613/jair.301. (Archived from the original on 2025-05-04).