Q-учeње (Serbian Wikipedia)

Analysis of the information sources cited in the references of the Wikipedia article "Q-учeње" (Q-learning) in the Serbian-language version.

Refs | Website | Global rank | Serbian rank
6 | web.archive.org | 1st | 1st
4 | doi.org | 2nd | 4th
4 | books.google.com | 3rd | 2nd
3 | arxiv.org | 69th | 240th
2 | ualberta.ca | 3,600th | 5,793rd
2 | worldcat.org | 5th | 12th
2 | nih.gov | 4th | 8th
1 | utl.pt | low | low
1 | ut.ee | 8,317th | low
1 | incompleteideas.net | low | low
1 | archive.org | 6th | 5th
1 | leemon.com | low | low
1 | huji.ac.il | 3,903rd | 4,151st
1 | bkgm.com | low | 8,516th
1 | rhul.ac.uk | low | low
1 | storage.googleapis.com | 5,609th | low
1 | harvard.edu | 18th | 28th
1 | nips.cc | low | low
1 | aaai.org | 9,352nd | low
1 | microsoft.com | 153rd | 508th
1 | anu.edu.au | 942nd | 2,000th

aaai.org (Global: 9,352nd place; Serbian: low place)

van Hasselt, Hado; Guez, Arthur; Silver, David (2015). "Deep reinforcement learning with double Q-learning". AAAI Conference on Artificial Intelligence: 2094–2100. arXiv:1509.06461. Archived from the original (PDF) on 6 February 2020. Retrieved 14 April 2022.

anu.edu.au (Global: 942nd place; Serbian: 2,000th place)

users.cecs.anu.edu.au

Gaskett, Chris; Wettergreen, David; Zelinsky, Alexander (1999). "Q-Learning in Continuous State and Action Spaces" (PDF).

archive.org (Global: 6th place; Serbian: 5th place)

Russell, Stuart J.; Norvig, Peter (2010). Artificial Intelligence: A Modern Approach (Third ed.). Prentice Hall. p. 649. ISBN 978-0136042594.

arxiv.org (Global: 69th place; Serbian: 240th place)

François-Lavet, Vincent; Fonteneau, Raphael (2015-12-07). "How to Discount Deep Reinforcement Learning: Towards New Dynamic Strategies". arXiv:1512.02011 [cs.LG].
van Hasselt, Hado; Guez, Arthur; Silver, David (2015). "Deep reinforcement learning with double Q-learning". AAAI Conference on Artificial Intelligence: 2094–2100. arXiv:1509.06461. Archived from the original (PDF) on 6 February 2020. Retrieved 14 April 2022.
Hessel, Matteo; Modayil, Joseph; van Hasselt, Hado; Schaul, Tom; Ostrovski, Georg; Dabney, Will; Horgan, Dan; Piot, Bilal; Azar, Mohammad (February 2018). "Rainbow: Combining Improvements in Deep Reinforcement Learning". AAAI Conference on Artificial Intelligence. arXiv:1710.02298. Retrieved 16 September 2021.

bkgm.com (Global: low place; Serbian: 8,516th place)

Tesauro, Gerald (March 1995). "Temporal Difference Learning and TD-Gammon". Communications of the ACM. 38 (3): 58–68. doi:10.1145/203330.203343. Retrieved 2010-02-08.

books.google.com (Global: 3rd place; Serbian: 2nd place)

Hasselt, Hado van (5 March 2012). "Reinforcement Learning in Continuous State and Action Spaces". Eds.: Wiering, Marco; Otterlo, Martijn van. Reinforcement Learning: State-of-the-Art. Springer Science & Business Media. pp. 207–251. ISBN 978-3-642-27645-3.
Bozinovski, S. (15 July 1999). "Crossbar Adaptive Array: The first connectionist network that solved the delayed reinforcement learning problem". Eds.: Dobnikar, Andrej; Steele, Nigel C.; Pearson, David W.; Albrecht, Rudolf F. Artificial Neural Nets and Genetic Algorithms: Proceedings of the International Conference in Portorož, Slovenia, 1999. Springer Science & Business Media. pp. 320–325. ISBN 978-3-211-83364-3.
Bozinovski, S. (1982). "A self learning system using secondary reinforcement". Ed.: Trappl, Robert. Cybernetics and Systems Research: Proceedings of the Sixth European Meeting on Cybernetics and Systems Research. North Holland. p. 397. ISBN 978-0-444-86488-8.
Barto, A. (24 February 1997). "Reinforcement learning". Eds.: Omidvar, Omid; Elliott, David L. Neural Systems for Control. Elsevier. ISBN 978-0-08-053739-9.

doi.org (Global: 2nd place; Serbian: 4th place)

Shteingart, Hanan; Neiman, Tal; Loewenstein, Yonatan (May 2013). "The role of first impression in operant learning." (PDF). Journal of Experimental Psychology: General (in English). 142 (2): 476–488. ISSN 1939-2222. PMID 22924882. doi:10.1037/a0029550. Archived from the original (PDF) on 26 January 2021. Retrieved 14 April 2022.
Tesauro, Gerald (March 1995). "Temporal Difference Learning and TD-Gammon". Communications of the ACM. 38 (3): 58–68. doi:10.1145/203330.203343. Retrieved 2010-02-08.
Watkins, Chris; Dayan, Peter (1992). "Q-learning". Machine Learning. 8 (3–4): 279–292. doi:10.1007/BF00992698.
Mnih, Volodymyr; Kavukcuoglu, Koray; Silver, David; Rusu, Andrei A.; Veness, Joel; Bellemare, Marc G.; Graves, Alex; Riedmiller, Martin; Fidjeland, Andreas K. (February 2015). "Human-level control through deep reinforcement learning". Nature (in English). 518 (7540): 529–533. Bibcode:2015Natur.518..529M. ISSN 0028-0836. PMID 25719670. doi:10.1038/nature14236.

harvard.edu (Global: 18th place; Serbian: 28th place)

adsabs.harvard.edu

Mnih, Volodymyr; Kavukcuoglu, Koray; Silver, David; Rusu, Andrei A.; Veness, Joel; Bellemare, Marc G.; Graves, Alex; Riedmiller, Martin; Fidjeland, Andreas K. (February 2015). "Human-level control through deep reinforcement learning". Nature (in English). 518 (7540): 529–533. Bibcode:2015Natur.518..529M. ISSN 0028-0836. PMID 25719670. doi:10.1038/nature14236.

huji.ac.il (Global: 3,903rd place; Serbian: 4,151st place)

ratio.huji.ac.il

Shteingart, Hanan; Neiman, Tal; Loewenstein, Yonatan (May 2013). "The role of first impression in operant learning." (PDF). Journal of Experimental Psychology: General (in English). 142 (2): 476–488. ISSN 1939-2222. PMID 22924882. doi:10.1037/a0029550. Archived from the original (PDF) on 26 January 2021. Retrieved 14 April 2022.

incompleteideas.net (Global: low place; Serbian: low place)

Sutton, Richard; Barto, Andrew (1998). Reinforcement Learning: An Introduction. MIT Press.

leemon.com (Global: low place; Serbian: low place)

Baird, Leemon (1995). "Residual algorithms: Reinforcement learning with function approximation" (PDF). ICML: 30–37.

microsoft.com (Global: 153rd place; Serbian: 508th place)

Strehl, Alexander L.; Li, Lihong; Wiewiora, Eric; Langford, John; Littman, Michael L. (2006). "Pac model-free reinforcement learning" (PDF). Proc. 22nd ICML: 881–888.

nih.gov (Global: 4th place; Serbian: 8th place)

ncbi.nlm.nih.gov

Shteingart, Hanan; Neiman, Tal; Loewenstein, Yonatan (May 2013). "The role of first impression in operant learning." (PDF). Journal of Experimental Psychology: General (in English). 142 (2): 476–488. ISSN 1939-2222. PMID 22924882. doi:10.1037/a0029550. Archived from the original (PDF) on 26 January 2021. Retrieved 14 April 2022.
Mnih, Volodymyr; Kavukcuoglu, Koray; Silver, David; Rusu, Andrei A.; Veness, Joel; Bellemare, Marc G.; Graves, Alex; Riedmiller, Martin; Fidjeland, Andreas K. (February 2015). "Human-level control through deep reinforcement learning". Nature (in English). 518 (7540): 529–533. Bibcode:2015Natur.518..529M. ISSN 0028-0836. PMID 25719670. doi:10.1038/nature14236.

nips.cc (Global: low place; Serbian: low place)

papers.nips.cc

van Hasselt, Hado (2011). "Double Q-learning" (PDF). Advances in Neural Information Processing Systems. 23: 2613–2622.

rhul.ac.uk (Global: low place; Serbian: low place)

cs.rhul.ac.uk

Watkins, C.J.C.H. Learning from Delayed Rewards (PDF) (Thesis). University of Cambridge.

storage.googleapis.com (Global: 5,609th place; Serbian: low place)

patentimages.storage.googleapis.com

"Methods and Apparatus for Reinforcement Learning, US Patent #20150100530A1" (PDF). US Patent Office. 9 April 2015. Retrieved 28 July 2018.

ualberta.ca (Global: 3,600th place; Serbian: 5,793rd place)

webdocs.cs.ualberta.ca

Sutton, Richard S.; Barto, Andrew G. "2.7 Optimistic Initial Values". Reinforcement Learning: An Introduction. Archived from the original on 2013-09-08. Retrieved 2013-07-18.
Maei, Hamid; Szepesvári, Csaba; Bhatnagar, Shalabh; Sutton, Richard (2010). "Toward off-policy learning control with function approximation in Proceedings of the 27th International Conference on Machine Learning" (PDF). pp. 719–726. Archived from the original (PDF) on 2012-09-08. Retrieved 2016-01-25.

ut.ee (Global: 8,317th place; Serbian: low place)

neuro.cs.ut.ee

Matiisen, Tambet (19 December 2015). "Demystifying Deep Reinforcement Learning". neuro.cs.ut.ee (in English). Computational Neuroscience Lab. Archived from the original on 7 April 2018. Retrieved 2018-04-06.

utl.pt (Global: low place; Serbian: low place)

users.isr.ist.utl.pt

Melo, Francisco S. "Convergence of Q-learning: a simple proof" (PDF). Archived from the original (PDF) on 18 November 2017. Retrieved 14 April 2022.

web.archive.org (Global: 1st place; Serbian: 1st place)

Melo, Francisco S. "Convergence of Q-learning: a simple proof" (PDF). Archived from the original (PDF) on 18 November 2017. Retrieved 14 April 2022.
Matiisen, Tambet (19 December 2015). "Demystifying Deep Reinforcement Learning". neuro.cs.ut.ee (in English). Computational Neuroscience Lab. Archived from the original on 7 April 2018. Retrieved 2018-04-06.
Sutton, Richard S.; Barto, Andrew G. "2.7 Optimistic Initial Values". Reinforcement Learning: An Introduction. Archived from the original on 2013-09-08. Retrieved 2013-07-18.
Shteingart, Hanan; Neiman, Tal; Loewenstein, Yonatan (May 2013). "The role of first impression in operant learning." (PDF). Journal of Experimental Psychology: General (in English). 142 (2): 476–488. ISSN 1939-2222. PMID 22924882. doi:10.1037/a0029550. Archived from the original (PDF) on 26 January 2021. Retrieved 14 April 2022.
van Hasselt, Hado; Guez, Arthur; Silver, David (2015). "Deep reinforcement learning with double Q-learning". AAAI Conference on Artificial Intelligence: 2094–2100. arXiv:1509.06461. Archived from the original (PDF) on 6 February 2020. Retrieved 14 April 2022.
Maei, Hamid; Szepesvári, Csaba; Bhatnagar, Shalabh; Sutton, Richard (2010). "Toward off-policy learning control with function approximation in Proceedings of the 27th International Conference on Machine Learning" (PDF). pp. 719–726. Archived from the original (PDF) on 2012-09-08. Retrieved 2016-01-25.

worldcat.org (Global: 5th place; Serbian: 12th place)

Shteingart, Hanan; Neiman, Tal; Loewenstein, Yonatan (May 2013). "The role of first impression in operant learning." (PDF). Journal of Experimental Psychology: General (in English). 142 (2): 476–488. ISSN 1939-2222. PMID 22924882. doi:10.1037/a0029550. Archived from the original (PDF) on 26 January 2021. Retrieved 14 April 2022.
Mnih, Volodymyr; Kavukcuoglu, Koray; Silver, David; Rusu, Andrei A.; Veness, Joel; Bellemare, Marc G.; Graves, Alex; Riedmiller, Martin; Fidjeland, Andreas K. (February 2015). "Human-level control through deep reinforcement learning". Nature (in English). 518 (7540): 529–533. Bibcode:2015Natur.518..529M. ISSN 0028-0836. PMID 25719670. doi:10.1038/nature14236.
