Temporal difference learning (English Wikipedia)

Analysis of the information sources cited in the references of the English-language Wikipedia article "Temporal difference learning".

Global rank       English rank
2nd place         2nd place
11th place        8th place
4th place         4th place
low place         low place
5th place         5th place
207th place       136th place
low place         low place
1,306th place     885th place
1,131st place     850th place
low place         7,050th place

doi.org

ibm.com

research.ibm.com

incompleteideas.net

nih.gov

pubmed.ncbi.nlm.nih.gov

ncbi.nlm.nih.gov

nips.cc

books.nips.cc

psu.edu

citeseerx.ist.psu.edu

salk.edu

papers.cnl.salk.edu

semanticscholar.org

api.semanticscholar.org

  • Sutton, Richard S. (1 August 1988). "Learning to predict by the methods of temporal differences". Machine Learning. 3 (1): 9–44. doi:10.1007/BF00115009. ISSN 1573-0565. S2CID 207771194.
  • Schultz, W.; Dayan, P.; Montague, P. R. (1997). "A neural substrate of prediction and reward". Science. 275 (5306): 1593–1599. CiteSeerX 10.1.1.133.6176. doi:10.1126/science.275.5306.1593. PMID 9054347. S2CID 220093382.
  • Montague, P. R.; Sejnowski, T. J. (1994). "The predictive brain: temporal coincidence and temporal order in synaptic learning mechanisms". Learning & Memory. 1 (1): 1–33. doi:10.1101/lm.1.1.1. ISSN 1072-0502. PMID 10467583. S2CID 44560099.
  • Sejnowski, T.J.; Dayan, P.; Montague, P.R. (1995). "Predictive Hebbian learning". Proceedings of the eighth annual conference on Computational learning theory - COLT '95. pp. 15–18. doi:10.1145/225298.225300. ISBN 0897917235. S2CID 1709691.
  • Tesauro, Gerald (March 1995). "Temporal Difference Learning and TD-Gammon". Communications of the ACM. 38 (3): 58–68. doi:10.1145/203330.203343. S2CID 6023746.
  • Schultz, W. (1998). "Predictive reward signal of dopamine neurons". Journal of Neurophysiology. 80 (1): 1–27. CiteSeerX 10.1.1.408.5994. doi:10.1152/jn.1998.80.1.1. PMID 9658025. S2CID 52857162.
  • Tobia, M. J.; et al. (2016). "Altered behavioral and neural responsiveness to counterfactual gains in the elderly". Cognitive, Affective, & Behavioral Neuroscience. 16 (3): 457–472. doi:10.3758/s13415-016-0406-7. PMID 26864879. S2CID 11299945.
  • Smith, A.; Li, M.; Becker, S.; Kapur, S. (2006). "Dopamine, prediction error, and associative learning: a model-based account". Network: Computation in Neural Systems. 17 (1): 61–84. doi:10.1080/09548980500361624. PMID 16613795. S2CID 991839.

ucl.ac.uk

gatsby.ucl.ac.uk

worldcat.org

  • Sutton, Richard S. (1 August 1988). "Learning to predict by the methods of temporal differences". Machine Learning. 3 (1): 9–44. doi:10.1007/BF00115009. ISSN 1573-0565. S2CID 207771194.
  • Montague, P. R.; Dayan, P.; Sejnowski, T. J. (1996-03-01). "A framework for mesencephalic dopamine systems based on predictive Hebbian learning" (PDF). The Journal of Neuroscience. 16 (5): 1936–1947. doi:10.1523/JNEUROSCI.16-05-01936.1996. ISSN 0270-6474. PMC 6578666. PMID 8774460.
  • Montague, P. R.; Sejnowski, T. J. (1994). "The predictive brain: temporal coincidence and temporal order in synaptic learning mechanisms". Learning & Memory. 1 (1): 1–33. doi:10.1101/lm.1.1.1. ISSN 1072-0502. PMID 10467583. S2CID 44560099.