Aprenentatge de reforç a partir de la retroalimentació humana (Catalan Wikipedia)

Analysis of information sources in references of the Wikipedia article "Aprenentatge de reforç a partir de la retroalimentació humana" in Catalan language version.

refsWebsite
Global rank Catalan rank
2nd place
6th place
low place
low place
388th place
527th place
616th place
1,222nd place
1,943rd place
2,446th place
low place
low place
187th place
438th place
54th place
198th place
1,559th place
950th place

arstechnica.com

deepmind.com

doi.org

dx.doi.org

  • Ziegler, Daniel M.; Stiennon, Nisan; Wu, Jeffrey; Brown, Tom B.; Radford, Alec "Fine-Tuning Language Models from Human Preferences"., 2019. DOI: 10.48550/arXiv.1909.08593.
  • Warnell, Garrett; Waytowich, Nicholas; Lawhern, Vernon; Stone, Peter Proceedings of the AAAI Conference on Artificial Intelligence, 32, 1, 25-04-2018. DOI: 10.1609/aaai.v32i1.11485.
  • Bai, Yuntao; Jones, Andy; Ndousse, Kamal; Askell, Amanda; Chen, Anna "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback", 2022. DOI: 10.48550/arXiv.2204.05862.
  • Ouyang, Long; Wu, Jeff; Jiang, Xu; Almeida, Diogo; Wainwright, Carroll L. "Learning to summarize with human feedback", 2022. DOI: 10.48550/arXiv.2203.02155.
  • Glaese, Amelia; McAleese, Nat; Trębacz, Maja; Aslanides, John; Firoiu, Vlad "Building safer dialogue agents", 2022. DOI: 10.48550/arXiv.2209.14375.

forbes.com

huggingface.co

openai.com

techcrunch.com

technologyreview.com

venturebeat.com