Alinhamento da inteligência artificial (Portuguese Wikipedia)

Analysis of the information sources cited in the references of the Wikipedia article "Alinhamento da inteligência artificial" in the Portuguese-language version.

Global rank     Portuguese rank
5th place       5th place
69th place      195th place
2nd place       4th place
1,559th place   1,438th place
551st place     330th place
4th place       8th place
low place       low place
6,413th place   low place
18th place      51st place
7th place       14th place
12th place      21st place
low place       low place
616th place     783rd place
1st place       1st place
low place       low place
low place       low place
97th place      116th place
9,352nd place   low place
1,943rd place   4,380th place
low place       low place
20th place      25th place
6,158th place   low place
low place       low place
low place       low place
670th place     686th place
896th place     655th place
8,920th place   6,719th place
2,012th place   2,617th place
low place       low place
731st place     660th place
179th place     198th place
916th place     796th place
low place       low place
234th place     190th place
1,185th place   1,301st place
low place       low place
1,160th place   2,262nd place
421st place     1,430th place
low place       6,910th place
79th place      212th place
1,174th place   1,492nd place
49th place      88th place
low place       low place
388th place     592nd place
low place       low place
580th place     413th place
34th place      86th place
low place       low place
low place       low place
1,082nd place   1,403rd place
low place       low place
low place       low place
5,872nd place   low place
3rd place       6th place
low place       low place
low place       low place
low place       low place
low place       low place
low place       low place
6,703rd place   8,040th place
low place       low place
274th place     339th place
low place       low place
415th place     453rd place
low place       low place
610th place     360th place
222nd place     196th place
3,700th place   7,081st place
432nd place     967th place
low place       low place

80000hours.org

aaai.org

ojs.aaai.org

aclanthology.org

acm.org

dl.acm.org

analyticsindiamag.com

arstechnica.com

arxiv.org

  • Hendrycks, Dan; Carlini, Nicholas (16 June 2022). "Unsolved Problems in ML Safety". arXiv:2109.13916 [cs.LG]
  • Carlsmith, Joseph (16 June 2022). "Is Power-Seeking AI an Existential Risk?". arXiv:2206.13353 [cs.CY]
  • Bommasani, Rishi; Hudson, Drew A.; Adeli, Ehsan; Altman, Russ; Arora, Simran; von Arx, Sydney; Bernstein, Michael S.; Bohg, Jeannette; Bosselut, Antoine (12 July 2022). "On the Opportunities and Risks of Foundation Models". Stanford CRFM. arXiv:2108.07258
  • Ouyang, Long; Wu, Jeff (2022). "Training language models to follow instructions with human feedback". arXiv:2203.02155 [cs.CL]
  • Knox, W. Bradley; Allievi, Alessandro; Banzhaf, Holger; Schmitt, Felix; Stone, Peter (11 March 2022). "Reward (Mis)design for Autonomous Driving" (PDF). arXiv:2104.13906
  • Amodei, Dario; Olah, Chris (21 June 2016). "Concrete Problems in AI Safety". arXiv:1606.06565 [cs.AI]
  • Doshi-Velez, Finale; Kim, Been (2 March 2017). "Towards A Rigorous Science of Interpretable Machine Learning". arXiv:1702.08608 [cs, stat]
  • Mohseni, Sina; Wang, Haotao (7 March 2022). "Taxonomy of Machine Learning Safety: A Survey and Primer". arXiv:2106.04823 [cs.LG]
  • Manheim, David; Garrabrant, Scott. "Categorizing Variants of Goodhart's Law". arXiv:1803.04585 [cs.AI]
  • Ji, Ziwei; Lee, Nayeon; Frieske, Rita; Yu, Tiezheng; Su, Dan; Xu, Yan; Ishii, Etsuko; Bang, Yejin; Madotto, Andrea (1 February 2022). "Survey of Hallucination in Natural Language Generation". ACM Computing Surveys. arXiv:2202.03629. doi:10.1145/3571730
  • Wei, Jason; Tay, Yi (15 June 2022). "Emergent Abilities of Large Language Models". arXiv:2206.07682 [cs.CL]
  • Leike, Jan; Martic, Miljan (28 November 2017). "AI Safety Gridworlds". arXiv:1711.09883 [cs.LG]
  • Turner, Alexander Matt; Smith, Logan; Shah, Rohin; Critch, Andrew; Tadepalli, Prasad (3 December 2021). "Optimal Policies Tend to Seek Power". Neural Information Processing Systems. 34. arXiv:1912.01683
  • Everitt, Tom; Lea, Gary (21 May 2018). "AGI Safety Literature Review". arXiv:1805.01109 [cs.AI]
  • Hendrycks, Dan; Burns, Collin; Basart, Steven; Critch, Andrew; Li, Jerry; Song, Dawn; Steinhardt, Jacob (24 July 2021). "Aligning AI With Shared Human Values". International Conference on Learning Representations. arXiv:2008.02275
  • Perez, Ethan; Huang, Saffron (7 February 2022). "Red Teaming Language Models with Language Models". arXiv:2202.03286 [cs.CL]
  • Wu, Jeff; Ouyang, Long (27 September 2021). "Recursively Summarizing Books with Human Feedback". arXiv:2109.10862 [cs.CL]
  • Christiano, Paul; Shlegeris, Buck (19 October 2018). "Supervising strong learners by amplifying weak experts". arXiv:1810.08575 [cs.LG]
  • Leike, Jan; Krueger, David (19 November 2018). "Scalable agent alignment via reward modeling: a research direction". arXiv:1811.07871 [cs.LG]
  • Evans, Owain; Cotton-Barratt, Owen (13 October 2021). "Truthful AI: Developing and governing AI that does not lie". arXiv:2110.06674 [cs.CY]
  • Nakano, Reiichiro; Hilton, Jacob (1 June 2022). "WebGPT: Browser-assisted question-answering with human feedback". arXiv:2112.09332 [cs.CL]
  • Menick, Jacob; Trebacz, Maja; Mikulik, Vladimir; Aslanides, John; Song, Francis; Chadwick, Martin; Glaese, Mia; Young, Susannah; Campbell-Gillingham, Lucy (21 March 2022). "Teaching language models to support answers with verified quotes". DeepMind. arXiv:2203.11147
  • Askell, Amanda; Bai, Yuntao (9 December 2021). "A General Language Assistant as a Laboratory for Alignment". arXiv:2112.00861 [cs.CL]
  • Everitt, Tom; Lea, Gary; Hutter, Marcus (21 May 2018). "AGI Safety Literature Review". arXiv:1805.01109
  • Demski, Abram; Garrabrant, Scott (6 October 2020). "Embedded Agency". arXiv:1902.09469 [cs.AI]
  • Everitt, Tom; Ortega, Pedro A. (6 September 2019). "Understanding Agent Incentives using Causal Influence Diagrams. Part I: Single Action Settings". arXiv:1902.09980 [cs.AI]

basicbooks.com

bbc.com

berkeley.edu

people.eecs.berkeley.edu

books.google.com

ca.gov

leginfo.legislature.ca.gov

cam.ac.uk

turingarchive.kings.cam.ac.uk

cityam.com

dagstuhl.de

drops.dagstuhl.de

deepmind.com

distill.pub

doi.org

dx.doi.org

doi.org

edge.org

elsevier.com

linkinghub.elsevier.com

erichorvitz.com

futureoflife.org

gcrinstitute.org

gov.uk

harvard.edu

ui.adsabs.harvard.edu

infoq.com

jair.org

longtermrisk.org

lukemuehlhauser.com

machinethoughts.wordpress.com

marktechpost.com

medium.com

deepmindsafetyresearch.medium.com

medium.com

mit.edu

direct.mit.edu

nature.com

neurips.cc

proceedings.neurips.cc

nih.gov

ncbi.nlm.nih.gov

nips.cc

papers.nips.cc

nscai.gov

  • NSCAI Final Report (PDF). Washington, DC: The National Security Commission on Artificial Intelligence. 2021 

nytimes.com

nyu.edu

bhr.stern.nyu.edu

openai.com

openreview.net

pearson.com

penguinrandomhouse.com

quantamagazine.org

reddit.com

reuters.com

sagepub.com

journals.sagepub.com

science.org

scientificamerican.com

scottaaronson.blog

  • Aaronson, Scott (17 June 2022). "OpenAI!". Shtetl-Optimized

smallake.kr

springer.com

link.springer.com

stanford.edu

fsi.stanford.edu

technologyreview.com

theguardian.com

theregister.com

towardsdatascience.com

un.org

unite.ai

universitypressscholarship.com

oxford.universitypressscholarship.com

utexas.edu

cs.utexas.edu

venturebeat.com

vetta.org

washingtonpost.com

web.archive.org

wiley.com

onlinelibrary.wiley.com

worldcat.org

wsj.com

wwnorton.co.uk