대형 언어 모델 (Korean Wikipedia)

Analysis of information sources in references of the Wikipedia article "대형 언어 모델" in Korean language version.

refsWebsite

Global rank Korean rank

19arxiv.org

69^th place

54^th place

11web.archive.org

1^st place

8doi.org

2^nd place

3^rd place

4aclanthology.org

low place

4github.com

383^rd place

118^th place

3worldcat.org

5^th place

11^th place

3googleblog.com

1,272^nd place

983^rd place

3openai.com

1,559^th place

711^th place

2naver.com

46^th place

2^nd place

2harvard.edu

18^th place

27^th place

2mit.edu

415^th place

263^rd place

2venturebeat.com

616^th place

362^nd place

2deepmind.com

low place

8,293^rd place

2facebook.com

77^th place

194^th place

1analyticsindiamag.com

low place

7,747^th place

1ieee.org

652^nd place

342^nd place

1neurips.cc

low place

1semanticscholar.org

11^th place

310^th place

1theguardian.com

12^th place

65^th place

1euronews.com

612^th place

2,608^th place

1technologyreview.com

1,943^rd place

1,161^st place

1ourworldindata.org

2,263^rd place

781^st place

1unite.ai

low place

1nvidia.com

2,503^rd place

1,329^th place

1kdnuggets.com

low place

1lambdalabs.com

low place

1techcrunch.com

187^th place

102^nd place

1forefront.ai

low place

1microsoft.com

153^rd place

82^nd place

1anthropic.com

low place

1nature.com

234^th place

148^th place

1huggingface.co

low place

1amazon.science

low place

1amazon.com

105^th place

291^st place

1cerebras.net

low place

1fastcompanyme.com

low place

1cnbc.com

220^th place

358^th place

aclanthology.org

Rogers, Anna; Kovaleva, Olga; Rumshisky, Anna (2020). “A Primer in BERTology: What We Know About How BERT Works”. 《Transactions of the Association for Computational Linguistics》 8: 842–866. arXiv:2002.12327. doi:10.1162/tacl_a_00349. S2CID 211532403. 2022년 4월 3일에 원본 문서에서 보존된 문서. 2024년 1월 21일에 확인함.
Movva, Rajiv; Balachandar, Sidhika; Peng, Kenny; Agostini, Gabriel; Garg, Nikhil; Pierson, Emma (2024). 〈Topics, Authors, and Institutions in Large Language Model Research: Trends from 17K arXiv Papers〉. 《Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)》. 1223–1243쪽. arXiv:2307.10700. doi:10.18653/v1/2024.naacl-long.67. 2024년 12월 8일에 확인함.
Movva, Rajiv; Balachandar, Sidhika; Peng, Kenny; Agostini, Gabriel; Garg, Nikhil; Pierson, Emma (2024). 〈Topics, Authors, and Institutions in Large Language Model Research: Trends from 17K arXiv Papers〉. 《Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)》. 1223–1243쪽. arXiv:2307.10700. doi:10.18653/v1/2024.naacl-long.67. 2024년 12월 8일에 확인함.
Black, Sidney; Biderman, Stella; Hallahan, Eric; 외. (2022년 5월 1일). 《GPT-NeoX-20B: An Open-Source Autoregressive Language Model》. Proceedings of BigScience Episode #5 -- Workshop on Challenges & Perspectives in Creating Large Language Models. 95–136쪽. 2022년 12월 19일에 확인함.

amazon.com

aws.amazon.com

“AlexaTM 20B is now available in Amazon SageMaker JumpStart | AWS Machine Learning Blog”. 《aws.amazon.com》. 2022년 11월 17일. 2023년 3월 13일에 확인함.

amazon.science

“20B-parameter Alexa model sets new marks in few-shot learning”. 《Amazon Science》 (영어). 2022년 8월 2일.

analyticsindiamag.com

Goled, Shraddha (2021년 5월 7일). “Self-Supervised Learning Vs Semi-Supervised Learning: How They Differ”. 《Analytics India Magazine》.

anthropic.com

“Product”. 《Anthropic》 (영어). 2023년 3월 14일에 확인함.

arxiv.org

Goodman, Joshua (2001년 8월 9일), 《A Bit of Progress in Language Modeling》, arXiv:cs/0108005, Bibcode:2001cs........8005G
Rogers, Anna; Kovaleva, Olga; Rumshisky, Anna (2020). “A Primer in BERTology: What We Know About How BERT Works”. 《Transactions of the Association for Computational Linguistics》 8: 842–866. arXiv:2002.12327. doi:10.1162/tacl_a_00349. S2CID 211532403. 2022년 4월 3일에 원본 문서에서 보존된 문서. 2024년 1월 21일에 확인함.
Movva, Rajiv; Balachandar, Sidhika; Peng, Kenny; Agostini, Gabriel; Garg, Nikhil; Pierson, Emma (2024). 〈Topics, Authors, and Institutions in Large Language Model Research: Trends from 17K arXiv Papers〉. 《Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)》. 1223–1243쪽. arXiv:2307.10700. doi:10.18653/v1/2024.naacl-long.67. 2024년 12월 8일에 확인함.
Movva, Rajiv; Balachandar, Sidhika; Peng, Kenny; Agostini, Gabriel; Garg, Nikhil; Pierson, Emma (2024). 〈Topics, Authors, and Institutions in Large Language Model Research: Trends from 17K arXiv Papers〉. 《Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)》. 1223–1243쪽. arXiv:2307.10700. doi:10.18653/v1/2024.naacl-long.67. 2024년 12월 8일에 확인함.
Peng, Bo; 외. (2023). “RWKV: Reinventing RNNS for the Transformer Era”. arXiv:2305.13048 [cs.CL].
Gu, Albert; Dao, Tri (2023년 12월 1일), 《Mamba: Linear-Time Sequence Modeling with Selective State Spaces》, arXiv:2312.00752
Devlin, Jacob; Chang, Ming-Wei; Lee, Kenton; Toutanova, Kristina (2018년 10월 11일). “BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding”. arXiv:1810.04805v2 [cs.CL].
Gao, Leo; Biderman, Stella; Black, Sid; Golding, Laurence; Hoppe, Travis; Foster, Charles; Phang, Jason; He, Horace; Thite, Anish; Nabeshima, Noa; Presser, Shawn; Leahy, Connor (2020년 12월 31일). “The Pile: An 800GB Dataset of Diverse Text for Language Modeling”. arXiv:2101.00027 [cs.CL].
Smith, Shaden; Patwary, Mostofa; Norick, Brandon; LeGresley, Patrick; Rajbhandari, Samyam; Casper, Jared; Liu, Zhun; Prabhumoye, Shrimai; Zerveas, George; Korthikanti, Vijay; Zhang, Elton; Child, Rewon; Aminabadi, Reza Yazdani; Bernauer, Julie; Song, Xia (2022년 2월 4일). “Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model”. arXiv:2201.11990.
Wang, Shuohuan; Sun, Yu; Xiang, Yang; Wu, Zhihua; Ding, Siyu; Gong, Weibao; Feng, Shikun; Shang, Junyuan; Zhao, Yanbin; Pang, Chao; Liu, Jiaxiang; Chen, Xuyi; Lu, Yuxiang; Liu, Weixin; Wang, Xi; Bai, Yangfan; Chen, Qiuliang; Zhao, Li; Li, Shiyong; Sun, Peng; Yu, Dianhai; Ma, Yanjun; Tian, Hao; Wu, Hua; Wu, Tian; Zeng, Wei; Li, Ge; Gao, Wen; Wang, Haifeng (2021년 12월 23일). “ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation”. arXiv:2112.12731.
Askell, Amanda; Bai, Yuntao; Chen, Anna; 외. (2021년 12월 9일). “A General Language Assistant as a Laboratory for Alignment”. arXiv:2112.00861 [cs.CL].
Hoffmann, Jordan; Borgeaud, Sebastian; Mensch, Arthur; 외. (2022년 3월 29일). “Training Compute-Optimal Large Language Models”. arXiv:2203.15556 [cs.CL].
Zhang, Susan; Roller, Stephen; Goyal, Naman; Artetxe, Mikel; Chen, Moya; Chen, Shuohui; Dewan, Christopher; Diab, Mona; Li, Xian; Lin, Xi Victoria; Mihaylov, Todor; Ott, Myle; Shleifer, Sam; Shuster, Kurt; Simig, Daniel; Koura, Punit Singh; Sridhar, Anjali; Wang, Tianlu; Zettlemoyer, Luke (2022년 6월 21일). “OPT: Open Pre-trained Transformer Language Models”. arXiv:2205.01068 [cs.CL].
Lewkowycz, Aitor; Andreassen, Anders; Dohan, David; Dyer, Ethan; Michalewski, Henryk; Ramasesh, Vinay; Slone, Ambrose; Anil, Cem; Schlag, Imanol; Gutman-Solo, Theo; Wu, Yuhuai; Neyshabur, Behnam; Gur-Ari, Guy; Misra, Vedant (2022년 6월 30일). “Solving Quantitative Reasoning Problems with Language Models”. arXiv:2206.14858 [cs.CL].
Taylor, Ross; Kardas, Marcin; Cucurull, Guillem; Scialom, Thomas; Hartshorn, Anthony; Saravia, Elvis; Poulton, Andrew; Kerkez, Viktor; Stojnic, Robert (2022년 11월 16일). “Galactica: A Large Language Model for Science”. arXiv:2211.09085 [cs.CL].
Soltan, Saleh; Ananthakrishnan, Shankar; FitzGerald, Jack; 외. (2022년 8월 3일). “AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model”. arXiv:2208.01448 [cs.CL].
Wu, Shijie; Irsoy, Ozan; Lu, Steven; Dabravolski, Vadim; Dredze, Mark; Gehrmann, Sebastian; Kambadur, Prabhanjan; Rosenberg, David; Mann, Gideon (2023년 3월 30일). “BloombergGPT: A Large Language Model for Finance”. arXiv:2303.17564.
Ren, Xiaozhe; Zhou, Pingyi; Meng, Xinfan; Huang, Xinjing; Wang, Yadao; Wang, Weichao; Li, Pengfei; Zhang, Xiaoda; Podolskiy, Alexander; Arshinov, Grigory; Bout, Andrey; Piontkovskaya, Irina; Wei, Jiansheng; Jiang, Xin; Su, Teng; Liu, Qun; Yao, Jun (2023년 3월 19일). “PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing”. arXiv:2303.10845.
Köpf, Andreas; Kilcher, Yannic; von Rütte, Dimitri; Anagnostidis, Sotiris; Tam, Zhi-Rui; Stevens, Keith; Barhoum, Abdullah; Duc, Nguyen Minh; Stanley, Oliver; Nagyfi, Richárd; ES, Shahul; Suri, Sameer; Glushkov, David; Dantuluri, Arnav; Maguire, Andrew (2023년 4월 14일). “OpenAssistant Conversations -- Democratizing Large Language Model Alignment”. 《arXiv:2304.07327 [cs]》.

cerebras.net

Dey, Nolan (2023년 3월 28일). “Cerebras-GPT: A Family of Open, Compute-efficient, Large Language Models”. 《Cerebras》.

cnbc.com

Elias, Jennifer (2023년 5월 16일). “Google's newest A.I. model uses nearly five times more text data for training than its predecessor”. 《CNBC》. 2023년 5월 18일에 확인함.

deepmind.com

“Language modelling at scale: Gopher, ethical considerations, and retrieval”. 《www.deepmind.com》 (영어). 2023년 3월 20일에 확인함.
Hoffmann, Jordan; Borgeaud, Sebastian; Mensch, Arthur; Sifre, Laurent (2022년 4월 12일). “An empirical analysis of compute-optimal large language model training”. 《Deepmind Blog》.

doi.org

dx.doi.org

Kilgarriff, Adam; Grefenstette, Gregory (September 2003). “Introduction to the Special Issue on the Web as Corpus”. 《Computational Linguistics》 29 (3): 333–347. doi:10.1162/089120103322711569. ISSN 0891-2017.
Banko, Michele; Brill, Eric (2001). “Scaling to very very large corpora for natural language disambiguation”. 《Proceedings of the 39th Annual Meeting on Association for Computational Linguistics - ACL '01》 (Morristown, NJ, USA: Association for Computational Linguistics): 26–33. doi:10.3115/1073012.1073017.
Resnik, Philip; Smith, Noah A. (September 2003). “The Web as a Parallel Corpus”. 《Computational Linguistics》 29 (3): 349–380. doi:10.1162/089120103322711578. ISSN 0891-2017. 2024년 6월 7일에 원본 문서에서 보존된 문서. 2024년 6월 7일에 확인함.
Halevy, Alon; Norvig, Peter; Pereira, Fernando (March 2009). “The Unreasonable Effectiveness of Data”. 《IEEE Intelligent Systems》 24 (2): 8–12. doi:10.1109/MIS.2009.36. ISSN 1541-1672.
Chen, Leiyu; Li, Shaobo; Bai, Qiang; Yang, Jing; Jiang, Sanlong; Miao, Yanming (2021). “Review of Image Classification Algorithms Based on Convolutional Neural Networks”. 《Remote Sensing》 13 (22): 4712. Bibcode:2021RemS...13.4712C. doi:10.3390/rs13224712.
Rogers, Anna; Kovaleva, Olga; Rumshisky, Anna (2020). “A Primer in BERTology: What We Know About How BERT Works”. 《Transactions of the Association for Computational Linguistics》 8: 842–866. arXiv:2002.12327. doi:10.1162/tacl_a_00349. S2CID 211532403. 2022년 4월 3일에 원본 문서에서 보존된 문서. 2024년 1월 21일에 확인함.
Movva, Rajiv; Balachandar, Sidhika; Peng, Kenny; Agostini, Gabriel; Garg, Nikhil; Pierson, Emma (2024). 〈Topics, Authors, and Institutions in Large Language Model Research: Trends from 17K arXiv Papers〉. 《Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)》. 1223–1243쪽. arXiv:2307.10700. doi:10.18653/v1/2024.naacl-long.67. 2024년 12월 8일에 확인함.
Movva, Rajiv; Balachandar, Sidhika; Peng, Kenny; Agostini, Gabriel; Garg, Nikhil; Pierson, Emma (2024). 〈Topics, Authors, and Institutions in Large Language Model Research: Trends from 17K arXiv Papers〉. 《Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)》. 1223–1243쪽. arXiv:2307.10700. doi:10.18653/v1/2024.naacl-long.67. 2024년 12월 8일에 확인함.

euronews.com

“ChatGPT a year on: 3 ways the AI chatbot has completely changed the world in 12 months”. Euronews. 2023년 11월 30일. 2024년 1월 14일에 원본 문서에서 보존된 문서. 2024년 1월 20일에 확인함.

facebook.com

ai.facebook.com

“Democratizing access to large-scale language models with OPT-175B”. 《ai.facebook.com》 (영어).
“Introducing LLaMA: A foundational, 65-billion-parameter large language model”. 《Meta AI》. 2023년 2월 24일.

fastcompanyme.com

“Abu Dhabi-based TII launches its own version of ChatGPT”. 《tii.ae》.

forefront.ai

“GPT-J-6B: An Introduction to the Largest Open Source GPT Model | Forefront”. 《www.forefront.ai》 (영어). 2023년 3월 9일에 원본 문서에서 보존된 문서. 2023년 2월 28일에 확인함.

github.com

“BERT”. 2023년 3월 13일 – GitHub 경유.
“gpt-2”. 《GitHub》. 2023년 3월 13일에 확인함.
“GPT Neo”. 2023년 3월 15일 – GitHub 경유.
Khrushchev, Mikhail; Vasilev, Ruslan; Petrov, Alexey; Zinov, Nikolay (2022년 6월 22일), 《YaLM 100B》, 2023년 3월 18일에 확인함

googleblog.com

ai.googleblog.com

Dai, Andrew M; Du, Nan (2021년 12월 9일). “More Efficient In-Context Learning with GLaM”. 《ai.googleblog.com》 (영어). 2023년 3월 9일에 확인함.
Cheng, Heng-Tze; Thoppilan, Romal (2022년 1월 21일). “LaMDA: Towards Safe, Grounded, and High-Quality Dialog Models for Everything”. 《ai.googleblog.com》 (영어). 2023년 3월 9일에 확인함.
Narang, Sharan; Chowdhery, Aakanksha (2022년 4월 4일). “Pathways Language Model (PaLM): Scaling to 540 Billion Parameters for Breakthrough Performance”. 《ai.googleblog.com》 (영어). 2023년 3월 9일에 확인함.

harvard.edu

adsabs.harvard.edu

Goodman, Joshua (2001년 8월 9일), 《A Bit of Progress in Language Modeling》, arXiv:cs/0108005, Bibcode:2001cs........8005G
Chen, Leiyu; Li, Shaobo; Bai, Qiang; Yang, Jing; Jiang, Sanlong; Miao, Yanming (2021). “Review of Image Classification Algorithms Based on Convolutional Neural Networks”. 《Remote Sensing》 13 (22): 4712. Bibcode:2021RemS...13.4712C. doi:10.3390/rs13224712.

huggingface.co

“bigscience/bloom · Hugging Face”. 《huggingface.co》.

ieee.org

ieeexplore.ieee.org

Halevy, Alon; Norvig, Peter; Pereira, Fernando (March 2009). “The Unreasonable Effectiveness of Data”. 《IEEE Intelligent Systems》 24 (2): 8–12. doi:10.1109/MIS.2009.36. ISSN 1541-1672.

kdnuggets.com

“BERT, RoBERTa, DistilBERT, XLNet: Which one to use?”. ^{[깨진 링크(과거 내용 찾기)]}

lambdalabs.com

“OpenAI's GPT-3 Language Model: A Technical Overview”. 《lambdalabs.com》 (영어).

microsoft.com

Alvi, Ali; Kharya, Paresh (2021년 10월 11일). “Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, the World's Largest and Most Powerful Generative Language Model”. 《Microsoft Research》.

mit.edu

direct.mit.edu

Kilgarriff, Adam; Grefenstette, Gregory (September 2003). “Introduction to the Special Issue on the Web as Corpus”. 《Computational Linguistics》 29 (3): 333–347. doi:10.1162/089120103322711569. ISSN 0891-2017.
Resnik, Philip; Smith, Noah A. (September 2003). “The Web as a Parallel Corpus”. 《Computational Linguistics》 29 (3): 349–380. doi:10.1162/089120103322711578. ISSN 0891-2017. 2024년 6월 7일에 원본 문서에서 보존된 문서. 2024년 6월 7일에 확인함.

nature.com

Ananthaswamy, Anil (2023년 3월 8일). “In AI, is bigger always better?”. 《Nature》.

naver.com

terms.naver.com

《대규모 언어모델》. ICT 시사용어 2025. 2025. 2025년 4월 15일에 확인함.
《거대 언어 모델》. 두산백과. 2025. 2025년 4월 15일에 확인함.

neurips.cc

proceedings.neurips.cc

Vaswani, Ashish; Shazeer, Noam; Parmar, Niki; Uszkoreit, Jakob; Jones, Llion; Gomez, Aidan N; Kaiser, Łukasz; Polosukhin, Illia (2017). “Attention is All you Need” (PDF). 《Advances in Neural Information Processing Systems》 (Curran Associates, Inc.) 30. 2024년 2월 21일에 원본 문서 (PDF)에서 보존된 문서. 2024년 1월 21일에 확인함.

nvidia.com

blogs.nvidia.com

Merritt, Rick (2022년 3월 25일). “What Is a Transformer Model?”. 《NVIDIA Blog》. 2023년 11월 17일에 원본 문서에서 보존된 문서. 2023년 7월 25일에 확인함.

openai.com

“GPT-2: 1.5B Release”. 《OpenAI》 (영어). 2019년 11월 5일. 2019년 11월 14일에 원본 문서에서 보존된 문서. 2019년 11월 14일에 확인함.
“Better language models and their implications”. 《openai.com》.

cdn.openai.com

“GPT-4 Technical Report” (PDF). 《OpenAI》. 2023. 2023년 3월 14일에 원본 문서 (PDF)에서 보존된 문서. 2023년 3월 14일에 확인함.

ourworldindata.org

“Parameters in notable artificial intelligence systems”. 《ourworldindata.org》. 2023년 11월 30일. 2024년 1월 20일에 확인함.

semanticscholar.org

api.semanticscholar.org

Rogers, Anna; Kovaleva, Olga; Rumshisky, Anna (2020). “A Primer in BERTology: What We Know About How BERT Works”. 《Transactions of the Association for Computational Linguistics》 8: 842–866. arXiv:2002.12327. doi:10.1162/tacl_a_00349. S2CID 211532403. 2022년 4월 3일에 원본 문서에서 보존된 문서. 2024년 1월 21일에 확인함.

techcrunch.com

Wiggers, Kyle (2022년 4월 28일). “The emerging types of language models and why they matter”. 《TechCrunch》.

technologyreview.com

Heaven, Will (2023년 3월 14일). “GPT-4 is bigger and better than ChatGPT—but OpenAI won't say why”. MIT Technology Review. 2023년 3월 17일에 원본 문서에서 보존된 문서. 2024년 1월 20일에 확인함.

theguardian.com

Hern, Alex (2019년 2월 14일). “New AI fake text generator may be too dangerous to release, say creators”. 《The Guardian》. 2019년 2월 14일에 원본 문서에서 보존된 문서. 2024년 1월 20일에 확인함.

unite.ai

Zia, Dr Tehseen (2024년 1월 8일). “Unveiling of Large Multimodal Models: Shaping the Landscape of Language Models in 2024”. 《Unite.AI》 (미국 영어). 2024년 12월 28일에 확인함.

venturebeat.com

Sharma, Shubham (2025년 1월 20일). “Open-source DeepSeek-R1 uses pure reinforcement learning to match OpenAI o1 — at 95% less cost”. 《VentureBeat》 (미국 영어). 2025년 1월 26일에 확인함.
Iyer, Abhishek (2021년 5월 15일). “GPT-3's free alternative GPT-Neo is something to be excited about”. 《VentureBeat》.

web.archive.org

Resnik, Philip; Smith, Noah A. (September 2003). “The Web as a Parallel Corpus”. 《Computational Linguistics》 29 (3): 349–380. doi:10.1162/089120103322711578. ISSN 0891-2017. 2024년 6월 7일에 원본 문서에서 보존된 문서. 2024년 6월 7일에 확인함.
Vaswani, Ashish; Shazeer, Noam; Parmar, Niki; Uszkoreit, Jakob; Jones, Llion; Gomez, Aidan N; Kaiser, Łukasz; Polosukhin, Illia (2017). “Attention is All you Need” (PDF). 《Advances in Neural Information Processing Systems》 (Curran Associates, Inc.) 30. 2024년 2월 21일에 원본 문서 (PDF)에서 보존된 문서. 2024년 1월 21일에 확인함.
Rogers, Anna; Kovaleva, Olga; Rumshisky, Anna (2020). “A Primer in BERTology: What We Know About How BERT Works”. 《Transactions of the Association for Computational Linguistics》 8: 842–866. arXiv:2002.12327. doi:10.1162/tacl_a_00349. S2CID 211532403. 2022년 4월 3일에 원본 문서에서 보존된 문서. 2024년 1월 21일에 확인함.
Hern, Alex (2019년 2월 14일). “New AI fake text generator may be too dangerous to release, say creators”. 《The Guardian》. 2019년 2월 14일에 원본 문서에서 보존된 문서. 2024년 1월 20일에 확인함.
“ChatGPT a year on: 3 ways the AI chatbot has completely changed the world in 12 months”. Euronews. 2023년 11월 30일. 2024년 1월 14일에 원본 문서에서 보존된 문서. 2024년 1월 20일에 확인함.
Heaven, Will (2023년 3월 14일). “GPT-4 is bigger and better than ChatGPT—but OpenAI won't say why”. MIT Technology Review. 2023년 3월 17일에 원본 문서에서 보존된 문서. 2024년 1월 20일에 확인함.
Merritt, Rick (2022년 3월 25일). “What Is a Transformer Model?”. 《NVIDIA Blog》. 2023년 11월 17일에 원본 문서에서 보존된 문서. 2023년 7월 25일에 확인함.
“BERT, RoBERTa, DistilBERT, XLNet: Which one to use?”. ^{[깨진 링크(과거 내용 찾기)]}
“GPT-2: 1.5B Release”. 《OpenAI》 (영어). 2019년 11월 5일. 2019년 11월 14일에 원본 문서에서 보존된 문서. 2019년 11월 14일에 확인함.
“GPT-J-6B: An Introduction to the Largest Open Source GPT Model | Forefront”. 《www.forefront.ai》 (영어). 2023년 3월 9일에 원본 문서에서 보존된 문서. 2023년 2월 28일에 확인함.
“GPT-4 Technical Report” (PDF). 《OpenAI》. 2023. 2023년 3월 14일에 원본 문서 (PDF)에서 보존된 문서. 2023년 3월 14일에 확인함.

worldcat.org

Kilgarriff, Adam; Grefenstette, Gregory (September 2003). “Introduction to the Special Issue on the Web as Corpus”. 《Computational Linguistics》 29 (3): 333–347. doi:10.1162/089120103322711569. ISSN 0891-2017.
Resnik, Philip; Smith, Noah A. (September 2003). “The Web as a Parallel Corpus”. 《Computational Linguistics》 29 (3): 349–380. doi:10.1162/089120103322711578. ISSN 0891-2017. 2024년 6월 7일에 원본 문서에서 보존된 문서. 2024년 6월 7일에 확인함.
Halevy, Alon; Norvig, Peter; Pereira, Fernando (March 2009). “The Unreasonable Effectiveness of Data”. 《IEEE Intelligent Systems》 24 (2): 8–12. doi:10.1109/MIS.2009.36. ISSN 1541-1672.