Anna Rogers, Olga Kovaleva e Anna Rumshisky, A Primer in BERTology: What we know about how BERT works, in arXiv, 9 novembre 2020, arXiv:2002:12327. URL consultato il 16 settembre 2022.
«in a little over a year, BERT has become a ubiquitous baseline in NLP experiments»