Analysis of information sources in references of the Wikipedia article "Optical character recognition" in English language version.
When we generated the original Ngram Viewer corpora in 2009, our OCR wasn't as good […]. This was especially obvious in pre-19th century English, where the elongated medial-s (ſ) was often interpreted as an f, […]. Here's evidence of the improvements we've made since then, using the corpus operator to compare the 2009, 2012 and 2019 versions […]
{{cite book}}
: CS1 maint: multiple names: authors list (link)