Optical character recognition (English Wikipedia)

Riedl, C.; Zanibbi, R.; Hearst, M. A.; Zhu, S.; Menietti, M.; Crusan, J.; Metelsky, I.; Lakhani, K. (February 20, 2016). "Detecting Figures and Part Labels in Patents: Competition-Based Development of Image Processing Algorithms". International Journal on Document Analysis and Recognition. 19 (2): 155. arXiv:1410.6751. doi:10.1007/s10032-016-0260-8. S2CID 11873638.

bisok.com (Global: low place; English: low place)

"How the Best OCR Technology Captures 99.91% of Data". www.bisok.com. Retrieved May 27, 2021.

books.google.com (Global: 3^rd place; English: 3^rd place)

Dhavale, Sunita Vikrant (2017). Advanced Image-Based Spam Detection and Filtering Techniques. Hershey, PA: IGI Global. p. 91. ISBN 9781683180142.
Zeng, Qing-An (2015). Wireless Communications, Networking and Applications: Proceedings of WCNA 2014. Springer. ISBN 978-81-322-2580-5.
"Google Books Ngram Viewer". books.google.com. Retrieved July 20, 2023. When we generated the original Ngram Viewer corpora in 2009, our OCR wasn't as good […]. This was especially obvious in pre-19th century English, where the elongated medial-s (ſ) was often interpreted as an f, […]. Here's evidence of the improvements we've made since then, using the corpus operator to compare the 2009, 2012 and 2019 versions […]
Kapidakis, Sarantos; Mazurek, Cezary; Werla, Marcin (2015). Research and Advanced Technology for Digital Libraries (PDF). Springer. p. 257. doi:10.1007/978-3-319-24592-8. ISBN 9783319245928. Archived from the original on November 3, 2025.

civilica.com (Global: low place; English: low place)

Mohseni, Maedeh Haji Agha; Azmi, Reza; Layeghi, Kamran; Maleki, Sajad (2019). Comparison of Synthesized and Natural Datasets in Neural Network Based Handwriting Solutions. ITCT – via Civilica.

code.google.com (Global: 4,942^nd place; English: 4,061^st place)

"Code and Data to evaluate OCR accuracy, originally from UNLV/ISRI". Google Code Archive.

damiles.com (Global: low place; English: low place)

blog.damiles.com

"Basic OCR in OpenCV | Damiles". Blog.damiles.com. November 20, 2008. Retrieved June 16, 2013.
"The basic pattern recognition and classification with openCV | Damiles". Blog.damiles.com. November 14, 2008. Retrieved June 16, 2013.

dataid.com (Global: low place; English: low place)

"OCR Introduction". Dataid.com. Retrieved June 16, 2013.

dlib.org (Global: low place; English: low place)

Holley, Rose (April 2009). "How Good Can It Get? Analysing and Improving OCR Accuracy in Large Scale Historic Newspaper Digitisation Programs". D-Lib Magazine. Retrieved January 5, 2014.

doi.org (Global: 2^nd place; English: 2^nd place)

d'Albe, E. E. F. (July 1, 1914). "On a Type-Reading Optophone". Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences. 90 (619): 373–375. Bibcode:1914RSPSA..90..373D. doi:10.1098/rspa.1914.0061.
Tappert, C. C.; Suen, C. Y.; Wakahara, T. (1990). "The state of the art in online handwriting recognition". IEEE Transactions on Pattern Analysis and Machine Intelligence. 12 (8): 787. Bibcode:1990ITPAM..12..787T. doi:10.1109/34.57669. S2CID 42920826.
Sezgin, Mehmet; Sankur, Bulent (2004). "Survey over image thresholding techniques and quantitative performance evaluation" (PDF). Journal of Electronic Imaging. 13 (1): 146. Bibcode:2004JEI....13..146S. doi:10.1117/1.1631315. Archived from the original (PDF) on October 16, 2015. Retrieved May 2, 2015.
Gupta, Maya R.; Jacobson, Nathaniel P.; Garcia, Eric K. (2007). "OCR binarisation and image pre-processing for searching historical documents" (PDF). Pattern Recognition. 40 (2): 389. Bibcode:2007PatRe..40..389G. doi:10.1016/j.patcog.2006.04.043. Archived from the original (PDF) on October 16, 2015. Retrieved May 2, 2015.
Trier, Oeivind Due; Jain, Anil K. (1995). "Goal-directed evaluation of binarisation methods" (PDF). IEEE Transactions on Pattern Analysis and Machine Intelligence. 17 (12): 1191–1201. Bibcode:1995ITPAM..17.1191T. doi:10.1109/34.476511. Archived (PDF) from the original on October 16, 2015. Retrieved May 2, 2015.
Milyaev, Sergey; Barinova, Olga; Novikova, Tatiana; Kohli, Pushmeet; Lempitsky, Victor (2013). "Image Binarization for End-to-End Text Understanding in Natural Images". 2013 12th International Conference on Document Analysis and Recognition (PDF). pp. 128–132. doi:10.1109/ICDAR.2013.33. ISBN 978-0-7695-4999-6. S2CID 8947361. Archived (PDF) from the original on November 13, 2017. Retrieved May 2, 2015.
Pati, P.B.; Ramakrishnan, A.G. (May 29, 1987). "Word Level Multi-script Identification". Pattern Recognition Letters. 29 (9): 1218–1229. Bibcode:2008PaReL..29.1218P. doi:10.1016/j.patrec.2008.01.027.
Riedl, C.; Zanibbi, R.; Hearst, M. A.; Zhu, S.; Menietti, M.; Crusan, J.; Metelsky, I.; Lakhani, K. (February 20, 2016). "Detecting Figures and Part Labels in Patents: Competition-Based Development of Image Processing Algorithms". International Journal on Document Analysis and Recognition. 19 (2): 155. arXiv:1410.6751. doi:10.1007/s10032-016-0260-8. S2CID 11873638.
Kapidakis, Sarantos; Mazurek, Cezary; Werla, Marcin (2015). Research and Advanced Technology for Digital Libraries (PDF). Springer. p. 257. doi:10.1007/978-3-319-24592-8. ISBN 9783319245928. Archived from the original on November 3, 2025.
Atkinson, Kristine H. (2015). "Reinventing nonpatent literature for pharmaceutical patenting". Pharmaceutical Patent Analyst. 4 (5): 371–375. doi:10.4155/ppa.15.21. PMID 26389649.

ejohn.org (Global: low place; English: low place)

Resig, John (January 23, 2009). "John Resig – OCR and Neural Nets in JavaScript". Ejohn.org. Retrieved June 16, 2013.

erols.com (Global: 8,858^th place; English: low place)

users.erols.com

Suen, C.Y.; Plamondon, R.; Tappert, A.; Thomassen, A.; Ward, J.R.; Yamamoto, K. (May 29, 1987). Future Challenges in Handwriting and Computer Applications. 3rd International Symposium on Handwriting and Computer Applications, Montreal, May 29, 1987. Retrieved October 3, 2008.

explainthatstuff.com (Global: low place; English: low place)

Woodford, Chris (January 30, 2012). "How does OCR document scanning work?". Explain that Stuff. Retrieved June 16, 2013.

harvard.edu (Global: 18^th place; English: 17^th place)

ui.adsabs.harvard.edu

d'Albe, E. E. F. (July 1, 1914). "On a Type-Reading Optophone". Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences. 90 (619): 373–375. Bibcode:1914RSPSA..90..373D. doi:10.1098/rspa.1914.0061.
Tappert, C. C.; Suen, C. Y.; Wakahara, T. (1990). "The state of the art in online handwriting recognition". IEEE Transactions on Pattern Analysis and Machine Intelligence. 12 (8): 787. Bibcode:1990ITPAM..12..787T. doi:10.1109/34.57669. S2CID 42920826.
Sezgin, Mehmet; Sankur, Bulent (2004). "Survey over image thresholding techniques and quantitative performance evaluation" (PDF). Journal of Electronic Imaging. 13 (1): 146. Bibcode:2004JEI....13..146S. doi:10.1117/1.1631315. Archived from the original (PDF) on October 16, 2015. Retrieved May 2, 2015.
Gupta, Maya R.; Jacobson, Nathaniel P.; Garcia, Eric K. (2007). "OCR binarisation and image pre-processing for searching historical documents" (PDF). Pattern Recognition. 40 (2): 389. Bibcode:2007PatRe..40..389G. doi:10.1016/j.patcog.2006.04.043. Archived from the original (PDF) on October 16, 2015. Retrieved May 2, 2015.
Trier, Oeivind Due; Jain, Anil K. (1995). "Goal-directed evaluation of binarisation methods" (PDF). IEEE Transactions on Pattern Analysis and Machine Intelligence. 17 (12): 1191–1201. Bibcode:1995ITPAM..17.1191T. doi:10.1109/34.476511. Archived (PDF) from the original on October 16, 2015. Retrieved May 2, 2015.
Pati, P.B.; Ramakrishnan, A.G. (May 29, 1987). "Word Level Multi-script Identification". Pattern Recognition Letters. 29 (9): 1218–1229. Bibcode:2008PaReL..29.1218P. doi:10.1016/j.patrec.2008.01.027.

havenondemand.com (Global: low place; English: low place)

community.havenondemand.com

"Extracting text from images using OCR on Android". June 27, 2015. Archived from the original on March 15, 2016.
"[Tutorial] OCR on Google Glass". October 23, 2014. Archived from the original on March 5, 2016.
"[javascript] Using OCR and Entity Extraction for LinkedIn Company Lookup". July 22, 2014. Archived from the original on April 17, 2016.
"How to optimize results from the OCR API when extracting text from an image? - Haven OnDemand Developer Community". Archived from the original on March 22, 2016.

dev.havenondemand.com

"OCR Document". Haven OnDemand. Archived from the original on April 15, 2016.
"Supported Media Formats". Haven OnDemand. Archived from the original on April 19, 2016.

helsinki.fi (Global: 1,670^th place; English: 2,355^th place)

blogs.helsinki.fi

"What is the point of an online interactive OCR text editor? - Fenno-Ugrica". February 21, 2014.

hoopoes.com (Global: low place; English: low place)

"scanno". Hoopoes. May 2001.

ibm.com (Global: 1,131^st place; English: 850^th place)

www-03.ibm.com

"IBM Press room - 2009-07-28 IBM to Acquire SPSS Inc. to Provide Clients Predictive Analytics Capabilities - United States". www-03.ibm.com. July 28, 2009. Archived from the original on July 31, 2009. Retrieved January 9, 2026.

microsoft.com (Global: 153^rd place; English: 151^st place)

Milyaev, Sergey; Barinova, Olga; Novikova, Tatiana; Kohli, Pushmeet; Lempitsky, Victor (2013). "Image Binarization for End-to-End Text Understanding in Natural Images". 2013 12th International Conference on Document Analysis and Recognition (PDF). pp. 128–132. doi:10.1109/ICDAR.2013.33. ISBN 978-0-7695-4999-6. S2CID 8947361. Archived (PDF) from the original on November 13, 2017. Retrieved May 2, 2015.

nicomsoft.com (Global: low place; English: low place)

"Optical Character Recognition (OCR) – How it works". Nicomsoft.com. Retrieved June 16, 2013.

nih.gov (Global: 4^th place; English: 4^th place)

pubmed.ncbi.nlm.nih.gov

Atkinson, Kristine H. (2015). "Reinventing nonpatent literature for pharmaceutical patenting". Pharmaceutical Patent Analyst. 4 (5): 371–375. doi:10.4155/ppa.15.21. PMID 26389649.

nytimes.com (Global: 7^th place; English: 7^th place)

Fehr, Tiff (March 26, 2019). "How We Sped Through 900 Pages of Cohen Documents in Under 10 Minutes". The New York Times. ISSN 0362-4331. Retrieved June 16, 2023.

ocrwizard.com (Global: low place; English: low place)

"How OCR Software Works". OCRWizard. Archived from the original on August 16, 2009. Retrieved June 16, 2013.

patents.google.com (Global: 1,182^nd place; English: 725^th place)

US1838389A, Emanuel, Goldberg, "Statistical machine", issued December 29, 1931

researchgate.net (Global: 120^th place; English: 125^th place)

Assefi, Mehdi (December 2016). "OCR as a Service: An Experimental Evaluation of Google Docs OCR, Tesseract, ABBYY FineReader, and Transym". ResearchGate.

semanticscholar.org (Global: 11^th place; English: 8^th place)

api.semanticscholar.org

Tappert, C. C.; Suen, C. Y.; Wakahara, T. (1990). "The state of the art in online handwriting recognition". IEEE Transactions on Pattern Analysis and Machine Intelligence. 12 (8): 787. Bibcode:1990ITPAM..12..787T. doi:10.1109/34.57669. S2CID 42920826.
Milyaev, Sergey; Barinova, Olga; Novikova, Tatiana; Kohli, Pushmeet; Lempitsky, Victor (2013). "Image Binarization for End-to-End Text Understanding in Natural Images". 2013 12th International Conference on Document Analysis and Recognition (PDF). pp. 128–132. doi:10.1109/ICDAR.2013.33. ISBN 978-0-7695-4999-6. S2CID 8947361. Archived (PDF) from the original on November 13, 2017. Retrieved May 2, 2015.
Riedl, C.; Zanibbi, R.; Hearst, M. A.; Zhu, S.; Menietti, M.; Crusan, J.; Metelsky, I.; Lakhani, K. (February 20, 2016). "Detecting Figures and Part Labels in Patents: Competition-Based Development of Image Processing Algorithms". International Journal on Document Analysis and Recognition. 19 (2): 155. arXiv:1410.6751. doi:10.1007/s10032-016-0260-8. S2CID 11873638.

sfu.ca (Global: 3,413^th place; English: 2,445^th place)

cs.sfu.ca

"Breaking a Visual CAPTCHA". Cs.sfu.ca. December 10, 2002. Retrieved June 16, 2013.

tesseract-ocr.googlecode.com (Global: low place; English: low place)

Smith, Ray (2007). "An Overview of the Tesseract OCR Engine" (PDF). Archived from the original (PDF) on September 28, 2010. Retrieved May 23, 2013.

trainyourtesseract.com (Global: low place; English: low place)

"Train Your Tesseract". Train Your Tesseract. September 20, 2018. Retrieved September 20, 2018.

ualberta.ca (Global: 3,600^th place; English: 2,528^th place)

webdocs.cs.ualberta.ca

Sezgin, Mehmet; Sankur, Bulent (2004). "Survey over image thresholding techniques and quantitative performance evaluation" (PDF). Journal of Electronic Imaging. 13 (1): 146. Bibcode:2004JEI....13..146S. doi:10.1117/1.1631315. Archived from the original (PDF) on October 16, 2015. Retrieved May 2, 2015.

uio.no (Global: 2,613^th place; English: 2,294^th place)

heim.ifi.uio.no

Trier, Oeivind Due; Jain, Anil K. (1995). "Goal-directed evaluation of binarisation methods" (PDF). IEEE Transactions on Pattern Analysis and Machine Intelligence. 17 (12): 1191–1201. Bibcode:1995ITPAM..17.1191T. doi:10.1109/34.476511. Archived (PDF) from the original on October 16, 2015. Retrieved May 2, 2015.

univ-tours.fr (Global: low place; English: low place)

rfai.li.univ-tours.fr

Gupta, Maya R.; Jacobson, Nathaniel P.; Garcia, Eric K. (2007). "OCR binarisation and image pre-processing for searching historical documents" (PDF). Pattern Recognition. 40 (2): 389. Bibcode:2007PatRe..40..389G. doi:10.1016/j.patcog.2006.04.043. Archived from the original (PDF) on October 16, 2015. Retrieved May 2, 2015.

web.archive.org (Global: 1^st place; English: 1^st place)

"OCR Document". Haven OnDemand. Archived from the original on April 15, 2016.
"Supported Media Formats". Haven OnDemand. Archived from the original on April 19, 2016.
"IBM Press room - 2009-07-28 IBM to Acquire SPSS Inc. to Provide Clients Predictive Analytics Capabilities - United States". www-03.ibm.com. July 28, 2009. Archived from the original on July 31, 2009. Retrieved January 9, 2026.
"Extracting text from images using OCR on Android". June 27, 2015. Archived from the original on March 15, 2016.
"[Tutorial] OCR on Google Glass". October 23, 2014. Archived from the original on March 5, 2016.
"[javascript] Using OCR and Entity Extraction for LinkedIn Company Lookup". July 22, 2014. Archived from the original on April 17, 2016.
Sezgin, Mehmet; Sankur, Bulent (2004). "Survey over image thresholding techniques and quantitative performance evaluation" (PDF). Journal of Electronic Imaging. 13 (1): 146. Bibcode:2004JEI....13..146S. doi:10.1117/1.1631315. Archived from the original (PDF) on October 16, 2015. Retrieved May 2, 2015.
Gupta, Maya R.; Jacobson, Nathaniel P.; Garcia, Eric K. (2007). "OCR binarisation and image pre-processing for searching historical documents" (PDF). Pattern Recognition. 40 (2): 389. Bibcode:2007PatRe..40..389G. doi:10.1016/j.patcog.2006.04.043. Archived from the original (PDF) on October 16, 2015. Retrieved May 2, 2015.
Trier, Oeivind Due; Jain, Anil K. (1995). "Goal-directed evaluation of binarisation methods" (PDF). IEEE Transactions on Pattern Analysis and Machine Intelligence. 17 (12): 1191–1201. Bibcode:1995ITPAM..17.1191T. doi:10.1109/34.476511. Archived (PDF) from the original on October 16, 2015. Retrieved May 2, 2015.
Milyaev, Sergey; Barinova, Olga; Novikova, Tatiana; Kohli, Pushmeet; Lempitsky, Victor (2013). "Image Binarization for End-to-End Text Understanding in Natural Images". 2013 12th International Conference on Document Analysis and Recognition (PDF). pp. 128–132. doi:10.1109/ICDAR.2013.33. ISBN 978-0-7695-4999-6. S2CID 8947361. Archived (PDF) from the original on November 13, 2017. Retrieved May 2, 2015.
Smith, Ray (2007). "An Overview of the Tesseract OCR Engine" (PDF). Archived from the original (PDF) on September 28, 2010. Retrieved May 23, 2013.
"How OCR Software Works". OCRWizard. Archived from the original on August 16, 2009. Retrieved June 16, 2013.
"How to optimize results from the OCR API when extracting text from an image? - Haven OnDemand Developer Community". Archived from the original on March 22, 2016.

worldcat.org (Global: 5^th place; English: 5^th place)

search.worldcat.org

Fehr, Tiff (March 26, 2019). "How We Sped Through 900 Pages of Cohen Documents in Under 10 Minutes". The New York Times. ISSN 0362-4331. Retrieved June 16, 2023.

Optical character recognition (English Wikipedia)

andrewt.net (Global: low place; English: low place)

archive.org (Global: 6th place; English: 6th place)

archive.org

ia600608.us.archive.org

arxiv.org (Global: 69th place; English: 59th place)

bisok.com (Global: low place; English: low place)

books.google.com (Global: 3rd place; English: 3rd place)

civilica.com (Global: low place; English: low place)

code.google.com (Global: 4,942nd place; English: 4,061st place)

damiles.com (Global: low place; English: low place)

blog.damiles.com

dataid.com (Global: low place; English: low place)

dlib.org (Global: low place; English: low place)

doi.org (Global: 2nd place; English: 2nd place)

ejohn.org (Global: low place; English: low place)

erols.com (Global: 8,858th place; English: low place)

users.erols.com

explainthatstuff.com (Global: low place; English: low place)

harvard.edu (Global: 18th place; English: 17th place)

ui.adsabs.harvard.edu

havenondemand.com (Global: low place; English: low place)

community.havenondemand.com

dev.havenondemand.com

helsinki.fi (Global: 1,670th place; English: 2,355th place)

blogs.helsinki.fi

hoopoes.com (Global: low place; English: low place)

ibm.com (Global: 1,131st place; English: 850th place)

www-03.ibm.com

microsoft.com (Global: 153rd place; English: 151st place)

nicomsoft.com (Global: low place; English: low place)

nih.gov (Global: 4th place; English: 4th place)

pubmed.ncbi.nlm.nih.gov

nytimes.com (Global: 7th place; English: 7th place)

ocrwizard.com (Global: low place; English: low place)

patents.google.com (Global: 1,182nd place; English: 725th place)

researchgate.net (Global: 120th place; English: 125th place)

semanticscholar.org (Global: 11th place; English: 8th place)

api.semanticscholar.org

sfu.ca (Global: 3,413th place; English: 2,445th place)

cs.sfu.ca

tesseract-ocr.googlecode.com (Global: low place; English: low place)

trainyourtesseract.com (Global: low place; English: low place)

ualberta.ca (Global: 3,600th place; English: 2,528th place)

webdocs.cs.ualberta.ca

uio.no (Global: 2,613th place; English: 2,294th place)

heim.ifi.uio.no

univ-tours.fr (Global: low place; English: low place)

rfai.li.univ-tours.fr

web.archive.org (Global: 1st place; English: 1st place)

worldcat.org (Global: 5th place; English: 5th place)

search.worldcat.org

archive.org (Global: 6^th place; English: 6^th place)

arxiv.org (Global: 69^th place; English: 59^th place)

books.google.com (Global: 3^rd place; English: 3^rd place)

code.google.com (Global: 4,942^nd place; English: 4,061^st place)

doi.org (Global: 2^nd place; English: 2^nd place)

erols.com (Global: 8,858^th place; English: low place)

harvard.edu (Global: 18^th place; English: 17^th place)

helsinki.fi (Global: 1,670^th place; English: 2,355^th place)

ibm.com (Global: 1,131^st place; English: 850^th place)

microsoft.com (Global: 153^rd place; English: 151^st place)

nih.gov (Global: 4^th place; English: 4^th place)

nytimes.com (Global: 7^th place; English: 7^th place)

patents.google.com (Global: 1,182^nd place; English: 725^th place)

researchgate.net (Global: 120^th place; English: 125^th place)

semanticscholar.org (Global: 11^th place; English: 8^th place)

sfu.ca (Global: 3,413^th place; English: 2,445^th place)

ualberta.ca (Global: 3,600^th place; English: 2,528^th place)

uio.no (Global: 2,613^th place; English: 2,294^th place)

web.archive.org (Global: 1^st place; English: 1^st place)

worldcat.org (Global: 5^th place; English: 5^th place)