Comparison of optical character recognition software information
This comparison of optical character recognition software includes:
OCR engines, that do the actual character identification
Layout analysis software, that divide scanned documents into zones suitable for OCR
Graphical interfaces to one or more OCR engines
Software development kits that are used to add OCR capabilities to other software (e.g. forms processing applications, document imaging management systems, e-discovery systems, records management solutions)
ABBYY also supplies SDKs for embedded and mobile devices. Professional, Corporate and Site License Editions for Windows, Express Edition for Mac.[3]
AnyDoc Software
1989
?
?
Proprietary
No
Yes
No
No
No
?
?
VBScript
?
?
?
Works with structured, semi-structured, and unstructured documents.
Asprise OCR SDK
1998
15
2015
Proprietary
Yes
Yes
Yes
Yes
Yes
?
?
Java, C#,VB.NET, C/C++/Delphi
Yes
20+[4]
?
Plain text, searchable PDF, XML[5]
Java, C#, VB.NET, C/C++/Delphi SDKs for OCR and Barcode recognition on Windows, Linux, Mac OS X and Unix.[6]
CuneiForm
1996
1.1
2011
BSD variant
No
Yes
Yes
Yes
Yes
?
?
C/C++
Yes
28
Any printed font
HTML, hOCR, native, RTF, TeX, TXT[7]
Enterprise-class system, can save text formatting and recognizes complicated tables of any structure
E-aksharayan
2010
Yes
No
Yes
No
?
?
14
RTF, TXT, BRL
GOCR
2000
0.52[8]
2018
GPL
Yes[9]
Yes
Yes
Yes
Yes
?
?
C
?
20+
?
Google Drive OCR or Google Cloud Vision
2015
Proprietary
Yes
Browser
Browser
Browser
Unknown
?
?
Unknown
Yes
200+
All fonts
text
Google blog post[10][11]
Microsoft Office Document Imaging
?
Office 2007
2007
Proprietary
No
Yes
No
No
No
?
?
?
?
?
?
Uses OmniPage[citation needed]
Microsoft Office OneNote 2007
2011
?
2007
Proprietary
No
Yes
No
No
No
?
?
?
?
?
?
OCRFeeder
2009-03
0.8.5
2022
GPL
No
No
No
Yes
No
?
?
Python
?
?
?
Features a full user interface and has a command-line tool for automatic operations. Has its own segmentation algorithm but uses system-wide OCR engines like Tesseract or Ocrad
Ocrad
?
0.28[12]
2022
GPL
Yes
No
Yes
Yes
Yes
?
?
C++
Yes
Latin alphabet
?
Command line
OCRopus
2007
1.3.3
2017
Apache
No
No
Yes
Yes
Yes
?
?
Python
?
All languages using Latin script (other languages can be trained)
Normal Latin script and Fraktur (other scripts can be trained)
TXT, hOCR,[13] PDF[14]
Pluggable framework under active development, used for Google Books
OmniPage
1970s
19.2
2015
Proprietary
Yes
Yes
Yes
Yes
No
?
?
C/C++, C#[15]
Yes
125[16]
Machine and handprinted fonts
DOC/DOCX XLS/XLSX PPTX RTF PDF PDF/A Searchable PDF HTML Text XML ePUB MP3
Product of Nuance Communications
Puma.NET
?
?
2009
BSD
No
Yes
No
No
No
?
?
C#
Yes
28
Any printed font
.NET OCR SDK based on Cognitive Technologies' CuneiForm recognition engine. Wraps Puma COM server and provides simplified API for .NET applications
ReadSoft
?
?
?
Proprietary
No
Yes
No
No
No
?
?
?
?
?
?
Scan, capture and classify business documents such as invoices, forms and purchase orders integrated with business processes.
Scantron
?
?
?
Proprietary
No
Yes
No
No
No
?
?
?
?
?
?
For working with localized interfaces, corresponding language support is required.
SmartScore
1991
10.5.8
2015
Proprietary
No
Yes
Yes
No
No
?
?
?
?
?
?
For musical scores
Tesseract
1985
5.3.3
2023
Apache
No
Yes
Yes
Yes
Yes
?
?
C++, C
Yes
100+[17]
Any printed font
Text, ALTO, hOCR,[18] PDF, others with different user interfaces[19] or the API
Created by Hewlett-Packard; under further development by Google[20]
^Ashok Popat (Sep 4, 2015). "IEEE SPS: Optical Character Recognition for Most of the World's Languages". YouTube. Archived from the original on 2021-12-20.
^Diaz, Antonio (2022-01-17). "GNU Ocrad 0.28 released" (Mailing list). info-gnu.
^OCRopus includes the ocropus-hocr tool which produces hOCR from the recognition results.
^In combination with the hocr-tools
^"OmniPage CSDK - OCR Document Capture Toolkit | Document Imaging & OCR". Nuance. Archived from the original on 2010-08-24. Retrieved 2013-09-12.
^"OmniPage Standard Document Conversion". Nuance. Archived from the original on 2014-03-13. Retrieved 2014-02-25.
^Based on count of language training files for version 3.04. Available at the download page.
This comparisonofopticalcharacterrecognitionsoftware includes: OCR engines, that do the actual character identification Layout analysis software, that...
embedded text is a common feature, but other applications perform opticalcharacterrecognition (OCR) to convert imaged text to machine-readable form, sometimes...
In comparison with offline handwriting, online handwriting recognition is more efficient, because the computer not only 'sees' the written character but...
intelligence and opticalcharacterrecognition (OCR). Primarily focused on software as a service model, the company serves clients worldwide. One of ABBYY's best-known...
situation or picking parts from a bin. Opticalcharacterrecognition (OCR) – identifying characters in images of printed or handwritten text, usually with...
This page provides a comparisonof notable screencasting software, used to record activities on the computer screen. This software is commonly used for...
variety of tools for taking notes, including texts, lists, images, and audio. Text from images can be extracted using opticalcharacterrecognition and voice...
Ocrad is an opticalcharacterrecognition program and part of the GNU Project. It is free software licensed under the GNU GPL. Based on a feature extraction...
as sources. Recognition technologies to extract information from scanned documents and digital faxes include: Opticalcharacterrecognition (OCR): Converts...
produce hardcopy art, and the identification of these works. By adopting several OpticalCharacterRecognition (OCR) algorithms, including libraries developed...
(opticalcharacterrecognition) technology) – in apps for mobile phones; the "Suggest translation" button (user patches to help improve the quality of...
output, causing the information to align in vertical columns. Opticalcharacterrecognition has better accuracy with monospaced fonts. Examples are OCR-A...
A virtual keyboard is a software component that allows the input ofcharacters without the need for physical keys. Interaction with a virtual keyboard...
(1996). "Comparisonof Approaches to Egomotion Computation" (PDF). IEEE Computer Society Conference on Computer Vision and Pattern Recognition: 315. Archived...
the Apple TV. In July 2008, Apple released the software 2.1 update which added external recognitionof iPhones and iPod Touches as alternative remote...
system is at the optical center of the camera's lens as shown in the figure. Actually, the camera's image plane is behind the optical center of the camera's...
(keywords). Mobile Visual Search solutions enable you to integrate image recognitionsoftware capabilities into your own branded mobile applications. Mobile Visual...
light. The 1984 Apricot Portable is an early example of an IR keyboard. Opticalcharacterrecognition (OCR) is preferable to rekeying for converting existing...
available. Avidemux has built-in subtitle processing, both for opticalcharacterrecognitionof DVD subtitles and for rendering hard subtitles. Avidemux supports...
subtitles distributed on the Internet are in this format. Using opticalcharacterrecognition, SubRip can extract from live video, video files and DVDs, then...
analysis, pattern recognition and digital image processing for purposes such as opticalcharacterrecognition, fingerprint recognition, visual inspection...
Google. It features a computer algebra system with an augmented opticalcharacterrecognition system, designed for use with a smartphone's camera to scan...