Global Information Lookup Global Information

Comparison of optical character recognition software information


This comparison of optical character recognition software includes:

  • OCR engines, that do the actual character identification
  • Layout analysis software, that divide scanned documents into zones suitable for OCR
  • Graphical interfaces to one or more OCR engines
  • Software development kits that are used to add OCR capabilities to other software (e.g. forms processing applications, document imaging management systems, e-discovery systems, records management solutions)
Sortable table
Name Founded year Latest stable version Release year License Online Windows Mac OS X Linux BSD Android iOS Programming language SDK? Languages Fonts Output Formats Notes
ABBYY FineReader 1989 16 2022 Proprietary Yes Yes Yes No Yes Yes Yes C/C++ Yes 192[1] All fonts DOC, DOCX, XLS, XLSX, PPTX, RTF, PDF, HTML, CSV, TXT, ODT, DjVu, EPUB, FB2[2] ABBYY also supplies SDKs for embedded and mobile devices. Professional, Corporate and Site License Editions for Windows, Express Edition for Mac.[3]
AnyDoc Software 1989 ? ? Proprietary No Yes No No No ? ? VBScript ? ? ? Works with structured, semi-structured, and unstructured documents.
Asprise OCR SDK 1998 15 2015 Proprietary Yes Yes Yes Yes Yes ? ? Java, C#,VB.NET, C/C++/Delphi Yes 20+[4] ? Plain text, searchable PDF, XML[5] Java, C#, VB.NET, C/C++/Delphi SDKs for OCR and Barcode recognition on Windows, Linux, Mac OS X and Unix.[6]
CuneiForm 1996 1.1 2011 BSD variant No Yes Yes Yes Yes ? ? C/C++ Yes 28 Any printed font HTML, hOCR, native, RTF, TeX, TXT[7] Enterprise-class system, can save text formatting and recognizes complicated tables of any structure
E-aksharayan 2010 Yes No Yes No ? ? 14 RTF, TXT, BRL
GOCR 2000 0.52[8] 2018 GPL Yes[9] Yes Yes Yes Yes ? ? C ? 20+ ?
Google Drive OCR or Google Cloud Vision 2015 Proprietary Yes Browser Browser Browser Unknown ? ? Unknown Yes 200+ All fonts text Google blog post[10][11]
Microsoft Office Document Imaging ? Office 2007 2007 Proprietary No Yes No No No ? ? ? ? ? ? Uses OmniPage[citation needed]
Microsoft Office OneNote 2007 2011 ? 2007 Proprietary No Yes No No No ? ? ? ? ? ?
OCRFeeder 2009-03 0.8.5 2022 GPL No No No Yes No ? ? Python ? ? ? Features a full user interface and has a command-line tool for automatic operations. Has its own segmentation algorithm but uses system-wide OCR engines like Tesseract or Ocrad
Ocrad ? 0.28[12] 2022 GPL Yes No Yes Yes Yes ? ? C++ Yes Latin alphabet ? Command line
OCRopus 2007 1.3.3 2017 Apache No No Yes Yes Yes ? ? Python ? All languages using Latin script (other languages can be trained) Normal Latin script and Fraktur (other scripts can be trained) TXT, hOCR,[13] PDF[14] Pluggable framework under active development, used for Google Books
OmniPage 1970s 19.2 2015 Proprietary Yes Yes Yes Yes No ? ? C/C++, C#[15] Yes 125[16] Machine and handprinted fonts DOC/DOCX XLS/XLSX PPTX RTF PDF PDF/A Searchable PDF HTML Text XML ePUB MP3 Product of Nuance Communications
Puma.NET ? ? 2009 BSD No Yes No No No ? ? C# Yes 28 Any printed font .NET OCR SDK based on Cognitive Technologies' CuneiForm recognition engine. Wraps Puma COM server and provides simplified API for .NET applications
ReadSoft ? ? ? Proprietary No Yes No No No ? ? ? ? ? ? Scan, capture and classify business documents such as invoices, forms and purchase orders integrated with business processes.
Scantron ? ? ? Proprietary No Yes No No No ? ? ? ? ? ? For working with localized interfaces, corresponding language support is required.
SmartScore 1991 10.5.8 2015 Proprietary No Yes Yes No No ? ? ? ? ? ? For musical scores
Tesseract 1985 5.3.3 2023 Apache No Yes Yes Yes Yes ? ? C++, C Yes 100+[17] Any printed font Text, ALTO, hOCR,[18] PDF, others with different user interfaces[19] or the API Created by Hewlett-Packard; under further development by Google[20]
Name Founded year Latest stable version Release year License Online Windows Mac OS X Linux BSD Android iOS Programming language SDK? Languages Fonts Output Formats Notes
  1. ^ "ABBYY FineReader 14: Technical Specifications". Finereader.abbyy.com. Retrieved 2017-02-23.
  2. ^ "ABBYY FineReader 11: Technical Specifications". Finereader.abbyy.com. Retrieved 2013-09-12.
  3. ^ "Top OCR Software". Ocrworld.com. 2010-03-30. Archived from the original on 2017-02-23. Retrieved 2013-09-12.
  4. ^ "Asprise OCR SDK Features". asprise.com. Retrieved 2014-06-21.
  5. ^ "Asprise Java OCR Library Features". asprise.com. Retrieved 2014-06-21.
  6. ^ "Asprise Java, C#/VB.NET OCR API". asprise.com. 2015-11-19. Retrieved 2015-11-19.
  7. ^ Debian manual page for Cuneiform for Linux version 1.1.0
  8. ^ "GOCR Homepage". wasd.urz.uni-magdeburg.de. Retrieved 2018-10-17.
  9. ^ "GOCR". Jocr.sourceforge.net. Retrieved 2013-09-12.
  10. ^ "Supported languages". Feb 11, 2022.
  11. ^ Ashok Popat (Sep 4, 2015). "IEEE SPS: Optical Character Recognition for Most of the World's Languages". YouTube. Archived from the original on 2021-12-20.
  12. ^ Diaz, Antonio (2022-01-17). "GNU Ocrad 0.28 released" (Mailing list). info-gnu.
  13. ^ OCRopus includes the ocropus-hocr tool which produces hOCR from the recognition results.
  14. ^ In combination with the hocr-tools
  15. ^ "OmniPage CSDK - OCR Document Capture Toolkit | Document Imaging & OCR". Nuance. Archived from the original on 2010-08-24. Retrieved 2013-09-12.
  16. ^ "OmniPage Standard Document Conversion". Nuance. Archived from the original on 2014-03-13. Retrieved 2014-02-25.
  17. ^ Based on count of language training files for version 3.04. Available at the download page.
  18. ^ Usage explained in the Tesseract Readme and FAQ
  19. ^ Such as ODF with OCRFeeder
  20. ^ "GitHub - tesseract-ocr/tesseract: Tesseract Open Source OCR Engine (main repository)". GitHub. Retrieved 2018-11-05.

and 28 Related for: Comparison of optical character recognition software information

Request time (Page generated in 0.8963 seconds.)

Comparison of optical character recognition software

Last Update:

This comparison of optical character recognition software includes: OCR engines, that do the actual character identification Layout analysis software, that...

Word Count : 390

Optical character recognition

Last Update:

Optical character recognition or optical character reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed...

Word Count : 4097

List of PDF software

Last Update:

embedded text is a common feature, but other applications perform optical character recognition (OCR) to convert imaged text to machine-readable form, sometimes...

Word Count : 1258

Chinese character IT

Last Update:

In comparison with offline handwriting, online handwriting recognition is more efficient, because the computer not only 'sees' the written character but...

Word Count : 3165

ABBYY

Last Update:

intelligence and optical character recognition (OCR). Primarily focused on software as a service model, the company serves clients worldwide. One of ABBYY's best-known...

Word Count : 761

Computer vision

Last Update:

situation or picking parts from a bin. Optical character recognition (OCR) – identifying characters in images of printed or handwritten text, usually with...

Word Count : 7528

Comparison of screencasting software

Last Update:

This page provides a comparison of notable screencasting software, used to record activities on the computer screen. This software is commonly used for...

Word Count : 312

Google Keep

Last Update:

variety of tools for taking notes, including texts, lists, images, and audio. Text from images can be extracted using optical character recognition and voice...

Word Count : 1668

Ocrad

Last Update:

Ocrad is an optical character recognition program and part of the GNU Project. It is free software licensed under the GNU GPL. Based on a feature extraction...

Word Count : 324

Enterprise content management

Last Update:

as sources. Recognition technologies to extract information from scanned documents and digital faxes include: Optical character recognition (OCR): Converts...

Word Count : 4323

Project Naptha

Last Update:

produce hardcopy art, and the identification of these works. By adopting several Optical Character Recognition (OCR) algorithms, including libraries developed...

Word Count : 2491

Yandex Translate

Last Update:

(optical character recognition) technology) – in apps for mobile phones; the "Suggest translation" button (user patches to help improve the quality of...

Word Count : 1024

Monospaced font

Last Update:

output, causing the information to align in vertical columns. Optical character recognition has better accuracy with monospaced fonts. Examples are OCR-A...

Word Count : 1077

Virtual keyboard

Last Update:

A virtual keyboard is a software component that allows the input of characters without the need for physical keys. Interaction with a virtual keyboard...

Word Count : 2145

Visual odometry

Last Update:

(1996). "Comparison of Approaches to Egomotion Computation" (PDF). IEEE Computer Society Conference on Computer Vision and Pattern Recognition: 315. Archived...

Word Count : 1669

Apple TV

Last Update:

the Apple TV. In July 2008, Apple released the software 2.1 update which added external recognition of iPhones and iPod Touches as alternative remote...

Word Count : 11063

3D reconstruction

Last Update:

system is at the optical center of the camera's lens as shown in the figure. Actually, the camera's image plane is behind the optical center of the camera's...

Word Count : 3927

SPSS

Last Update:

software, or entered during computer-assisted personal interviewing, by scanning and using optical character recognition and optical mark recognition...

Word Count : 2446

Structure from motion

Last Update:

3D reconstruction from multiple images Bundle adjustment Comparison of photogrammetry software Computer stereo vision Epipolar geometry Kinetic depth effect...

Word Count : 2566

Reverse image search

Last Update:

(keywords). Mobile Visual Search solutions enable you to integrate image recognition software capabilities into your own branded mobile applications. Mobile Visual...

Word Count : 2852

Universal Character Set characters

Last Update:

Graphical representations of many control characters. Box Drawing. Block Elements. Braille Patterns. Optical Character Recognition. Technical. Dingbats. Miscellaneous...

Word Count : 6986

Computer keyboard

Last Update:

light. The 1984 Apricot Portable is an early example of an IR keyboard. Optical character recognition (OCR) is preferable to rekeying for converting existing...

Word Count : 8193

Avidemux

Last Update:

available. Avidemux has built-in subtitle processing, both for optical character recognition of DVD subtitles and for rendering hard subtitles. Avidemux supports...

Word Count : 805

SubRip

Last Update:

subtitles distributed on the Internet are in this format. Using optical character recognition, SubRip can extract from live video, video files and DVDs, then...

Word Count : 1808

Ray Kurzweil

Last Update:

He is involved in fields such as optical character recognition (OCR), text-to-speech synthesis, speech recognition technology and electronic keyboard...

Word Count : 8464

Topological skeleton

Last Update:

analysis, pattern recognition and digital image processing for purposes such as optical character recognition, fingerprint recognition, visual inspection...

Word Count : 1384

Outline of natural language processing

Last Update:

interface History of natural-language understanding History of optical character recognition History of question answering History of speech synthesis...

Word Count : 7757

Photomath

Last Update:

Google. It features a computer algebra system with an augmented optical character recognition system, designed for use with a smartphone's camera to scan...

Word Count : 871

PDF Search Engine © AllGlobal.net