Program: gImageReader and Tesseract
License: Open Source
Description: gImageReader is a GUI front end for the Tesseract OCR engine
gImageReader is an excellent front end for the Tesseract OCR engine.
Tesseract is an open source OCR engine that converts images into editable text. It is installed onto a system that has Tesseract already installed, which is why this App Request lists both of them.
- Open images and PDFs
- Acquire from scanner
- Select the part of the image to recognize
- Support for different recognition languages
- Side by side comparison of source image and output text
- Remove linebreaks in output text
- Supports tesseract 3.0
One challenge is that while it also supports spellcheck, it uses the dictionary from OpenOffice. Possibly could be configured to use the dictionaries in LibreOffice Portable?