Program: gImageReader and Tesseract
License: Open Source
Description: gImageReader is a GUI front end for the Tesseract OCR engine
Website: http://sourceforge.net/projects/gimagereader/ and http://sourceforge.net/projects/tesseract-ocr/
gImageReader is an excellent front end for the Tesseract OCR engine.
Tesseract is an open source OCR engine that converts images into editable text. gImageReader is installed onto a system that already has Tesseract installed, which is why this App Request lists both of them.
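To illustrate the front end / engine split described above, here is a minimal Python sketch of how a wrapper could hand an image to the Tesseract command-line program. This is not gImageReader's actual code. It assumes a `tesseract` executable is on the PATH, and the function names `build_tesseract_command` and `run_ocr` are made up for this example.

```python
import subprocess


def build_tesseract_command(image_path, output_base, lang="eng"):
    """Build the argument list for: tesseract <image> <outputbase> -l <lang>."""
    return ["tesseract", image_path, output_base, "-l", lang]


def run_ocr(image_path, output_base, lang="eng"):
    """Run Tesseract and return the recognized text.

    Assumes the tesseract executable is installed and on the PATH.
    Tesseract writes its plain-text result to <output_base>.txt.
    """
    subprocess.run(build_tesseract_command(image_path, output_base, lang), check=True)
    with open(output_base + ".txt", encoding="utf-8") as handle:
        return handle.read()
```

Calling `run_ocr("scan.png", "scan_output")` would only work on a machine where Tesseract is already installed, which is the same dependency this request describes.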
gImageReader Features
- Open images and PDFs
- Acquire from scanner
- Select the part of the image to recognize
- Support for different recognition languages
- Side by side comparison of source image and output text
- Remove linebreaks in output text
- Supports Tesseract 3.0
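As a rough idea of what the "remove linebreaks" feature in the list above involves, here is a small Python sketch. It is only an assumption about the behavior (joining the lines of each paragraph while keeping blank lines as paragraph breaks), not gImageReader's real implementation, and `remove_linebreaks` is a hypothetical name.

```python
import re


def remove_linebreaks(text):
    """Join the lines inside each paragraph, keeping blank-line paragraph breaks."""
    # Split on blank lines so paragraph boundaries survive.
    paragraphs = re.split(r"\n\s*\n", text.strip())
    # Inside a paragraph, collapse any run of whitespace (newlines included) to one space.
    joined = [" ".join(paragraph.split()) for paragraph in paragraphs]
    return "\n\n".join(joined)
```

OCR output usually keeps the line breaks of the scanned page, so a step like this makes the text easier to paste into a word processor.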
One challenge is that, while gImageReader also supports spellcheck, it uses the dictionary from OpenOffice. Could it possibly be configured to use the dictionaries in LibreOffice Portable instead?