Outdated: Tesseract-ocr Portable is outdated and is now packaged with gImageReader Portable per John's request.
Description: Tesseract-ocr is an OCR engine that was developed at HP Labs between 1985 and 1995 and is now developed at Google.
The Tesseract OCR engine was one of the top three engines in the 1995 UNLV Accuracy test. It saw little work between 1995 and 2006, but since then Google has improved it extensively, and it is probably one of the most accurate open-source OCR engines available. Combined with the Leptonica Image Processing Library, it can read a wide variety of image formats and convert them to text in over 40 languages.
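For anyone who wants to script the engine rather than use a frontend, here is a rough sketch of calling the command-line program from Python. It assumes a `tesseract` executable is reachable on PATH (the portable plugin may not put it there) and that the `eng` language data is installed. The helper names `build_tesseract_command` and `ocr_image` are made up for illustration and are not part of Tesseract.

```python
import shutil
import subprocess
from pathlib import Path


def build_tesseract_command(image_path, output_base, lang="eng"):
    """Return the argument list for: tesseract <image> <output_base> -l <lang>."""
    return ["tesseract", str(image_path), str(output_base), "-l", lang]


def ocr_image(image_path, output_base, lang="eng"):
    """Run Tesseract on one image and return the recognized text.

    Assumes `tesseract` is on PATH; raises FileNotFoundError otherwise.
    """
    if shutil.which("tesseract") is None:
        raise FileNotFoundError("tesseract executable not found on PATH")
    subprocess.run(build_tesseract_command(image_path, output_base, lang), check=True)
    # Tesseract appends ".txt" to the output base name it is given.
    return Path(str(output_base) + ".txt").read_text(encoding="utf-8")
```

Something like `ocr_image("page.png", "page", lang="deu")` would then write `page.txt` next to the script and return its contents, provided the German language data is present.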
Download Tesseract-ocr Portable 3.01 Development Test 1 Online [716KB download / 22.6MB installed]
Note: Tesseract-ocr is a plugin and will be installed into the
Applications: gImageReader Portable is a graphical GTK frontend to Tesseract-ocr Portable.
Development Test 1 (2012-07-03): Initial release
I'll run this through the paces and see if I can get it to trip up. It might take several days to respond back, though. I'm gonna try to run this against a scan of an entire book, and my netbook is a little slow. Thanks for this, my friend...
Frozen St. Paul, MN
land of the frozen mosquito
Please try out gImageReader as well.