Skip to main content

Top 5 Free OCR Software Tools To Convert Images Into Text

ThumbnailQuite frankly, I wish I knew about this simple way to use freely available OCR software back in my school days. Of course, we didn’t have camera mobile phones or inexpensive Digicams, but wouldn’t it have saved hours of copying notes!

Ah, modern technology is wonderful; take a scanned image (or take a snap using a mobile camera/Digicam) and presto – OCR software extracts all the information from the image into easily editable text format.

Optical character recognition (OCR) is a system of converting scanned printed/handwritten image files into its machine readable text format. OCR software works by analyzing a document and comparing it with fonts stored in its database and/or by noting features typical to characters. Some OCR software also puts it through a spell checker to “guess” unrecognized words. 100% accuracy is difficult to achieve, but close approximation is what most software strive for.


Maybe you have already come across our previous How to Extract Text from Images (OCR) post and used JOCR, a a free OCR software tool. Or you might have set your preference for a few online OCR tools. Then again, if you have thought up ways to exploit OCR software for productivity shortcuts, then let us give you a few more tools to play with.

We will be looking at 5 free pieces of OCR software and to start off let’s see the overlooked two that are already installed on our systems.

OCR Using Microsoft OneNote 2007

For the occasional basic OCR stuff, MS OneNote’s optical character recognition feature is a timesaver. You might have missed it…it’s called Copy Text from Picture.

free ocr software

  • Drag a scan or a saved picture into OneNote. You can also use OneNote to clip part of the screen or an image into OneNote.
  • Right click on the inserted picture and select Copy Text from Picture. The copied optically recognized text goes into the clipboard and you can now paste it into any program like Word or Notepad.

OneNote is simplicity personified. But it’s not too great for handwritten characters or even fuzzy ones. But for a quick job, I am all for OneNote’s clip and paste.

OCR Using Microsoft Office Document Imaging

Another little used tool within the Microsoft family. It’s right there under Menu – Microsoft Office – Microsoft Office Tools – Microsoft Office Document Imaging.

ocr software

Doing OCR using the document imaging tool is a bit limiting because it accepts only TIFF (or MDI) formats. But that’s not too much of a bother as any graphic application can be used to convert an image to TIFF. In the screenshot below, I have used MS Paint to convert a JPEG to a TIFF.

ocr software

  • Open the file in Microsoft Office Document Imaging – File – Open.
  • Click the little eye icon – Recognize Text Using OCR.
  • Click on MS Word Icon – Send Text to Word.
  • A MS Word File opens with the editable converted text.
  • Alternatively, you can also use MS Paint to select a specific area and copy it to the clipboard. Open MS Office Document Imaging – select PagePaste Page to copy the selection for OCR.

Again, MODI handled printed text ably, but my handwritten text was met with an ‘OCR performed but could not recognize text prompt’. Of course, do try out with your own handwriting.

So, now let’s leave the Microsoft family behind and look at three free tools which call themselves OCR Software…

SimpleOCR

The difficulty I was having with handwriting recognition using MS tools, could have found a solution in SimpleOCR. But the software offers handwriting recognition only as a 14 day free trial. Machine print recognition though does not have any restrictions.

best ocr software

  • The software can be set up to read directly from a scanner or by adding a page (jpg, tiff, bmp formats).
  • SimpleOCR offers some control over the conversion through text selection, image selection and text ignore features.
  • Conversion to text takes the process into a validation stage; a user can correct discrepancies in the converted text using an in-built spell-checker.
  • The converted file can be saved to a doc or txt format.

SimpleOCR was fine with normal text, but its handling of multi-column layouts was a comedown. In my opinion, the conversion accuracy of the Microsoft tools was considerably better than SimpleOCR.

SimpleOCR (v3.1) is a 9MB download and is compatible with Windows.

TopOCR

Just what I was talking about in the beginning! TopOCR, in a breakaway from typical OCR software, is designed more for digital cameras (at least 3MP) and mobile phones along with scanners. Like SimpleOCR, it has a two window interface – The source Image window and the Text window.

best ocr software

The image sourced from a camera or a scanner in the left window gets converted to the text format in the text editor on the right. The text editor functions like WordPad and can use Microsoft’s Text to Speech engine.

  • The software supports JPEG, TIFF, GIF and BMP formats.
  • Image settings like brightness, color, contrast, despeckle, sharpen etc. can be used to improve readability of the image.
  • Camera filter settings can also be configured for enhancing the image.
  • The converted file can be saved in a variety of formats – PDF, RTF, HTML and TXT.
  • TopOCR functions well with straight oriented text but the usual failing of OCR with columned text remains.
  • The software though, parses a mixed page (text plus graphics) well and processes the text only.
  • The software works with 11 languages.

For best results with your camera read there How to Get the Best Results with TopOCR page.

TopOCR (v3.1) is an 8MB download and is compatible with Windows (not tested on Vista).

FreeOCR

This free OCR software uses the Tesseract OCR engine. Tesseract OCR code was developed at HP Labs between 1985 and 1995 and is currently with Google. It is thought of as one of the most accurate open source OCR engines available.

FreeOCR is a simple Windows interface for that underlying code.

ocr software freeware

  • It supports most image files and multi-page TIFF files.
  • It can handle PDF formats and is also compatible with TWAIN devices like scanners.
  • FreeOCR also has the familiar double window interface with easy to understand settings.
  • Before starting the one click conversion process, you can adjust the image contrast for better readability.

FreeOCR (v.2.03) requires Microsoft Net 2.0 framework. The Windows XP/Vista compatible 4.38MB software can also be downloaded from this alternate site.

Free OCR tools come with their own limitations. And scanning a page has to do a lot with resolutions, contrasts and clarity of fonts. From an average user’s standpoint, 100% OCR accuracy remains a pipedream.

Though the free tools were adequate with printed text, they failed with normal cursive handwritten text. My personal preference for offhand OCR use leans towards the two Microsoft products I mentioned in the beginning.

Your own say matters. Which is your tool of choice? Do the free OCR software recognize what you through at it? And more importantly, do you recognize what they throw back at you? Let us know…

Comments

Popular posts from this blog

How To Hide Text In Microsoft Word 2007, Reveal It & Protect It

Sometimes what we hide is more important than what we reveal. Especially, documents with sensitive information, some things are supposed to be ‘for some eyes only’. Such scenarios are quite common, even for the more un-secretive among us. You want to show someone a letter composed in MS Word, but want to keep some of the content private; or it’s an official letter with some part of it having critical data. As important as these two are, the most common use could involve a normal printing job. Many a time we have to print different versions of a document, one copy for one set of eyes and others for other sets. Rather than creating multiple copies and therefore multiple printing jobs, what if we could just do it from the same document?  That too, without the hassle of repeated cut and paste. We can, with a simple feature in MS Word – it’s just called Hidden and let me show you how to use it to hide text in Microsoft Word 2007. It’s a simple single click process. Open the docum...

Boom, the startup that wants to build supersonic planes, just signed a massive deal with Virgin

Have you heard about Boom? Boom is a relatively new startup that’s aiming to build something pretty crazy. They’re not building an app… or a social network… or even some new gadget for the Kickstarter crowd. Boom wants to build planes. Really, really, really fast planes. Specifically, they’re trying to design and build a supersonic passenger plane that goes 2.2x the speed of sound. If all goes to plan, they’ll be able to shuttle people from New York to London in 3.5 hours, and SF to Tokyo in 4.5. Sound crazy? I wouldn’t disagree. It’s worth noting that the company is in the very early days for something as intensive, massive, and hugely expensive as designing and producing a passenger aircraft. They’re still working on their first prototype, and hope to fly it by late next year. But it’s also worth noting that the team behind the plane has some serious talent in its blood: the company’s 11 employees have collectively contributed to over 30 aircrafts — having worked on thin...

Fun Tools to Translate Your Name into Japanese Calligraphy

Japanese calligraphy is an artistic writing style of the Japanese language. Its Chinese origins can be traced back to the twenty-eighth century BCE. Calligraphy found its way into Japanese culture in 600 CE and is known as the karayo tradition. For Westerners, calligraphy is forever fascinating. However, it takes years to learn how to properly draw the signs. Two basic principles must be known to understand Japanese writing: there are different writing styles and different alphabets. Kaisho for example, is a writing style most commonly used in print media. Tensho on the other hand is used in signatures. Other writing styles are Reisho, Gyosho and Sousho. The alphabets include Kanji, Hiragana, and Katakana. Katakana is used for writing foreign words. It can also serve to highlight words, in analogy to capital letters as we know them from the Roman / Latin alphabet (Romaji in Japanese). Each Kanji character has a meaning of its own, while Hiragana or Katakana characters merely repres...