FreeOCR is average at best. Even standard pages seem to be a tough job for it.
However, TopOCR is awesome. Text-only pages come out fantastic. Even on pages filled with technical material and non-standard English words, it does a very good job.
Thanks a lot
Have started downloading the *ahem* version of ABBYY. Let me see how it does. Will keep you posted!
Just finished with some of the pages. Used ABBYY this time round and am pretty impressed.
I was scanning some material in which some pages have Java code and others are plain English. The plain English ones came out great. The ones with the code weren't bad either. I didn't have to edit much in the final document. Attached are the original and scanned versions.
^ As stated in my previous post, I would go with ABBYY. It's professional software and does an extremely good job. Recommended if you have composite documents: pages with text plus tables, XML, or other non-standard content.
For plain English documents, TopOCR would do the job well.
I installed the 'trojan' version. The trojan thing is BS; all sorts of cracks cause the AVs to go berserk with alarms. Haven't been impacted so far though.