Hi, All,
We are researching OCR software for Persian/Pashto. It seems to me that Yale
has used Sakhr and Verus for Arabic OCR. Sakhr claimed its OCR software for
Persian/Pashto can reach over 99%. If you know someone is using any software
for Arabic/Persian/Pashto, please let me know contact.
We use Abby Finereader for things that will need correction (yearbooks
where the text was handwritten, for example), and Acrobat for things that
we're not willing to spend the time correcting. Finereader is good if you
really want the OCR perfectly formatted, as it can handle tables and charts
and
I have a recently released a bookclub - related app called Bookship, which
features the ability to scan a page of text from a book so users can post
quotes. (www.bookshipapp.com). So my use case is people taking pictures of
pages with their phone and OCR-ing it.
I extensively tested Tesseract
In our testing, the effectiveness relies heavily on the era of the type, and
the cleanliness of the original to avoid artifacts. A 1940s typewritten
document that is a carbon paper copy is not going to do nearly as well as a
clean printed document in times new roman.
We ran tests on some art
All,
What are you all using for OCR software? How well does it work for you?
Do you find that need to scan at a particular resolution to get optimal
OCR results, or do you find yourself doing post-processing on the images
before OCR'ing them? What have your experiences been like?
In the