[CODE4LIB] OCR software for Arabic/Persian/Pashto

2020-01-22 Thread Han, Yan - (yhan)
Hi all, We are researching OCR software for Persian/Pashto. It appears that Yale has used Sakhr and Verus for Arabic OCR. Sakhr claimed that its OCR software for Persian/Pashto can reach over 99% accuracy. If you know of anyone using OCR software for Arabic/Persian/Pashto, please let me know whom to contact.

Re: [CODE4LIB] OCR software

2017-07-26 Thread Laura Buchholz
We use ABBYY FineReader for things that will need correction (yearbooks where the text was handwritten, for example), and Acrobat for things that we're not willing to spend the time correcting. FineReader is good if you really want the OCR perfectly formatted, as it can handle tables and charts and

Re: [CODE4LIB] OCR software

2017-07-20 Thread Mark Watkins
I have recently released a book-club-related app called Bookship, which features the ability to scan a page of text from a book so users can post quotes (www.bookshipapp.com). So my use case is people taking pictures of pages with their phones and OCR-ing them. I extensively tested Tesseract

Re: [CODE4LIB] OCR software

2017-07-19 Thread Bannen, Kerry
In our testing, the effectiveness depends heavily on the era of the type and on how clean the original is, since a clean original avoids artifacts. A 1940s typewritten document that is a carbon-paper copy is not going to do nearly as well as a clean printed document in Times New Roman. We ran tests on some art

[CODE4LIB] OCR software

2017-07-19 Thread Will Martin
All, What are you all using for OCR software? How well does it work for you? Do you find that you need to scan at a particular resolution to get optimal OCR results, or do you find yourself doing post-processing on the images before OCR'ing them? What have your experiences been like? In the