[tesseract-ocr] Re: persian in tesseract-ocr

2015-09-17 Thread buyi wen
if you like tesseract ocr, you may like this free online ocr tool using tesseract ocr 3.02 -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it,

Re: [tesseract-ocr] Re: persian in tesseract-ocr

2015-08-17 Thread ShreeDevi Kumar
On Mon, Aug 17, 2015 at 6:07 AM, ShreeDevi Kumar shreesh...@gmail.com wrote: Ray was looking for comparative feedback regarding the new traineddata for RTL languages, so this will be useful. ​ Ray - https://groups.google.com/forum/#!msg/tesseract-dev/qcFtWCAAlT8/SZ4xBS5DHwwJ Another

Re: [tesseract-ocr] Re: persian in tesseract-ocr

2015-08-17 Thread Hossein Razizadeh
I think the problem is the lack of cube files in persian. Does anyone know how to add cube files to be used by tesseract? There is a 'fas' folder in 'langdata' that contains some cube related data, but I don't know how to use it with tesseract. On Monday, August 17, 2015 at 4:25:23 PM

Re: [tesseract-ocr] Re: persian in tesseract-ocr

2015-08-17 Thread zdenko podobny
On Mon, Aug 17, 2015 at 6:07 AM, ShreeDevi Kumar shreesh...@gmail.com wrote: Ray was looking for comparative feedback regarding the new traineddata for RTL languages, so this will be useful. As far as I know, Google Docs does not use tesseract OCR engine for recognizing the text.

[tesseract-ocr] Re: persian in tesseract-ocr

2015-08-16 Thread Hossein Razizadeh
It seems 'fas' is for Persian, but there are no cube files, resulting in poor results. Arabic language files work much better for Persian images. There is another 'per' folder for Persian, but there isn't even '.traieddata' file for it. Does anyone know if 'Google Doc' has used 'Tesseract' for

Re: [tesseract-ocr] Re: persian in tesseract-ocr

2015-08-16 Thread ShreeDevi Kumar
Ray was looking for comparative feedback regarding the new traineddata for RTL languages, so this will be useful. As far as I know, Google Docs does not use tesseract OCR engine for recognizing the text. Its OCR accuracy is better than Tesseract for some Indian languages also. However, it doesn't

[tesseract-ocr] Re: persian in tesseract-ocr

2015-07-17 Thread Jeff Breidenbach
I think 'fas' is the language code for Persian. -- You received this message because you are subscribed to the Google Groups tesseract-ocr group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this