shree,
can you please help me out how to perform arabic training on tesseract 4.

thank you


On Thursday, May 4, 2017 at 3:22:42 PM UTC+5:30, shree wrote:
>
> Ibr,
>
> You are incorrect in your description of LSTM training.
>
> What you are doing will use the ara.traineddata provided in the repo, 
> there will be no change in output.
>
> Once lstmf files are created, you have to run lstmtraining which will run 
> for days/weeks  to give you a good result.
>
> Please read about LSTM training on wiki.
>
> On May 4, 2017 2:58 PM, "Ibr" <ibr....@gmail.com <javascript:>> wrote:
>
>> if you are referring to tesseract 4.00alpha with liptonica 1.74.1, and if 
>> you compiled them in the correct way and got the binaries that you need for 
>> training lmstf files, then I recommend to follow the suggestions that is 
>> made by tesseract devs which is: once you create an .lstmf file for a 
>> certain font (that can be used for Arabic writing) then get the official 
>> ara.traineddata file from GitHub paste it in tessdata folder, and the lstmf 
>> file in tesseract folder and run the command  tesseract text_image 
>> result_text -l ara --oem 1 
>> what Arabic characters exactly are you trying to enhance the accuracy for 
>> ?
>>
>> On Saturday, April 8, 2017 at 11:52:25 AM UTC+3, Ahmad Moawad wrote:
>>
>>> Hello All,
>>>
>>>
>>> I want to make training for Arabic language in Tesseract 4.0, and The 
>>> result of this version is great but still need some tunning, so I got 
>>> jTessBoxEditor 2.0 beta.
>>> I tried to modify the incorrect characters and build ara.traineddata. 
>>> After copying the ara.traineddata to 
>>> /usr/share/tesseract-ocr/4.00/tessdata, I got random characters when I run 
>>> the tesseract on the image.
>>> So any suggestion of how making training for Version 4.0, I already know 
>>> that that last version 3.0x cube doesn't included in 4.0 LSTM or waiting 
>>> until Ray makes another updated ara.traineddata.
>>>
>>> ,Thanks.
>>>
>> -- 
>> You received this message because you are subscribed to the Google Groups 
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to tesser...@googlegroups.com <javascript:>.
>> To post to this group, send email to tesser...@googlegroups.com 
>> <javascript:>.
>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/tesseract-ocr/1c842b1e-1dc1-418b-a5b7-368c11e7dfa5%40googlegroups.com
>>  
>> <https://groups.google.com/d/msgid/tesseract-ocr/1c842b1e-1dc1-418b-a5b7-368c11e7dfa5%40googlegroups.com?utm_medium=email&utm_source=footer>
>> .
>> For more options, visit https://groups.google.com/d/optout.
>>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/7bf66a4e-f85f-4b87-bf82-5688cb2cac8a%40googlegroups.com.

Reply via email to