Hello ShreeDevi,

https://medium.com/apegroup-texts/training-tesseract-for-labels-receipts-and-such-690f452e8f79

In the link, we can see a full fledged tutorial of tesseract 3.0 version, 
of using it and training it. Can you please clarify the below points...?

https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00

But  in the github link, i feel its good if they elaborate more..

1) How should i train tesseract if i dont know or i may get random fonts in 
image files. ?

2) In github tutorial, its specified that we should skip clustering steps 
(mftraining, cntraining, shapeclustering)  ?

3) And I want to generate a trained data file, and want to merge with 
tessdata(already present ) and dont want to replace it?


Can you please specify how to achieve these steps..?


Thank You.






On Monday, April 3, 2017 at 8:11:33 PM UTC+5:30, shree wrote:
>
> Saurabh,
>
> It depends on what you want to do with the bash script.
>
> Here is a sample of a script I used to compare results using diff tessdata 
> files by looping thru a set of image files. Google the bash commands to 
> figure out what they do!
>
> #!/bin/bash
> set -vx
> export TESSDATA_PREFIX=/mnt/c/Users/User/shree/tesseract-ocr
>
>     img_files=$(ls *.jpeg)
>     for img_file in ${img_files}; do
>         time tesseract ${img_file} ${img_file%.*}-ssd  -l ssd
>         time tesseract ${img_file} ${img_file%.*}-ssdsmall  --psm 6 --oem 
> 1 -l ssdsmall 
>         time tesseract ${img_file} ${img_file%.*}-eng  --psm 6 --oem 1 -l 
> eng 
>     done    
>
>
> ShreeDevi
> ____________________________________________________________
> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>
> On Mon, Apr 3, 2017 at 7:10 PM, Saurabh Srivastav <[email protected] 
> <javascript:>> wrote:
>
>> hello  shree ! thank you for your help.
>> may you please help me how can i write a bash  script for tesseract.
>>
>> -- 
>> You received this message because you are subscribed to the Google Groups 
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to [email protected] <javascript:>.
>> To post to this group, send email to [email protected] 
>> <javascript:>.
>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/tesseract-ocr/ac53f578-d14c-401b-b65e-b222fe4cb067%40googlegroups.com
>>  
>> <https://groups.google.com/d/msgid/tesseract-ocr/ac53f578-d14c-401b-b65e-b222fe4cb067%40googlegroups.com?utm_medium=email&utm_source=footer>
>> .
>> For more options, visit https://groups.google.com/d/optout.
>>
>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/358f92a6-2dba-4ef2-b02a-925accfa94ff%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to