Hello ShreeDevi,

https://medium.com/apegroup-texts/training-tesseract-for-labels-receipts-and-such-690f452e8f79

The link above is a full-fledged tutorial on using and training Tesseract 3.0.

https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00

The GitHub page for 4.00, on the other hand, would benefit from more detail, I feel. Can you please clarify the points below?

1) How should I train Tesseract if I don't know the fonts in advance, or if the image files may contain random fonts?
2) The GitHub tutorial says to skip the clustering steps (mftraining, cntraining, shapeclustering). Is that right?
3) I want to generate a trained data file and merge it with the tessdata that is already present, without replacing it.

Can you please explain how to achieve these steps?

Thank you.

On Monday, April 3, 2017 at 8:11:33 PM UTC+5:30, shree wrote:
>
> Saurabh,
>
> It depends on what you want to do with the bash script.
>
> Here is a sample of a script I used to compare results from different
> tessdata files by looping through a set of image files. Google the bash
> commands to figure out what they do!
>
> #!/bin/bash
> # Echo each line as it is read and each command as it runs.
> set -vx
> export TESSDATA_PREFIX=/mnt/c/Users/User/shree/tesseract-ocr
>
> # Run every JPEG through three traineddata files and time each run.
> for img_file in *.jpeg; do
>     time tesseract "${img_file}" "${img_file%.*}-ssd" -l ssd
>     time tesseract "${img_file}" "${img_file%.*}-ssdsmall" --psm 6 --oem 1 -l ssdsmall
>     time tesseract "${img_file}" "${img_file%.*}-eng" --psm 6 --oem 1 -l eng
> done
>
> ShreeDevi
> ____________________________________________________________
> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>
> On Mon, Apr 3, 2017 at 7:10 PM, Saurabh Srivastav wrote:
>
>> Hello Shree! Thank you for your help.
>> Could you please help me write a bash script for Tesseract?
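To make point 3 concrete, here is a minimal sketch of the kind of thing I have in mind. It is only an assumption about how this could work, not something taken from the training wiki: the language name "mylabels", the helper name add_traineddata, and the paths are all hypothetical. The idea is to copy the new file next to the existing traineddata files without ever overwriting one of them, and then name both languages with -l.

```shell
#!/bin/bash
# Sketch only: "mylabels" is a hypothetical language name and the paths
# below are placeholders.

# Copy a new .traineddata file into a tessdata directory, refusing to
# overwrite a file of the same name that is already there.
add_traineddata() {
    local src="$1"
    local tessdata_dir="$2"
    local dest
    dest="${tessdata_dir}/$(basename "${src}")"
    if [ -e "${dest}" ]; then
        echo "refusing to overwrite ${dest}" >&2
        return 1
    fi
    cp "${src}" "${dest}"
}

# Usage (not run here):
#   add_traineddata ./mylabels.traineddata /path/to/tessdata
#   tesseract receipt.jpeg receipt-out -l eng+mylabels
```

With the file in place, eng.traineddata stays untouched and -l eng+mylabels asks Tesseract to use both languages together. Whether that counts as "merging" in the sense I am asking about is exactly what I would like confirmed.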

