[tesseract-ocr] About training in Tesseract 4.0

anhlt Mon, 11 Feb 2019 22:47:02 -0800

I have some questions about training in Tesseract 4.0

1.Since we can't obtain the font file (not included in Tesseract's fonts) , 
is there any way to do the training without the font file?


2. Also we are doing some image training, for the same word in many images, 
is it necessary to make many box files or it would be more accurate just 
with on box file? 
   For example, I have a [0] in all images, and I'm declaring this [0] in 
many box files

3. Is there any difference or priority in lang setting? 
    For example lang=jpn+eng and lang=eng+jpn , is there any difference?
    The 1st language to be set in lang will be default as top priority ? 

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/c97ae660-3016-4b5a-948d-c66b53a13135%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

[tesseract-ocr] About training in Tesseract 4.0

Reply via email to