I think I find the way for tesseract 3.00 after testing and looking to
source code...
I will try to describe it this week on 
http://www.sk-spell.sk.cx/tesseract-ocr-en...

Zd.

On 17. Apr, 10:01 h., zdpo <[email protected]> wrote:
> Hello,
>
> Can somebody suggest me what to do, let tesseract recognize font name
> during training?
>
> When I run 'tesseract arial.tif junk nobatch box.train.stderr'
>
> I got this message:
>
> Tesseract Open Source OCR Engine
> APPLY_BOXES:
>    Boxes read from boxfile:     231
>    Initially labelled blobs:    231 in 7 rows
>    Box failures detected:                    0
>    Duped blobs for rebalance:     0
>    "l" has fewest samples:     1
>                                 Total unlabelled words:        0
>                                 Final labelled words:        231
> Generating training data
> TRAINING ... Font name = UnknownFont.
> Generated training data for 231 blobs
>
> I would like to let tesseract use correct font name during process
> (e.g. arial) and not "UnknownFont".
>
> Br,
>
> Zd.
>
> --
> You received this message because you are subscribed to the Google Groups 
> "tesseract-ocr" group.
> To post to this group, send email to [email protected].
> To unsubscribe from this group, send email to 
> [email protected].
> For more options, visit this group 
> athttp://groups.google.com/group/tesseract-ocr?hl=en.

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/tesseract-ocr?hl=en.

Reply via email to