I think I have found a way to do this in tesseract 3.00, after testing and reading the source code. I will try to describe it this week on http://www.sk-spell.sk.cx/tesseract-ocr-en...
Zd.

On 17. Apr, 10:01 h., zdpo <[email protected]> wrote:
> Hello,
>
> Can somebody suggest me what to do, let tesseract recognize font name
> during training?
>
> When I run 'tesseract arial.tif junk nobatch box.train.stderr'
>
> I got this message:
>
> Tesseract Open Source OCR Engine
> APPLY_BOXES:
> Boxes read from boxfile: 231
> Initially labelled blobs: 231 in 7 rows
> Box failures detected: 0
> Duped blobs for rebalance: 0
> "l" has fewest samples: 1
> Total unlabelled words: 0
> Final labelled words: 231
> Generating training data
> TRAINING ... Font name = UnknownFont.
> Generated training data for 231 blobs
>
> I would like to let tesseract use correct font name during process
> (e.g. arial) and not "UnknownFont".
>
> Br,
>
> Zd.
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To post to this group, send email to [email protected].
> To unsubscribe from this group, send email to
> [email protected].
> For more options, visit this group
> at http://groups.google.com/group/tesseract-ocr?hl=en.

