On Tuesday, 2 October 2012 22:33:35 UTC+13, Nick White wrote: > > Hi Donaldo, > > It's great to hear how you're getting on. Thanks for sharing in so > much detail! > > I'll reply / comment below. > > On Mon, Oct 01, 2012 at 04:04:36PM -0700, Donaldo wrote: > > I ran tesseract to train it up on a few fonts. The txt files produced > were full > > of blank characters. It seems to be important to separate the tokens in > each > > file name with a hyphen. > > You mean with lazytrain? Can you explain further, I'm not following. > Yes, I used lazytrain. See my previous message for the commands I used. Re the tokens, I used file names like *epo.freeserif-bold-italic.exp0.tif *rather than (without hyphens)* ** epo.freeserifbolditalic.exp0.tif* Is that necessary?
Donaldo -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

