You can use the command combine_tessdata <http://tesseract-ocr.googlecode.com/svn-history/trunk/doc/combine_tessdata.1.html> to unpack a traineddata file to examine its components.
The eng.traineddata bundled with Tess4J is of 3.01 version. You may want to try 3.02 and see if it can produce better results for you (check in https://code.google.com/p/tesseract-ocr/downloads/list). On Monday, January 12, 2015 at 10:18:18 AM UTC-6, newbie wrote: > > Does anyone know that if tessdata/eng.traineddata(the final crunched > data) in tess4j comes with all the below files included ? > > > - tessdata/eng.config > - tessdata/eng.unicharset > - tessdata/eng.unicharambigs > - tessdata/eng.inttemp > - tessdata/eng.pffmtable > - tessdata/eng.normproto > - tessdata/eng.punc-dawg > - tessdata/eng.word-dawg > - tessdata/eng.number-dawg > - tessdata/eng.freq-dawg > > Also is this enough to identify any of the normal fonts(images attached) ? > Appreciate your help. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this group, send email to tesseract-ocr@googlegroups.com. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/991f0517-29d9-440b-97e4-8e2616c30033%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.