You can use the command combine_tessdata 
<http://tesseract-ocr.googlecode.com/svn-history/trunk/doc/combine_tessdata.1.html>
 
to unpack a traineddata file to examine its components.

The eng.traineddata bundled with Tess4J is of 3.01 version. You may want to 
try 3.02 and see if it can produce better results for you (check in 
https://code.google.com/p/tesseract-ocr/downloads/list).

On Monday, January 12, 2015 at 10:18:18 AM UTC-6, newbie wrote:
>
> Does anyone know that if  tessdata/eng.traineddata(the final crunched 
> data) in tess4j comes with all the below files included ?
>
>
>    - tessdata/eng.config
>    - tessdata/eng.unicharset
>    - tessdata/eng.unicharambigs
>    - tessdata/eng.inttemp
>    - tessdata/eng.pffmtable
>    - tessdata/eng.normproto
>    - tessdata/eng.punc-dawg
>    - tessdata/eng.word-dawg
>    - tessdata/eng.number-dawg
>    - tessdata/eng.freq-dawg
>
> Also is this enough to identify any of the normal fonts(images attached) ? 
> Appreciate your help.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/991f0517-29d9-440b-97e4-8e2616c30033%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to