[tesseract-ocr] Bad results on custom traditional chinese

laurent . lemoine2 Tue, 17 May 2016 08:21:58 -0700

Hi,

Here's the point: I have to train tesseract on a new font in traditional 
chinese. For now, all the results were not good enough.
I've just tried to train it with only a small set of characters and 1 input 
image.
Then I took a sample of that image to test it.


The image is:


And the detected text is: 客戶服務 置龍擇語言 設交置 社交
I'm using tesseract 3.02 on Windows.

The questions are:
 - What kind of machine learning concept tesseract use ?
 - How can I have better results with tesseract ?
    - Do I have to train it with a lot of different images ?
    - Do I have some parameters to play with on the training part ?

Thanks.

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/5ca02ec4-14ef-4a39-8a01-252142894cf2%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

[tesseract-ocr] Bad results on custom traditional chinese

Reply via email to