How to extract the images of each word from the whole image page?

longyi Mon, 21 Feb 2011 15:00:25 -0800

Hey everyone,
I am trying to use tesseract to extract images of each word from a scanned 
images (say, a Chinese article).
I have looked into the codes for several days, but still unable to find a 
way to do that.
It seems like the this engine are trying to try different adjacent connected 
components combination to recognize a single word.
Anyone have suggestion on that? really need your help, thanks!
Seems like the previous post are failed. So I am trying again


-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/tesseract-ocr?hl=en.

How to extract the images of each word from the whole image page?

Reply via email to