Hey everyone, I am trying to use Tesseract to extract an image of each word from a scanned image (say, a Chinese article). I have been reading the code for several days but still cannot find a way to do that. It seems like the engine tries different combinations of adjacent connected components to recognize a single word. Does anyone have a suggestion? I really need your help, thanks! It seems my previous post failed to go through, so I am trying again.

