Post-processing steps is a very excellent idea. -srirnaga(77yrsold) On Wed, May 26, 2010 at 8:39 AM, nguyenq <[email protected]> wrote:
> You can perform some text manipulations in post-processing steps to > strip out diacritical marks to leave only the base ASCII characters > behind. > > On May 25, 3:34 pm, haratron <[email protected]> wrote: > > http://www.linux.com/archive/feed/57222 > > "Also, it can generate output only in the US-ASCII character set, so > > glyphs with accent marks or other unsupported attributes will probably > > be reproduced incorrectly." > > > > Which is the option to make it limit output to the ASCII charset only? > > Some letters such as "a" are outputted as glyph symbols. > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To post to this group, send email to [email protected]. > To unsubscribe from this group, send email to > [email protected]<tesseract-ocr%[email protected]> > . > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en. > > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.

