Hi! I train Tess using separate images for every text line. Recognition is also ran over single text line images. Recognition performs pretty well, however there are many errors that, I believe, related to misdetected baselines, during training or recognition - I don't know. These include:
" (double quote) detected as n S detected as s (and vice versa) V detected as v (and vice versa) etc. Is there any (preferably high-level) way to provide Tess with baseline info? Or at least obtain baseline info from Tess in order to visualize it further for debugging? Thanks, Dmitry -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to tesseract-ocr@googlegroups.com. To unsubscribe from this group, send email to tesseract-ocr+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.