It could be that a threshold operation is taking place at a lower brightness than your grey text, in which case the text would be turned white along with the background. Try binarizing the image with a high threshold value (e.g. 200) before sending it to tesseract; this should make all the text black.
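A rough sketch of that suggestion in Python, in case it helps. It assumes 8-bit greyscale values (0 = black, 255 = white) and that you are calling tesseract through Pillow and pytesseract, which the original post does not say; the function names `binarize` and `ocr_with_threshold` are made up for illustration, and 200 is only a starting value to tune against your bills.

```python
def binarize(gray_rows, threshold=200):
    """Map each 8-bit grey value to 0 (black) if below threshold, else 255 (white)."""
    return [[0 if p < threshold else 255 for p in row] for row in gray_rows]


def ocr_with_threshold(path, threshold=200):
    """Binarize an image file, then run tesseract on the result.

    Assumes Pillow and pytesseract are installed; the imports are local so
    the pure-Python helper above still works without them.
    """
    from PIL import Image
    import pytesseract

    gray = Image.open(path).convert("L")  # convert to 8-bit greyscale
    bw = gray.point(lambda p: 0 if p < threshold else 255)
    return pytesseract.image_to_string(bw)


# White background (255), light grey text (170), black text (20).
sample = [[255, 170, 20, 255]]
print(binarize(sample, threshold=200))  # grey and black both become 0
print(binarize(sample, threshold=128))  # grey is lost to the background
```

With a threshold of 128 the grey pixel (170) ends up white, which is the failure described above; with 200 it ends up black like the rest of the text.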
On Saturday, July 28, 2018 at 4:00:16 PM UTC+1, Yogesh Sanchihar wrote:
>
> If we have a text not black, but light greyish. tesseract does not
> recognize it.
>
> Any solutions to this problem.
>
> Have attached images of the sample bill.
>
> Suppose I want to extract Base Fare
>
> Base Fare - *Rs 500*
>
> But Since Base Fare is light greyish. Tesseract does not recognize it at
> all.