How about processing the images using ScanTailor or some other tool before feeding them to Tesseract?
On Monday, October 30, 2023 at 4:58:56 AM UTC+3 Art Rhyno wrote: > Maybe use a different segmentation mode? Try changing the line: > > > > text = pytesseract.image_to_string(cropped_image, lang='eng').strip() > > > > to: > > > > text = pytesseract.image_to_string(cropped_image, lang='eng', > config='--psm 6').strip() > > > > That should help. > > > > art > > > > *From:* tesser...@googlegroups.com <tesser...@googlegroups.com> *On > Behalf Of *Paulus Present > *Sent:* Sunday, October 29, 2023 4:21 PM > *To:* tesseract-ocr <tesser...@googlegroups.com> > *Subject:* [tesseract-ocr] Poor results of Tesseract performing a play > card evaluation > > > > You don't often get email from present...@gmail.com. Learn why this is > important <https://aka.ms/LearnAboutSenderIdentification> > > Dear forum members > > I used Tesseract to get 10 Regions Of Interest from a Lorcana play card, > but it didn' succeed very well. It did not succeed in figuring out the > numbers nor the name of the character. I presume this is because of the > image preprocessing as the fonts are not really anything special. Could you > help me figuring out how I could bring Tesseract to better perform on the > PNG? I add 1 sample card and the py code used to deploy Tesseract as well > as the resulting Excel table and the extraced Region Of Interest TIFFs. > > I will be happy with any help anyone can provide. Thanks in advance! > > Paulus > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to tesseract-oc...@googlegroups.com. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/9c2e162e-dce2-4a81-8138-5268b4e16423n%40googlegroups.com > > <https://groups.google.com/d/msgid/tesseract-ocr/9c2e162e-dce2-4a81-8138-5268b4e16423n%40googlegroups.com?utm_medium=email&utm_source=footer> > . > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/510329a9-3b15-46c8-87a9-57517aa2224en%40googlegroups.com.