Using different interpolation methods of magnification gave me different results, but I was not able to get the "/" character out of the string. Magnifying the image by 200% using a Box, Triangle, or Catmull-Rom interpolation algorithm gave me "NIA". Using Mitchell, I got "NVA". The Cubic B-Spline was too fuzzy for Tesseract to recognize any of the characters.
Does anyone have any further ideas? I wish there was a way to tell Tesseract to ignore font embellishments, such as italics or underlining. On Tuesday, October 8, 2024 at 10:16:17 AM UTC-5 [email protected] wrote: > Hi > Did you try this trick ?? > > On Tue, 8 Oct 2024, 20:42 Art Rhyno, <[email protected]> wrote: > >> You could try resizing the image, with imagemagick, something like: >> >> >> >> convert test.bmp -resize 200% test.png >> >> >> >> That seems to be enough to separate out the “N” and the “/”. >> >> >> >> art >> >> >> >> *From:* [email protected] <[email protected]> *On >> Behalf Of *Will Fetherolf >> *Sent:* Monday, October 7, 2024 9:33 PM >> *To:* tesseract-ocr <[email protected]> >> *Subject:* [tesseract-ocr] Help with recognition please >> >> >> >> You don't often get email from [email protected]. Learn why this is >> important <https://aka.ms/LearnAboutSenderIdentification> >> >> The application I'm attempting to OCR is using what I think is Arial for >> the font, but every time I run the attached image through Tesseract 5.4.0 >> on Windows I get "NVA" or "NIA" depending on which PSM I use. If I use 7, >> I always get back "NIA". I have tried running training on a variety of >> captured data from my application with no success. >> >> >> >> Help me, Obi-Wan Kenobi, you're my only hope! >> >> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to [email protected]. >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/tesseract-ocr/01ab548e-e45e-48b7-824d-73debed1adb1n%40googlegroups.com >> >> <https://groups.google.com/d/msgid/tesseract-ocr/01ab548e-e45e-48b7-824d-73debed1adb1n%40googlegroups.com?utm_medium=email&utm_source=footer> >> . >> >> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to [email protected]. >> > To view this discussion on the web visit >> https://groups.google.com/d/msgid/tesseract-ocr/YQBPR0101MB85429847B45CE3732F0ECE5FDC7E2%40YQBPR0101MB8542.CANPRD01.PROD.OUTLOOK.COM >> >> <https://groups.google.com/d/msgid/tesseract-ocr/YQBPR0101MB85429847B45CE3732F0ECE5FDC7E2%40YQBPR0101MB8542.CANPRD01.PROD.OUTLOOK.COM?utm_medium=email&utm_source=footer> >> . >> > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/dadea12d-5d9f-4e9d-a6e0-72582e239eb8n%40googlegroups.com.

