Thank you very much Zdenko. The traineddata resources were very helpful and i would try to train if required.
On Friday, June 28, 2024 at 5:28:50 PM UTC+2 zdenop wrote: > As far as I remember, the traineddata are from > https://github.com/arturaugusto/display_ocr/blob/master/letsgodigital/letsgodigital.traineddata > Also, check https://github.com/Shreeshrii/tessdata_ssd for Seven Segment > Display recognition. > > Zdenko > > > pi 28. 6. 2024 o 17:07 'uday kaipa' via tesseract-ocr < > [email protected]> napísal(a): > >> Hi Zdenko, >> >> >> Thanks for your recommendation about image format and letsgodigital >> trainidata. Yes, you are right. I got the digits from a segment display. >> I would try the training process before that i wanted to try other >> options. >> >> I suppose you have used the lets.traindata >> <https://github.com/adrianlazaro8/Tesseract_sevenSegmentsLetsGoDigital/blob/master/Trained%20data/lets.traineddata> >> after >> renaming, when i tried the same command with same psm, on the PNG image, I >> got .4 instead. >> By the way, Did you apply any processing on the image?, the edges look >> slightly different. >> >> tesseract 14.png out -l lets --oem 0 --psm 7 >> .4 >> >> Thanks for your time. >> >> On Friday, June 28, 2024 at 3:31:15 PM UTC+2 zdenop wrote: >> >>> First of all, using jpg as a format for image processing and OCR is not >>> very smart. >>> >>> Next: it does not seem like a very standard font... maybe you will need >>> to train tesseract for it. >>> For me, it looks like a heavy preprocessed 7-segment font... so I tried >>> this: >>> >>> tesseract 14.png - --psm 7 --oem 0 -l letsgodigital >>> 14 >>> >>> Zdenko >>> >>> >>> pi 28. 6. 2024 o 14:09 'uday kaipa' via tesseract-ocr < >>> [email protected]> napísal(a): >>> >>>> I have resized the image so that text height would be around 30pxs and >>>> i have tried with 10px boarder as recommended in some threads here. >>>> I converted image to binary, and tried all PSM modes. >>>> I am not sure why it is not OCR'ed properly. >>>> >>>> Any help is appreciated. :) >>>> >>>> >>>> >>>> >>>> >>>> On Thursday, June 27, 2024 at 6:24:36 PM UTC+2 uday kaipa wrote: >>>> >>>>> Hi, >>>>> >>>>> I have an image having number 96 in it.(that might contains a number >>>>> between 0 and 100.) PFA. >>>>> I have used tesseract PSM from 6 to 13 and image size and font and >>>>> everything looks good to me. Text is recognized as 36. >>>>> When i try to adjust padding or other pre-processing, it would work >>>>> for this image and some images are recognized incorrectly. >>>>> >>>>> Can anyone recommend any other pre-processing that might improve the >>>>> recognition. >>>>> >>>>> *t**esseract --oem 1 --psm 7 -c tessedit_char_whitelist=0123456789.: >>>>> C:/Users/xxx/Desktop/test_folder/IMG_2303_2cfac/subboxes/Image_BHU32_1_PREPROCESSED_27-06-2024_17h39m53s.JPG >>>>> >>>>> new hocr* >>>>> >>>>> >>>>> *Many thanks in advance.* >>>>> >>>>> >>>>> *Regards* >>>>> *Uday* >>>>> >>>>> >>>>> -- >>>> You received this message because you are subscribed to the Google >>>> Groups "tesseract-ocr" group. >>>> To unsubscribe from this group and stop receiving emails from it, send >>>> an email to [email protected]. >>>> To view this discussion on the web visit >>>> https://groups.google.com/d/msgid/tesseract-ocr/d59827e4-6973-45af-92c0-e2aebbd7f2e7n%40googlegroups.com >>>> >>>> <https://groups.google.com/d/msgid/tesseract-ocr/d59827e4-6973-45af-92c0-e2aebbd7f2e7n%40googlegroups.com?utm_medium=email&utm_source=footer> >>>> . >>>> >>> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to [email protected]. >> > To view this discussion on the web visit >> https://groups.google.com/d/msgid/tesseract-ocr/09a5c5e1-2cc7-49c2-9833-e2dc5c770203n%40googlegroups.com >> >> <https://groups.google.com/d/msgid/tesseract-ocr/09a5c5e1-2cc7-49c2-9833-e2dc5c770203n%40googlegroups.com?utm_medium=email&utm_source=footer> >> . >> > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/a87756eb-fe10-4497-8570-c8cb93c5e649n%40googlegroups.com.

