https://github.com/tesseract-ocr/tessdoc/blob/master/TrainingTesseract-4.00.md#fine-tuning-for-impact


https://github.com/Shreeshrii/tess4training/blob/master/1-makedata.sh

https://github.com/Shreeshrii/tess4training/blob/master/4-impact_from_full.sh



On Fri, Apr 3, 2020, 11:57 Suppressed <[email protected]> wrote:

> You got any guides or threads that could help me in the process? Im kinda
> lost, not gonna lie.
>
> 2020. április 3., péntek 4:54:11 UTC+3 időpontban shree a következőt írta:
>>
>> try finetune for impact using your font.
>>
>> On Thu, Apr 2, 2020 at 11:51 PM Suppressed <[email protected]> wrote:
>>
>>> Im working on a project in which I need to read digit values from an
>>> image, then do tasks based on the values that get extracted.
>>> Because of this, mistakes arent really acceptable. I attached the
>>> picture as an example of what the images look like.
>>> The digits barely change, they dont change positioning or angle, only
>>> some have more or less pixels each time but it isnt much.
>>>
>>> 23999
>>> 29999
>>> 30999
>>> 40000
>>> 40000
>>> 40000
>>> 40000
>>> 1
>>> 43000
>>> 44000
>>>
>>> 44000
>>>
>>> 44500
>>>
>>> This is what tesseract extracts from the image. As you can see its
>>> mostly fine but instead for 4111 it extracts 1. Now, this can vary if I
>>> change the languages or change some thresholding values, but that might
>>> work for this case, but it wont work for the other ones.
>>> I guess only training would be a possibility to fix errors, but I
>>> couldnt really do it. The positions or angles of the data doesnt change,
>>> its just the font I Would need to train, but I dont know how to generate a
>>> lot of training information.
>>>
>>> code:
>>> img = cv2.imread(xy.png',cv2.IMREAD_GRAYSCALE)
>>> ret,thresh1 = cv2.threshold(img,150,255,cv2.THRESH_BINARY_INV)
>>> ROI1 = thresh1[130:1050,1280:1420]
>>> text = pytesseract.image_to_string(ROI1,config="digits")
>>>
>>> I imagegrab the screen and select ROI.
>>>
>>> Any suggestion? Maybe theres some training data that with some digits in
>>> it that I could change to my font?
>>>
>>> --
>>> You received this message because you are subscribed to the Google
>>> Groups "tesseract-ocr" group.
>>> To unsubscribe from this group and stop receiving emails from it, send
>>> an email to [email protected].
>>> To view this discussion on the web visit
>>> https://groups.google.com/d/msgid/tesseract-ocr/a0fd3ccf-f681-4c34-8113-7d15f3a44101%40googlegroups.com
>>> <https://groups.google.com/d/msgid/tesseract-ocr/a0fd3ccf-f681-4c34-8113-7d15f3a44101%40googlegroups.com?utm_medium=email&utm_source=footer>
>>> .
>>>
>>
>>
>> --
>>
>> ____________________________________________________________
>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/c30448a9-7027-4288-8945-f3a59342b1ea%40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/c30448a9-7027-4288-8945-f3a59342b1ea%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduU%2BSP%3DQHZGrrBpzkDb140tLSBCnikNfOFM_sa20w4SaLA%40mail.gmail.com.

Reply via email to