Hi, 

You have probably tried this already, but just in case: on line 40, try the following:

ocrData = pytesseract.image_to_data(
    thresh,
    output_type=Output.DICT,
    config='--tessdata-dir /new/folder/address/Share/ --oem 0 -c textord_equation_detect=1',
    lang='equ')


Please create a folder containing the files "equ.traineddata" and 
"eng.traineddata" from https://github.com/tesseract-ocr/tessdata and point 
--tessdata-dir at that folder. 
You might need the "configs" folder too, but try without it first.

Also try lang='eng+equ'.
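
To make it easier to switch between lang='equ' and lang='eng+equ', here is a minimal sketch that builds the two arguments and checks that the traineddata files are actually in the folder. The helper names build_tesseract_args and missing_traineddata are my own for illustration, not part of pytesseract, and the folder path is the same placeholder as above.

```python
from pathlib import Path


def build_tesseract_args(tessdata_dir, langs=("eng", "equ"), oem=0,
                         equation_detect=True):
    """Return (lang, config) strings to pass to pytesseract.image_to_data.

    Hypothetical helper: it only assembles the same options used in the
    call above, nothing more.
    """
    lang = "+".join(langs)
    config = f"--tessdata-dir {tessdata_dir} --oem {oem}"
    if equation_detect:
        config += " -c textord_equation_detect=1"
    return lang, config


def missing_traineddata(tessdata_dir, langs):
    """List the languages whose <lang>.traineddata file is not in the folder."""
    folder = Path(tessdata_dir)
    return [name for name in langs
            if not (folder / f"{name}.traineddata").is_file()]


# Intended use (assumes pytesseract is installed and `thresh` is your
# preprocessed image, as in your script):
#
#   lang, config = build_tesseract_args("/new/folder/address/Share/")
#   ocrData = pytesseract.image_to_data(
#       thresh, output_type=Output.DICT, lang=lang, config=config)
```

If missing_traineddata returns a non-empty list, Tesseract will most likely fail to load that language, so it is worth checking this before comparing results.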

I got this idea from https://github.com/tesseract-ocr/tesseract/issues/2204

Please let me know whether your results improve.

Kind regards, 

On Wednesday, 27 May 2020 at 23:01:48 UTC+2, Владимир Калачихин 
wrote:
>
> This is not production code, just a sketch.
>
