Hi, you have probably tried this already, but just in case: on line 40, try this:

ocrData = pytesseract.image_to_data(thresh, output_type=Output.DICT, config='--tessdata-dir /new/folder/address/Share/ --oem 0 -c textord_equation_detect=1', lang='equ')

Please create a folder containing the files "equ.traineddata" and "eng.traineddata" from https://github.com/tesseract-ocr/tessdata and point --tessdata-dir at that folder. You might need the configs folder too, but try without it first. Also try lang='eng+equ'.

I got this idea from here: https://github.com/tesseract-ocr/tesseract/issues/2204

Please let me know whether your results improve.

Kind regards,

On Wednesday, 27 May 2020 at 23:01:48 UTC+2, Владимир Калачихин wrote:
>
> This is not a production code, just sketch.
>
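
P.S. In case it helps, here is a minimal sketch of the call suggested above, wrapped so that a missing traineddata file is reported before Tesseract runs. The helper names (build_config, missing_models, run_ocr) are my own and purely illustrative, and quoting the folder path with shlex.quote assumes pytesseract splits the config string shell-style, so please treat it as a starting point rather than tested production code.

```python
import shlex
from pathlib import Path


def build_config(tessdata_dir, oem=0, equation_detect=True):
    """Build the config string from the call above; the folder path is quoted in case it has spaces."""
    return (
        f"--tessdata-dir {shlex.quote(str(tessdata_dir))} "
        f"--oem {oem} "
        f"-c textord_equation_detect={int(equation_detect)}"
    )


def missing_models(tessdata_dir, lang):
    """List the <code>.traineddata files that `lang` needs but the folder lacks."""
    folder = Path(tessdata_dir)
    return [
        f"{code}.traineddata"
        for code in lang.split("+")
        if not (folder / f"{code}.traineddata").is_file()
    ]


def run_ocr(image, tessdata_dir, lang="eng+equ"):
    """Run image_to_data with the suggested settings (hypothetical wrapper, needs pytesseract)."""
    # Imported here so the two helpers above work without pytesseract installed.
    import pytesseract
    from pytesseract import Output

    missing = missing_models(tessdata_dir, lang)
    if missing:
        raise FileNotFoundError(f"{tessdata_dir} is missing: {', '.join(missing)}")
    return pytesseract.image_to_data(
        image,
        output_type=Output.DICT,
        config=build_config(tessdata_dir),
        lang=lang,
    )
```

You could then call run_ocr(thresh, "/new/folder/address/Share/", lang="equ") and again with lang="eng+equ", and compare the "text" and "conf" entries of the two returned dictionaries.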