[tesseract-ocr] OMP_THREAD_LIMIT not working on Windows 10

2020-09-11 Thread Филип Пешић
Hello, I added OMP_THREAD_LIMIT as a new environment variable and set its value to 1 in both the system and user environments, but Tesseract still uses 4 threads. The OCR time did not change either. Does this mean the limit only works on Linux? Windows 10 latest version 2004, build
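
For reference, a minimal sketch of setting the variable per process instead of through the system dialog, assuming Tesseract is launched as a child process from Python. On Windows, variables changed in the system or user settings are only seen by processes started afterwards, so passing the value explicitly rules that out as a cause. The helper names, the `exe` default, and the file arguments are hypothetical, and whether a given Windows build of Tesseract honors OMP_THREAD_LIMIT at all depends on whether it was compiled with OpenMP, which this sketch does not check.

```python
import os
import subprocess


def single_thread_env(base=None):
    """Return a copy of the environment with OMP_THREAD_LIMIT set to "1".

    `base` defaults to the current process environment; it is never mutated.
    """
    env = dict(os.environ if base is None else base)
    env["OMP_THREAD_LIMIT"] = "1"
    return env


def run_tesseract(image_path, output_base, exe="tesseract"):
    """Run the Tesseract CLI with the thread limit passed explicitly.

    `exe`, `image_path`, and `output_base` are placeholders for illustration;
    `exe` assumes the tesseract binary is on PATH.
    """
    return subprocess.run(
        [exe, image_path, output_base],
        env=single_thread_env(),
        check=True,
    )
```

Calling `run_tesseract("page.tif", "page")` would then start Tesseract with the limit in its own environment, regardless of what the already-open shell or IDE inherited.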

[tesseract-ocr] Optimal numbers for the ground truth

2020-08-18 Thread Филип Пешић
Hi, I want to train Tesseract with tesstrain, using .tif and .gt.txt pairs. However, the native images are 231 DPI scans of old books from the 1800s, which I assume is pretty low based on what I have read on many forums. In addition, there is a huge amount of text on the scanned images, basically 90% of