[tesseract-ocr] Error using tesstrain with START_MODEL - failed to continue

2023-10-19 Thread Keith Smith
Hi, Could someone help me understand why I am getting the following error when using tesstrain with the START_MODEL option? Failed to continue from: data/micr_ref/micr.lstm >From my local tesstrain repo (cloned from https://github.com/tesseract-ocr/tesstrain), I have the following in

[tesseract-ocr] tesstrain help needed - failed to continue

2023-10-19 Thread Keith Smith
Hi, thanks in advance for your help. I am trying to use tesstrain to train tesseract to read the MICR line of checks, but am getting a "failed to continue" error as described below. Perhaps I am misunderstanding how to use tesstrain. Here is my data directory in my tesstrain directory: data

[tesseract-ocr] Nearly 99% accuracy

2023-10-19 Thread Des Bw
I am getting nearly 99% accuracy by training from the top layer of the network. I am training using synthetic data; and the evaluation is done the same type of data. But, the result is not extending to actually scanned documents. On the scanned documents, I am getting lower accuracy,

Re: [tesseract-ocr] accuracy problem after trained in fine-tune

2023-10-19 Thread Des Bw
Hi Ali, How is your training going? Do you get good results with the training-from-the-scratch? On Friday, September 15, 2023 at 6:42:26 PM UTC+3 tesseract-ocr wrote: > yes, two months ago when I started to learn OCR I saw that. it was very > helpful at the beginning. > On Friday, 15

Re: [tesseract-ocr] Should box include surrounding space?

2023-10-19 Thread 'Danny Wilson' via tesseract-ocr
Sorry, I had the coordinate system flipped on my last post. Here is a correct image produced by text2image and includes both FULLWIDTH COMMA and COMMA.  For both types of comma, the boxes produced by text2image include only the boundaries of the glyph itself and does not consider the vertical