Re: [tesseract-ocr] URGENT DEADLINE: NEED HELP WITH NEW LANGUAGE, PLEASE RESPOND

2020-10-31 Thread Shree Devi Kumar
>When we use tesseract on the images without the trained language we receive outputs that are accurate about 50% of the time. You haven't shared a sample image. Sometimes preprocessing the images, using a whitelist in case of limited character set can be the solution rather than training. On Sun,

Re: [tesseract-ocr] URGENT DEADLINE: NEED HELP WITH NEW LANGUAGE, PLEASE RESPOND

2020-10-31 Thread Shree Devi Kumar
Are you trying to train for the legacy tesseract engine? On Sun, Nov 1, 2020, 03:29 Cailey McVay wrote: > Hello! > I am working on a project that is trying to read borehole video depths. We > trained a new language to read these numbers called NTS. When we use > tesseract on the images without t

[tesseract-ocr] URGENT DEADLINE: NEED HELP WITH NEW LANGUAGE, PLEASE RESPOND

2020-10-31 Thread Cailey McVay
Hello! I am working on a project that is trying to read borehole video depths. We trained a new language to read these numbers called NTS. When we use tesseract on the images without the trained language we receive outputs that are accurate about 50% of the time. However when we use the new lan

[tesseract-ocr] tesseract osd retraining and script vs language text extraction

2020-10-31 Thread Omesharma
#Hey - ##i am Using Tesseract OCR for the text extraction form the image : - ##I need your valuable suggestion for the below mentioned points. - - How can i Retrain osd.traindata file for adding Ethiopic and