Re: [tesseract-ocr] Re: lstmeval shows good result but visualized result looks bad

2019-06-17 Thread Shree Devi Kumar
I don't think you need training to improve results. You need to pre-process the image and straighten it, then use a separate tool to identify each cell of data and OCR each cell on its own. You will get the best results that way.

On Mon, Jun 17, 2019 at 6:07 PM phucp...@gmail.com wrote:
> Thanks shree for your reply
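
The cell-by-cell approach suggested above could look roughly like the following. This is only a sketch under stated assumptions, none of which come from the thread: the image is already straightened and binarized into rows of 0/1 values (1 = dark pixel), the table has ruling lines that span its full width and height, and the helper names are hypothetical. It finds the ruling lines with row and column projection profiles and returns one bounding box per cell.

```python
# Sketch: locate table cells in a deskewed, binarized image so that each cell
# can be cropped and sent to OCR separately.
# Assumption: `binary` is a list of equal-length rows of 0/1 (1 = dark pixel).

def line_positions(profile, length, min_fill=0.9):
    """Indices whose dark-pixel count covers at least min_fill of the length."""
    return [i for i, count in enumerate(profile) if count >= min_fill * length]

def gaps(lines, size):
    """(start, end) spans, end exclusive, lying between consecutive lines."""
    spans, prev = [], -1
    for pos in lines + [size]:
        if pos - prev > 1:
            spans.append((prev + 1, pos))
        prev = pos
    return spans

def find_cells(binary):
    """Return cells as (top, bottom, left, right), bottom/right exclusive."""
    height, width = len(binary), len(binary[0])
    row_profile = [sum(row) for row in binary]
    col_profile = [sum(binary[y][x] for y in range(height)) for x in range(width)]
    row_spans = gaps(line_positions(row_profile, width), height)
    col_spans = gaps(line_positions(col_profile, height), width)
    return [(t, b, l, r) for t, b in row_spans for l, r in col_spans]

def demo_grid():
    """Hypothetical 7x7 image: a 2x2 table with ruling lines at 0, 3 and 6."""
    rules = {0, 3, 6}
    return [[1 if (y in rules or x in rules) else 0 for x in range(7)]
            for y in range(7)]

cells = find_cells(demo_grid())
```

Each box can then be cropped from the original image and passed to Tesseract individually; for a cell holding a single line of text, the single-text-line page segmentation mode (`--psm 7`) may be a reasonable starting point.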

Re: [tesseract-ocr] Re: lstmeval shows good result but visualized result looks bad

2019-06-17 Thread phucp...@gmail.com
Thanks, Shree, for your reply. I can see that you are busy answering a lot of questions here, so thanks again for taking some time for me.

> Your files have prefix of jpn, so I assume you are training for Japanese,
> but the image in question has only numbers in it.

Well I forgot to mention, my mode

[tesseract-ocr] Re: lstmeval shows good result but visualized result looks bad

2019-06-17 Thread shree
Your files have the prefix jpn, so I assume you are training for Japanese, but the image in question has only numbers in it. Getting good results on eval data but bad results on OCR could be the result of overfitting the model, if you have used a small sample and trained for a large number of iter
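
One way to check for the gap described above is to compute the character error rate separately on the eval lines and on text recognized from real page images, then compare the two numbers. The sketch below is not lstmeval's own calculation; it is a plain edit-distance based estimate, and the function names and the sample strings are hypothetical.

```python
# Sketch: character error rate (CER) from ground truth / OCR output pairs.
# A much higher CER on real pages than on eval lines hints at overfitting.

def edit_distance(a, b):
    """Levenshtein distance between strings a and b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                 # deletion
                           cur[j - 1] + 1,              # insertion
                           prev[j - 1] + (ca != cb)))   # substitution
        prev = cur
    return prev[-1]

def char_error_rate(pairs):
    """pairs: iterable of (ground_truth, ocr_output) strings."""
    pairs = list(pairs)
    errors = sum(edit_distance(truth, ocr) for truth, ocr in pairs)
    total = sum(len(truth) for truth, _ in pairs)
    return errors / total if total else 0.0

# Hypothetical sample: one substituted digit out of five characters.
sample_cer = char_error_rate([("12345", "12845")])
```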