Re: [tesseract-ocr] Re: OCR-d failed at Unicharset line -Help!

2018-08-06 Thread May
Thanks a lot Shree. I tried the tesseract 4.0 and the training is working well until it reaches the lstm-training step and got stuck there. I am totally new in the training so hope you don't mind if I am asking silly questions. Do you know why I got stuck? Also, would you call this training fin

Re: [tesseract-ocr] Re: OCR-d failed at Unicharset line -Help!

2018-08-06 Thread Shree Devi Kumar
Ocr-d scripts are geared towards tesseract 4.0.x. you are trying to use it with tesseract 3.05. On Tue 7 Aug, 2018, 10:50 AM May, wrote: > Hey Shree > > I also tried with the orignal script from the github. But faced the same > issue with the process stuck at unicharset_output. > > >

Re: [tesseract-ocr] Re: OCR-d failed at Unicharset line -Help!

2018-08-06 Thread May
Hey Shree I also tried with the orignal script from the github. But faced the same issue with the process stuck at unicharset_output. These are the versions: tess

Re: [tesseract-ocr] tesseract-4.0.0-beta.3 - testing problem

2018-08-06 Thread Shree Devi Kumar
One of the tests is for developers to verify that all traineddata files are valid and load ok, so it needs the complete repo for tessdata_fast and tessdata_best. The tests have not been setup for users. On Mon 6 Aug, 2018, 1:44 PM Marco Atzeri, wrote: > Am 28.07.2018 um 10:08 schrieb Shree D

Re: [tesseract-ocr] tesseract-4.0.0-beta.3 - testing problem

2018-08-06 Thread Marco Atzeri
Am 28.07.2018 um 10:08 schrieb Shree Devi Kumar: Test related info has been moved to a new repo under tesseract-ocr https://github.com/tesseract-ocr/test You need to update that submodule (similar to googletest) for all files to be available. It's possible that the wiki has not been updated

[tesseract-ocr] Re: Easy training?

2018-08-06 Thread Dimitry Khanukaev
I've attached the example of error message. By "Pass/train to Tesseract number of those pairs" I mean do training for Tesseract by giving pairs images like that + the text that should be recognized from the image. -- You received this message because you are subscribed to the Google Groups "te

[tesseract-ocr] error while converting pdf file to tiff using command

2018-08-06 Thread thiyamjennil
hello everyone, for testing tesseract i convert the pdf file to tiff file and after 10 files(each contains 7000-8000 characters), there is this error that says convert-im6.q16: DistributedPixelCache '127.0.0.1' @ error/distribute-cache.c/ConnectPixelCacheServer/244. convert-im6.q16: cache resou

[tesseract-ocr] Easy training?

2018-08-06 Thread Dimitry Khanukaev
Hi is there way to do easy training with following concept: - I know font of program messages that need recognition - I know background - Even amount of messages is limited Could I? : - Just pass to training pairs (the screenshot of the error message + the text on that screenshot). - Pass/train t