[tesseract-ocr] Re: increase the quality of image so that it extracts proper text from it.

2018-08-07 Thread May
Could you share the image that you processed? On Tuesday, July 31, 2018 at 11:33:41 PM UTC-7, Mahima Goyal wrote: > > I want to increase the quality of the image so that proper text is > extracted. Right now I am using tesseract but I am not able to extract a few > things in the image > > In

[tesseract-ocr] Re: error while converting pdf file to tiff using command

2018-08-07 Thread May
Try checking this out: https://github.com/ImageMagick/ImageMagick/issues/396 On Monday, August 6, 2018 at 12:37:33 AM UTC-7, thiyam...@gmail.com wrote: > > hello everyone, for testing tesseract i convert the pdf file to tiff file > and after 10 files(each contains 7000-8000 characters), there is

Re: [tesseract-ocr] Re: OCR-d failed at Unicharset line -Help!

2018-08-07 Thread May
>> https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00 >> >> On Tue, Aug 7, 2018 at 12:39 PM May > >> wrote: >> >>> Oh the training started by itself after a long while and still >>> processing. Does it normally take that

Re: [tesseract-ocr] Re: OCR-d failed at Unicharset line -Help!

2018-08-07 Thread May
11:42:40 PM UTC-7, May wrote: > > Thanks a lot Shree. I tried tesseract 4.0 and the training works > well until it reaches the lstm-training step, where it gets stuck. I am > totally new to training so I hope you don't mind if I ask silly > questions. Do you

Re: [tesseract-ocr] Re: OCR-d failed at Unicharset line -Help!

2018-08-06 Thread May
Ocr-d scripts are geared towards tesseract 4.0.x. You are trying to use them > with tesseract 3.05. > > On Tue 7 Aug, 2018, 10:50 AM May, > > wrote: > >> Hey Shree >> >> I also tried with the original script from GitHub, but faced the same >> issue with th

Re: [tesseract-ocr] Re: OCR-d failed at Unicharset line -Help!

2018-08-06 Thread May
D/ocrd-train > > On Fri, Aug 3, 2018 at 4:41 AM May > > wrote: > >> >> <https://lh3.googleusercontent.com/-LnwUni4-lLw/W2OPUqJpn_I/ANs/Xd_-CVCdiMk0cjMmxBpVgfOSU1JeAacAgCLcBGAs/s1600/Capture.PNG> >> >> >> >> <https://lh3.googl

[tesseract-ocr] Re: OCR-d failed at Unicharset line -Help!

2018-08-02 Thread May
ached photos On Thursday, August 2, 2018 at 4:08:11 PM UTC-7, May wrote: > > Hey all, > > I am following Shree's script for OCR-d in the google groups for > ocrd-training ( > https://groups.google.com/forum/#!topic/tesseract-ocr/be4-rjvY2tQ). I > managed to pass the combi

[tesseract-ocr] OCR-d failed at Unicharset line -Help!

2018-08-02 Thread May
path: I do find a unicharset file named "unicharset" but not as "my.unicharset". Changing the script by removing "my." also did not solve the problem. Do you know what's causing the issue? Best May -- You received this message because you are subscribed to

[tesseract-ocr] Extracting some text and numbers from pdf

2018-06-29 Thread May
an extract correctly and not with some random error. Does training data set work? If so how can I do it? Looking forward to your answer. Best May