[tesseract-ocr] Need OCR SW designed to extract transactions from bank statements to xfer into a General Ledger like QuickBooks or a spreadsheet..

2016-07-19 Thread Dave Burleigh
What is the best OCR software for scanning bank statements and pulling out the transaction data, so that it can be loaded into a GL like QuickBooks or at least a spreadsheet and I don't have to do a bunch of manual data entry? Email me at nextgencloud...@gmail.com (this would save a lot of typing for

[tesseract-ocr] shapetable size

2016-07-19 Thread Zhenia Krivopaltsev
Folks, I created a few personal tesseract data sets using different fonts or font combinations. In all cases I observe the size of "eng.shapetable" to be 1912, whereas in a standard tesseract data set the size is about 63k. Any idea why there is such a difference and what are the factors to make this file size

[tesseract-ocr] How to make things easy for OCR

2016-07-19 Thread Zing GabardineZwei
Hi. I would like to store binary data such as GPG keys on paper as well as digitally. The paper copy is the fallback for the case that the digital copy becomes unreadable. I don't want to use QR codes as they're clumsy to process. So, it's printed letters and digits with the additional
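
A minimal sketch of one way to lay out binary data as printed letters and digits. The post is cut off above, so everything here is an assumption and not the poster's scheme: Base32 (alphabet A-Z and 2-7, which leaves out 0, 1, 8 and 9), groups of four characters, and a CRC-32 per line, itself written in Base32, so that a misread line can be located. The names encode_for_paper, line_ok and decode_from_paper are hypothetical.

```python
import base64
import zlib


def _check(payload: str) -> str:
    """CRC-32 of one line's payload, written in Base32 (7 characters)."""
    crc = zlib.crc32(payload.encode("ascii")) & 0xFFFFFFFF
    return base64.b32encode(crc.to_bytes(4, "big")).decode("ascii").rstrip("=")


def encode_for_paper(data: bytes, group: int = 4, groups_per_line: int = 8) -> list:
    """Turn bytes into printable lines: spaced Base32 groups, then ' | ', then a check."""
    text = base64.b32encode(data).decode("ascii").rstrip("=")
    per_line = group * groups_per_line
    lines = []
    for start in range(0, len(text), per_line):
        payload = text[start:start + per_line]
        groups = [payload[i:i + group] for i in range(0, len(payload), group)]
        lines.append(" ".join(groups) + " | " + _check(payload))
    return lines


def line_ok(line: str) -> bool:
    """True if the check printed on the line matches its payload."""
    payload_part, _, check_part = line.partition("|")
    return _check(payload_part.replace(" ", "")) == check_part.strip()


def decode_from_paper(lines: list) -> bytes:
    """Rebuild the original bytes, refusing any line whose check does not match."""
    chunks = []
    for number, line in enumerate(lines, start=1):
        if not line_ok(line):
            raise ValueError(f"checksum mismatch on line {number}")
        chunks.append(line.partition("|")[0].replace(" ", ""))
    text = "".join(chunks)
    text += "=" * (-len(text) % 8)  # restore the padding stripped when encoding
    return base64.b32decode(text)
```

With this layout a failed check points at a single line to retype by hand, rather than invalidating the whole page.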

Re: [tesseract-ocr] Tesseract-ocr duration time calculation

2016-07-19 Thread Zdenko Podobný
Have a look at Text Fairy app[1] video 0:22. Are you looking for something like that? If yes, source code is available at github[2]. rmtheis wrote short blog post about it too[3]... [1] https://play.google.com/store/apps/details?id=com.renard.ocr [2] https://github.com/renard314/textfairy [3]

Re: [tesseract-ocr] Can any feature of tesseract auto detect language (or majority language) of the image?

2016-07-19 Thread Zdenko Podobný
No. To produce a correct OCR result, Tesseract needs the language of the input image to be specified. Zdenko On Tue, Jul 19, 2016 at 8:47 AM, Ashish Goel wrote: > I have 100s of images in different languages that I need to OCR. > Presently, I need to know in advance the language of the

[tesseract-ocr] Re: Text output vs. PDF

2016-07-19 Thread H . Mijail Antón Quiles
I just spent a couple of hours debugging a workflow, because the finally generated PDF seemed to have been OCR'd but with every character being a space. Turns out that the problem was not in the workflow, but me using Preview.app, as explained in this thread. Acrobat Reader does extract the

[tesseract-ocr] Adaptive thresholding

2016-07-19 Thread Mitesh Kalal
I just want to know how to use adaptive.cpp in tesseract to get an adaptively thresholded image. I mean, how do I give it an input image and get an output image?
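
For readers unfamiliar with the term, here is a rough sketch of generic mean-based adaptive thresholding. It is independent of Tesseract's sources and is not a description of what adaptive.cpp does. The function name adaptive_threshold and the window and offset defaults are assumptions chosen for illustration. The image is a plain list of rows of 0-255 integers, so no imaging library is needed.

```python
def adaptive_threshold(gray, window=15, offset=10):
    """Binarize a grayscale image by comparing each pixel with its local mean.

    gray is a list of equal-length rows of ints in 0..255. A pixel becomes
    255 (background) if it is brighter than (local mean - offset), else 0.
    """
    height, width = len(gray), len(gray[0])

    # Integral image, padded with a zero row and column, so that the sum of
    # any rectangle can be read with four lookups.
    integral = [[0] * (width + 1) for _ in range(height + 1)]
    for y in range(height):
        row_sum = 0
        for x in range(width):
            row_sum += gray[y][x]
            integral[y + 1][x + 1] = integral[y][x + 1] + row_sum

    half = window // 2
    result = []
    for y in range(height):
        y0, y1 = max(0, y - half), min(height, y + half + 1)
        out_row = []
        for x in range(width):
            x0, x1 = max(0, x - half), min(width, x + half + 1)
            total = (integral[y1][x1] - integral[y0][x1]
                     - integral[y1][x0] + integral[y0][x0])
            mean = total / ((y1 - y0) * (x1 - x0))
            out_row.append(255 if gray[y][x] > mean - offset else 0)
        result.append(out_row)
    return result
```

Because the threshold follows the local mean, uneven lighting across a scanned page affects the result less than a single global cutoff would.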

[tesseract-ocr] Re: Can any feature of tesseract auto detect language (or majority language) of the image?

2016-07-19 Thread Ashish Goel
Thanks for the reply, but I am looking for a solution which I can integrate into my custom application. I have no idea whether I can make use of the Google Drive application for this purpose. On Tuesday, July 19, 2016 at 12:17:20 PM UTC+5:30, Ashish Goel wrote: > > I have 100s of images in different

Re: [tesseract-ocr] Can any feature of tesseract auto detect language (or majority language) of the image?

2016-07-19 Thread ShreeDevi Kumar
Try ocr in google drive, it auto detects the languages. - sent from my phone. excuse the brevity. On 19-Jul-2016 12:17 PM, "Ashish Goel" wrote: > I have 100s of images in different languages that I need to OCR. > Presently, I need to know in advance the language of the

[tesseract-ocr] Re: Error when compiling tesscallback.h

2016-07-19 Thread Jayanth Culli
Same problem here...

[tesseract-ocr] Can any feature of tesseract auto detect language (or majority language) of the image?

2016-07-19 Thread Ashish Goel
I have 100s of images in different languages that I need to OCR. Presently, I need to know the language of each image in advance and pass the language parameter (e.g. -l deu or -l dan). Is there a way to figure out the language of the image automagically? It is weird but
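
This is not automatic detection, but the -l parameter mentioned above also accepts several languages joined with "+", for example -l deu+dan, so a set of candidate languages can be passed in one run. A small sketch of building such a command line follows. The helper name build_tesseract_command and the file names are hypothetical, and whether combined languages give acceptable accuracy for a given image set is something to test rather than assume.

```python
def build_tesseract_command(image_path, output_base, languages):
    """Return the argument list for: tesseract IMAGE OUTPUTBASE -l lang1+lang2...

    languages is a non-empty list of Tesseract language codes such as
    ["deu", "dan"]; the corresponding traineddata files must be installed.
    """
    if not languages:
        raise ValueError("at least one language code is required")
    return ["tesseract", image_path, output_base, "-l", "+".join(languages)]


# Example: candidate languages for one scanned page (hypothetical file names).
command = build_tesseract_command("scan.png", "scan", ["deu", "dan"])
print(" ".join(command))
```

The returned list can be passed directly to subprocess.run from the standard library when Tesseract is installed.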