Re: [plug] tiff to PDF compression/Conversion with OCR Capability Multilanguage

2008-06-11 Thread Orlando Andico
I believe tesseract-ocr is also based on the OCR work out of HP Labs, same as gocr. I would take those numbers with a lump of salt. That will probably be true for text in a single column, with no font size or typeface changes. On 6/11/08, eric pareja <[EMAIL PROTECTED]> wrote: > how does tess

Re: [plug] tiff to PDF compression/Conversion with OCR Capability Multilanguage

2008-06-11 Thread eric pareja
How does tesseract-ocr fare? [http://code.google.com/tesseract-ocr] Character accuracy is about 98%, word accuracy is 95+%. On Tue, Jun 10, 2008 at 10:35 PM, Paolo Falcone <[EMAIL PROTECTED]> wrote: > There's no "poor man's OCR". The current state of GOCR (jocr.sf.net) > is just so pitiful at thi

Re: [plug] tiff to PDF compression/Conversion with OCR Capability Multilanguage

2008-06-10 Thread Orlando Andico
Agreed. I evaluated GOCR and some commercial packages some time ago. GOCR is unusable. It probably gets 70% correct (I used an FHM Philippines page as my sample data, and took a picture of the page with a DSLR and a 50mm lens). ABBYY FineReader did kind of OK, considering that the FHM page had a complex m

Re: [plug] tiff to PDF compression/Conversion with OCR Capability Multilanguage

2008-06-10 Thread Ariz Jacinto
Is your server not multi-processor or multi-core? Does your app behave the same with those extra processors/cores? On Tue, Jun 10, 2008 at 3:51 AM, Jagi Sarcilla <[EMAIL PROTECTED]> wrote: > [...] > It killing my server. 99.99% CPU/MEM Utilization for 150,000 Documents(not > single page) everyday. > [...] > ___

Re: [plug] tiff to PDF compression/Conversion with OCR Capability Multilanguage

2008-06-10 Thread Paolo Falcone
There's no "poor man's OCR". The current state of GOCR (jocr.sf.net) is just so pitiful at this stage, it's not worth even considering. On Tue, Jun 10, 2008 at 9:10 PM, Holden Hao <[EMAIL PROTECTED]> wrote: > On Tue, Jun 10, 2008 at 6:51 PM, Jagi Sarcilla <[EMAIL PROTECTED]> wrote: >> >> Hi Plugge

Re: [plug] tiff to PDF compression/Conversion with OCR Capability Multilanguage

2008-06-10 Thread Holden Hao
On Tue, Jun 10, 2008 at 6:51 PM, Jagi Sarcilla <[EMAIL PROTECTED]> wrote: > Hi Pluggers, > > I'm looking for good TIFF to PDF compress/conversion tools with OCR and > Multi language (FREE-OPENSOURCE or COMMERCIAL and it will run on > RHEL5.1/CENTOS5.1) > I want to do exactly what the CVista PDF Co

[plug] tiff to PDF compression/Conversion with OCR Capability Multilanguage

2008-06-10 Thread Jagi Sarcilla
Hi Pluggers, I'm looking for a good TIFF-to-PDF compression/conversion tool with OCR and multi-language support (FREE/OPEN SOURCE or COMMERCIAL, and it will run on RHEL 5.1/CentOS 5.1). I want to do exactly what the CVista PDF Compression Tools do on Windows. Reasons why we are switching: It's killing my server. 99.99% C