I believe tesseract-ocr is also based on the ocr work out of HP labs,
same as gocr. I would take those numbers with a lump of salt.
That will probably be true for text in a single column, no font size
changes or type face changes.
On 6/11/08, eric pareja <[EMAIL PROTECTED]> wrote:
> how does tess
how does tesseract-ocr fare? [http://code.google.com/tesseract-ocr]
character accuracy is about 98%, word accuracy is 95+%.
On Tue, Jun 10, 2008 at 10:35 PM, Paolo Falcone <[EMAIL PROTECTED]> wrote:
> There's no "poor man's OCR". The current state of GOCR (jocr.sf.net)
> is just so pitiful at thi
Agree. I evaluated gocr and some commercial packages some time ago.
Gocr is unusable. It probably gets 70% correct (i used an fhm
philippines page as my sample data, took a picture of the page with a
dslr and 50mm lens). Abbyy finereader did kind of ok, considering that
the fhm page had a complex m
is your server not a multi-proc/core? does your app behave the same having
those extra proc/core?
On Tue, Jun 10, 2008 at 3:51 AM, Jagi Sarcilla <[EMAIL PROTECTED]> wrote:
> [...]
> It killing my server. 99.99% CPU/MEM Utilization for 150,000 Documents(not
> single page) everyday.
> [...]
>
___
There's no "poor man's OCR". The current state of GOCR (jocr.sf.net)
is just so pitiful at this stage, it's not worth even considering.
On Tue, Jun 10, 2008 at 9:10 PM, Holden Hao <[EMAIL PROTECTED]> wrote:
> On Tue, Jun 10, 2008 at 6:51 PM, Jagi Sarcilla <[EMAIL PROTECTED]> wrote:
>>
>> Hi Plugge
On Tue, Jun 10, 2008 at 6:51 PM, Jagi Sarcilla <[EMAIL PROTECTED]> wrote:
> Hi Pluggers,
>
> I'm looking for good TIFF to PDF compress/conversion tools with OCR and
> Multi language (FREE-OPENSOURCE or COMMERCIAL and it will run on
> RHEL5.1/CENTOS5.1)
> I want to do exactly what the CVista PDF Co
Hi Pluggers,
I'm looking for good TIFF to PDF compress/conversion tools with OCR and
Multi language (FREE-OPENSOURCE or COMMERCIAL and it will run on
RHEL5.1/CENTOS5.1)
I want to do exactly what the CVista PDF Compression Tools on Windows.
things why we are switch:
It killing my server. 99.99% C
7 matches
Mail list logo