Re: [plug] tiff to PDF compression/Conversion with OCR Capability Multilanguage

Orlando Andico Wed, 11 Jun 2008 02:10:51 -0700

I believe tesseract-ocr is also based on the ocr work out of HP labs,
same as gocr. I would take those numbers with a lump of salt.


That will probably be true for text in a single column, no font size
changes or type face changes.

On 6/11/08, eric pareja <[EMAIL PROTECTED]> wrote:
> how does tesseract-ocr fare? [http://code.google.com/tesseract-ocr]
>
> character accuracy is about 98%, word accuracy is 95+%.
>
> On Tue, Jun 10, 2008 at 10:35 PM, Paolo Falcone <[EMAIL PROTECTED]>
> wrote:
>> There's no "poor man's OCR". The current state of GOCR (jocr.sf.net)
>> is just so pitiful at this stage, it's not worth even considering.
>
> --
> `..^..' eric pareja ([EMAIL PROTECTED]) lpic-2 | software freedom for
> all
> |<(e)>| gnu linux python debian edu iosn localization tagalog filipino
> `..v..' foss internationalization usability pusakat philippines free
> "Ang mundo ay aklat, at iisang pahina lamang ang nababasa ng hindi
> naglalakbay."
> _________________________________________________
> Philippine Linux Users' Group (PLUG) Mailing List
> http://lists.linux.org.ph/mailman/listinfo/plug
> Searchable Archives: http://archives.free.net.ph
>

-- 
Sent from Gmail for mobile | mobile.google.com

Orlando Andico
+63.2.976.8659 | +63.920.903.0335
_________________________________________________
Philippine Linux Users' Group (PLUG) Mailing List
http://lists.linux.org.ph/mailman/listinfo/plug
Searchable Archives: http://archives.free.net.ph

Re: [plug] tiff to PDF compression/Conversion with OCR Capability Multilanguage

Reply via email to