Re: can tesseract decode the OCR from CCITT facsimile standard images?

newtotesseract Thu, 06 Sep 2012 21:10:49 -0700

Hi Nick,

I tried passing in the CCITTFaxDecode data to tesseract, but it was not 
detected as TIFF.


It seems like CCITT fax is not same as TIFF.

Google search showed me that few other people also faced same issue (e.g. "
http://stackoverflow.com/questions/2641770/extracting-image-from-pdf-with-ccittfaxdecode-filter
").

If you know, how we can convert the CCITT-Fax to tiff or jpeg, it would be 
really helpful.

Many thanks for your help and time.

Thanks,
- ganesh

On Thursday, September 6, 2012 4:52:26 PM UTC+8, Nick White wrote:
>
> Hi Ganesh, 
>
> On Wed, Sep 05, 2012 at 06:42:52PM -0700, newtotesseract wrote: 
> > Can you please help me know, can tesseract decode ocr from CCITT 
> facsimile 
> > standard images? 
> > 
> > I checked leptonica documentation and found that lept handles tiff, 
> jpeg, and 
> > others but did not see anything specific about CCITT fax. 
>
> From looking online briefly it looks like CCITT is a type of TIFF 
> file. So there's a good chance that tesseract will be able to read 
> it directly (using leptonica). Try it and see. If not, the easiest 
> thing would be to use ImageMagick to convert it to PNG before 
> running Tesseract. 
>
> Let us know if it works! 
>
> Thanks, 
>
> Nick 
>

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Re: can tesseract decode the OCR from CCITT facsimile standard images?

Reply via email to