Hi Nick,

I tried passing the CCITTFaxDecode data to Tesseract, but it was not detected as a TIFF.
It seems that CCITT fax data is not the same as TIFF. A Google search showed that a few other people have faced the same issue (e.g. http://stackoverflow.com/questions/2641770/extracting-image-from-pdf-with-ccittfaxdecode-filter). If you know how we can convert the CCITT fax data to TIFF or JPEG, it would be really helpful.

Many thanks for your help and time.

Thanks,
- ganesh

On Thursday, September 6, 2012 4:52:26 PM UTC+8, Nick White wrote:
> Hi Ganesh,
>
> On Wed, Sep 05, 2012 at 06:42:52PM -0700, newtotesseract wrote:
> > Can you please help me know, can tesseract decode ocr from CCITT facsimile
> > standard images?
> >
> > I checked leptonica documentation and found that lept handles tiff, jpeg, and
> > others but did not see anything specific about CCITT fax.
>
> From looking online briefly it looks like CCITT is a type of TIFF
> file. So there's a good chance that tesseract will be able to read
> it directly (using leptonica). Try it and see. If not, the easiest
> thing would be to use ImageMagick to convert it to PNG before
> running Tesseract.
>
> Let us know if it works!
>
> Thanks,
>
> Nick
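
One workaround often suggested for this situation is to wrap the raw CCITTFaxDecode stream in a minimal TIFF header, since the stream is only the compressed bitmap and carries no TIFF container of its own. Below is a rough sketch of that idea in Python, not a tested recipe. The function name wrap_ccitt_in_tiff and its parameters are made up for illustration. It assumes the width and height are taken from the PDF image dictionary (/Columns and /Rows in the decode parameters), that the data is Group 4 (a negative /K in the PDF), and that the default WhiteIsZero photometric setting is appropriate. If the result comes out inverted, try photometric=1.

```python
import struct


def wrap_ccitt_in_tiff(data, width, height, group=4, photometric=0):
    """Prepend a minimal single-strip TIFF header to raw CCITT fax data.

    Hypothetical helper: all names and defaults here are assumptions.
    """
    # TIFF compression codes: 3 = CCITT Group 3 (T.4), 4 = CCITT Group 4 (T.6).
    compression = {3: 3, 4: 4}[group]
    # Little-endian layout: 8-byte file header, 2-byte entry count,
    # eight 12-byte IFD entries, 4-byte offset of the next IFD.
    fmt = "<2sHL" + "H" + "HHLL" * 8 + "L"
    header_size = struct.calcsize(fmt)
    header = struct.pack(
        fmt,
        b"II", 42, 8,            # byte order, magic number, offset of first IFD
        8,                       # number of IFD entries
        256, 4, 1, width,        # ImageWidth (LONG)
        257, 4, 1, height,       # ImageLength (LONG)
        258, 3, 1, 1,            # BitsPerSample (SHORT) = 1
        259, 3, 1, compression,  # Compression (SHORT)
        262, 3, 1, photometric,  # PhotometricInterpretation, 0 = WhiteIsZero
        273, 4, 1, header_size,  # StripOffsets: data starts right after header
        278, 4, 1, height,       # RowsPerStrip: whole image in one strip
        279, 4, 1, len(data),    # StripByteCounts
        0,                       # no further IFDs
    )
    return header + data
```

The returned bytes can be written to a file ending in .tif and then passed to Tesseract or ImageMagick like any other TIFF, provided the underlying libtiff build supports CCITT compression.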

