On Fri, Dec 14, 2012 at 11:28 AM, Zdenko Podobný wrote:
> tesseract-ocr use leptonica for image IO. List of supported input type also
> depends on leptonica configuration e.g. if you did not compile jpeg support
> for leptonica, jpeg will be not supported in tesseract-ocr. So creating list
> of su
I was wondering this myself, as we need the ability to process at least
PDFs. I think the answer is, yes, you do have to render to pixel first.
At least that's the conclusion I've come to. I checked leptonica protos
for a covert *from* PDF, but there doesn't seem to be any. They do have
var
tesseract-ocr use leptonica for image IO. List of supported input type
also depends on leptonica configuration e.g. if you did not compile jpeg
support for leptonica, jpeg will be not supported in tesseract-ocr. So
creating list of supported types would be tricky.
For possible supported type yo
Unfortunately, I could not find a list of supported image input types
anywhere, so I just started to play with what I can produce. I tried SVG,
EPS, PDF, PS, and ODG, but all of them report as unsupported.
So the question remains: which vector type can I use as input? Or do I have
to convert t
4 matches
Mail list logo