Re: hocr2pdf and arabic language

Jeff Breidenbach Thu, 06 Feb 2014 10:39:20 -0800

I've merged Nick White's bugfix into hocr-tools. Thank you, Nick.
I expect most people will instead use the native PDF support 
built into Tesseract henceforth, and I intend to focus most of my
time and energy there.


However, there is still some use for hocr-pdf, especially when 
working with slow digitization equipment like a Linear Book 
Scanner. Generating a separate hOCR file per image (then 
assembling them into a PDF at the end) means you don't have 
to wait for scanning to complete before beginning OCR, which 
leads to faster overall results.
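
For illustration, here is a rough Python sketch of that per-image
workflow. It is not part of hocr-tools. It assumes the scanner drops
JPEG pages into one directory, that your Tesseract build writes
<name>.hocr when given the "hocr" config (older releases wrote .html
instead), and that hocr-pdf reads that directory and writes the PDF
to stdout. The function names are made up for this example.

```python
import subprocess
from pathlib import Path


def tesseract_hocr_cmd(image):
    """Build a command that writes <image stem>.hocr next to the image.

    Assumes the stock "hocr" config file is available to Tesseract.
    """
    image = Path(image)
    return ["tesseract", str(image), str(image.with_suffix("")), "hocr"]


def pending_images(scan_dir):
    """Return scanned pages that do not have an hOCR file yet."""
    scan_dir = Path(scan_dir)
    return sorted(
        p for p in scan_dir.glob("*.jpg")
        if not p.with_suffix(".hocr").exists()
    )


def ocr_new_pages(scan_dir, run=subprocess.run):
    """OCR every page scanned so far; safe to call repeatedly mid-scan."""
    done = []
    for image in pending_images(scan_dir):
        run(tesseract_hocr_cmd(image), check=True)
        done.append(image)
    return done


def assemble_pdf(scan_dir, out_pdf, run=subprocess.run):
    """Combine the JPEG/hOCR pairs into one PDF once scanning is done.

    Assumes hocr-pdf takes the image directory and emits the PDF on stdout.
    """
    with open(out_pdf, "wb") as fh:
        run(["hocr-pdf", str(scan_dir)], check=True, stdout=fh)
```

Call ocr_new_pages() periodically while the scanner is still running,
then assemble_pdf() once the last page is in. Pages that already have
an hOCR file are skipped, so only new scans cost OCR time.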

Cheers,
Jeff

http://www.youtube.com/watch?v=4JuoOaL11bw

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to tesseract-ocr@googlegroups.com
To unsubscribe from this group, send email to
tesseract-ocr+unsubscr...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

