I created a script, which takes an image file (currently TIF), OCRs it using 
http://code.google.com/p/tesseract-ocr/ Tesseract  and returns the
recognized text.

The script takes the image as a parameter and passes it to Tesseract OCR on
the filesystem, using the command line; Tesseract OCRs the file and then
returns the resulting text. The script then creates a new text file in
Plone, in the same folder as the image, with the recognized text. I call the
external script from an internal script, which is called from an action.

I assume in order to upload this to plone.org it needs to be a product but I
don't know how to turn it into a product. Honestly, I just wanted to get it
to work so I could OCR documents using Plone and it does now. 

I was wondering if anyone wants to see the script and help turn it into a
product so it is available to the Plone community?

Also, if I don't turn it into a product, is there another way I could post
it so people could use it and perhaps contribute to it?

Thanks,
Nathan
----------------------
http://www.abillionbillion.com ABillionBillion.com 
Document Management for Everyone
-- 
View this message in context: 
http://www.nabble.com/Tesseract-OCR-in-Plone%21-tp14521687s20094p14521687.html
Sent from the Product Developers mailing list archive at Nabble.com.


_______________________________________________
Product-Developers mailing list
[email protected]
http://lists.plone.org/mailman/listinfo/product-developers

Reply via email to