After some discussion on the Plone documentation list, the document was published as a How-To. Here it is:
http://plone.org/documentation/how-to/ocr-in-plone-using-tesseract-ocr

If anyone can help make this into a product or improve the scripts, that would be very helpful.

Thanks,
Nathan
_____________
ABillionBillion.com
Document Management for Everyone


duffyd wrote:
>
> Hi Nathan,
>
> On Thu, 2007-12-27 at 19:06 -0800, NA wrote:
>
>> I created a script, which takes an image file (currently TIF), OCRs it
>> using Tesseract (http://code.google.com/p/tesseract-ocr/) and returns
>> the recognized text.
>>
>> The script takes the image as a parameter and passes it to Tesseract OCR
>> on the filesystem, using the command line; Tesseract OCRs the file and
>> then returns the resulting text. The script then creates a new text file
>> in Plone, in the same folder as the image, with the recognized text. I
>> call the external script from an internal script, which is called from
>> an action.
>>
>> I assume in order to upload this to plone.org it needs to be a product
>> but I don't know how to turn it into a product. Honestly, I just wanted
>> to get it to work so I could OCR documents using Plone and it does now.
>>
>> I was wondering if anyone wants to see the script and help turn it into
>> a product so it is available to the Plone community?
>>
>> Also, if I don't turn it into a product, is there another way I could
>> post it so people could use it and perhaps contribute to it?
>
>
> You could create a tutorial[1] on plone.org and upload the scripts as
> attachments and/or post the code into the tutorial - that's a great way
> to contribute!
>
> Thanks,
> Tim
>
> [1] http://plone.org/documentation
>
>
>>
>> Thanks,
>> Nathan
>> ----------------------
>> http://www.abillionbillion.com ABillionBillion.com
>> Document Management for Everyone
>
> _______________________________________________
> Product-Developers mailing list
> [email protected]
> http://lists.plone.org/mailman/listinfo/product-developers
>
>

--
View this message in context: http://www.nabble.com/Tesseract-OCR-in-Plone%21-tp14521687s20094p14770351.html
Sent from the Product Developers mailing list archive at Nabble.com.
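
For anyone who wants the general idea without opening the How-To, here is a rough sketch of the filesystem half described in the quoted message: hand the image to the Tesseract command line, then read back the text it writes. This is not the script from the How-To. The function names (build_tesseract_command, ocr_image, text_id_for) are made up for illustration, it is written for current Python rather than the Python that shipped with Plone in 2007, and it assumes a `tesseract` executable on the PATH that is called as `tesseract <image> <output base>` and writes `<output base>.txt`. Creating the text object in the Plone folder is left out.

```python
import os
import shutil
import subprocess
import tempfile


def build_tesseract_command(image_path, output_base, executable="tesseract"):
    """Return the argv list for `tesseract <image> <output base>`."""
    return [executable, image_path, output_base]


def ocr_image(image_path, executable="tesseract", run=subprocess.check_call):
    """OCR image_path with Tesseract and return the recognized text.

    `run` is a hypothetical hook so the external call can be swapped out
    (for example in tests); by default the real command is executed.
    """
    workdir = tempfile.mkdtemp()
    output_base = os.path.join(workdir, "ocr_result")
    try:
        run(build_tesseract_command(image_path, output_base, executable))
        # Tesseract appends ".txt" to the output base name it is given.
        with open(output_base + ".txt", encoding="utf-8") as handle:
            return handle.read()
    finally:
        # Remove the temporary directory whether or not OCR succeeded.
        shutil.rmtree(workdir, ignore_errors=True)


def text_id_for(image_id):
    """Derive an id for the text file stored next to the image,
    e.g. 'scan.tif' becomes 'scan.txt'."""
    return os.path.splitext(image_id)[0] + ".txt"
```

In Plone, something like ocr_image() would sit in the external script, and the internal script called from the action would store the returned text under text_id_for(image id) in the same folder as the image.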
