Hey Clint,
Theres a ton of different ways you could build a system such as that. It all comes down to your experience level with the different technologies involved. E.g. if you know how to use the command-line version of tesseract, what you could do, is write a web application, that handles the image upload and then simply calls a shell script, which wraps the interaction with tesseract. This approach assumes you are familiar with building web applications in general, as well as how to shell out into e.g. bash. Another approach could be to build your server backend completely in C++ and actually load tesseract as a library and do the processing inside of the same process, which will be hosting the web application. I think the reason, you've not had much luck on the mailing list so far, is because there's really a huge number of assumptions needing to be made, before it's even possible to start answering your questions in earnest. You need to decide how you'd like to build your web application first. E.g. What web-server are you going to be using? What language and/or framework will you be writing the backend in? Saying html5 and JS doesn't really provide enough information. Do you mean you'll be writing Node.JS or do you mean you only have access to a static file web host? Here's a question on Quora about doing a web application for OpenCV: https://www.quora.com/How-do-I-use-opencv-image-processing-in-web-application You might find some inspiration there and in the related questions. /René On Tue, 10 Sep 2019 at 23:55, Clint William Theron < [email protected]> wrote: > Hey. How do I develop a web app using tesseract-ocr? For example, I would > create an online UI with a "pick a file" button and when the user picks an > image, from computer storage, the ocr process starts. I'll do the UI myself > with html5 & JS code but I need assistance with tesseract-ocr. Like which > files to upload and what to include in the script and then how to call the > recognize method. > > To give you an example of what I mean I'll demonstrate with a different > tesseact project. The one from project naptha > <http://tesseract.projectnaptha.com>: I made a web app using this tutorial > <https://ourcodeworld.com/articles/read/580/how-to-convert-images-to-text-with-pure-javascript-using-tesseract-js> > and > I got to perform ocr on images. Everything works as expected. Now I want to > do more or less the same thing with tesseract-ocr, the one from Google. I > downloaded the tesseract-master.zip file, upload it to my web server and > unzipped it. Since tesseract-ocr doesn't have a UI I don't know what to do > next. There must be some file, script, I should include in my main script > in order to call tesseract.recognize. > > Even if my question doesn't make full sense I'm sure you have an idea of > what I want to achieve. > > Thanks for any advice > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/20d26401-4011-47a5-a3c2-b67bb33fa393%40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/20d26401-4011-47a5-a3c2-b67bb33fa393%40googlegroups.com?utm_medium=email&utm_source=footer> > . > -- Never fear, Linux is here. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAB-60nj9CLXh65yu9Eh2kVqQXQA1OekeYgZik-655X%2BhWLyhXQ%40mail.gmail.com.

