gImageReader is open source: https://github.com/manisandro/gImageReader. You can compile it and trace what it does with your example image.
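If you want something to try before reading the source, here is a rough sketch. I don't know what gImageReader actually does differently, so this rests on an assumption: that the page segmentation mode is the difference. It calls the tesseract command line with a few `--psm` values and reports the first one whose output contains "Photo" or "Shoot". The function names and the list of modes are my own choices for illustration and are not taken from gImageReader.

```python
import shutil
import subprocess

# Words that identify the whiteboard photo (from the original question).
KEYWORDS = ("photo", "shoot")


def build_command(image_path, psm, lang="eng"):
    """Return the tesseract CLI arguments for one page segmentation mode.

    "stdout" as the output base makes tesseract print the text
    instead of writing a .txt file.
    """
    return ["tesseract", image_path, "stdout", "-l", lang, "--psm", str(psm)]


def contains_keyword(text, keywords=KEYWORDS):
    """Case-insensitive check for any of the keywords in the OCR text."""
    lowered = text.lower()
    return any(word in lowered for word in keywords)


def find_working_psm(image_path, modes=(3, 6, 11, 12)):
    """Try each mode in turn; return the first that finds a keyword, else None.

    The choice of modes is a guess: 3 is the default, 6 assumes a single
    block of text, 11 and 12 look for sparse text.
    """
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    for psm in modes:
        result = subprocess.run(
            build_command(image_path, psm), capture_output=True, text=True
        )
        if contains_keyword(result.stdout):
            return psm
    return None
```

Calling `find_working_psm("C5H11295.JPG")` returns the first mode that produced one of the words, or `None` if none did. If every mode fails, the difference is more likely in the image itself (resolution, cropping to the whiteboard, contrast) than in the segmentation mode.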

On Thursday, July 28, 2022 at 5:16:58 PM UTC+8 damien...@gmail.com wrote:

> Any idea how I can see what gImageReader is doing and replicate this in
> the command line with tesseract?
>
> On Thu, 28 Jul 2022 at 11:11 AM, 'Yunlong Liu' via tesseract-ocr
> <tesser...@googlegroups.com> wrote:
>
>> Could it be because gImageReader has some extra preprocessing steps
>> before calling Tesseract to do the actual recognition work?
>>
>> On Thursday, July 28, 2022 at 4:27:55 PM UTC+8 damien...@gmail.com wrote:
>>
>>> Hi
>>>
>>> I am trying to detect the photo shoot identifier image in a batch of
>>> images. This photo always has a whiteboard with text written on it. I am
>>> not interested in the handwritten text, just the printed text on the
>>> whiteboard. Detecting the words "Photo" or "Shoot" in the photo will be
>>> enough to identify this image.
>>> [image: C5H11295.JPG]
>>>
>>> I have tried to identify these words with gImageReader, and it works
>>> fine. But when trying to do the OCR with tesseract in the command line
>>> (version 5.2 on Windows 64-bit) I don't get any text being returned in the
>>> result. My understanding is the gImageReader uses the tesseract engine, so
>>> why am I getting a result with that, and not directly from the command line?
>>>
>>> Any assistance will be appreciated.
>>>
>>> Thanks.
>>
>> This email and any attachment(s) it may contain is confidential and is
>> intended solely for the use of the individual(s) to whom it is addressed.
>> If you are not the intended recipient of this email, you must not take
>> action based on the contents, nor distribute, nor expose any part of the
>> content(s) to entities or person(s) beyond the original distribution list.
>> Please contact the sender and delete the email if you have received it in
>> error. Thank you.