Rucha > Green? Why?

Ger > Indeed, why? (What is the thought that drove you to run this
particular imagemagick command?)

Fair questions.  I saw both black and white in the text so I picked a
background color that does not exist in the text and has high contrast.
 tesseract did a great job with the green background.  I want to process
images to extract Palo Alto California tide data, date, and time and then
plot the results against xtide predictions.  I am close to processing a
day's worth of images collected once a minute so I will see how well the
green background works.  If I have problems, I will definitely try using
your (Ger and Rucha's) advice.

Thank you Ger and Racha very much for your advice.

Best Regards,
   Michael

On Fri, Oct 31, 2025 at 5:52 PM Ger Hobbelt <[email protected]> wrote:

> Indeed, why? (What is the thought that drove you to run this particular
> imagemagick command?)  While it might help visually debugging something
> you're trying, the simplest path towards "black text on white background"
> is
>
> 1. converting any image to greyscale. (and see for yourself if that output
> is easily legible; if it's not, chances are the machine will have trouble
> too, so more preprocessing /before/ the greyscale transform is needed then)
> 2. use a 'threshold' (a.k.a. binarization) step to possibly help (though
> tesseract can oftentimes do a better job with greyscale instead of hard
> black & white as there's more 'detail' in the image pixels then. YMMV).
>
> You can do this many ways, using imagemagick is one, openCV another. For
> one-offs I use Krita / Photoshop filter layers (stacking the filters to get
> what I want).
> Anything really that gets you something that approaches 'crisp dark/black
> text on a clean, white background, text characters about 30px high' (dpi is
> irrelevant, though often mentioned elsewhere: tesseract does digital image
> pixels, not classical printer mindset dots-per-inch).
>
> Note that 'simplest path towards' does not mean 'always the best way'.
>
> Met vriendelijke groeten / Best regards,
>
> Ger Hobbelt
>
> --------------------------------------------------
> web:    http://www.hobbelt.com/
>         http://www.hebbut.net/
> mail:   [email protected]
> mobile: +31-6-11 120 978
> --------------------------------------------------
>
>
> On Fri, Oct 31, 2025 at 5:46 AM Rucha Patil <[email protected]>
> wrote:
>
>> Green? Why? I dont know if this might resolve the issue. Lmk the behavior
>> I’m curious. But you need an image that has white background and black
>> text. You can achieve this easily using cv2 functions.
>>
>> On Thu, Oct 30, 2025 at 1:26 PM Michael Schuh <[email protected]> wrote:
>>
>>> I am trying to extract the date and time from
>>>
>>> [image: time.png]
>>>
>>> I have successfully use tesseract to extract text from other images.
>>> tesseract does not find any text in the above image,
>>>
>>>    michael@argon:~/michael/trunk/src/tides$ tesseract time.png out
>>>    Estimating resolution as 142
>>>
>>>    michael@argon:~/michael/trunk/src/tides$ cat out.txt
>>>
>>>    michael@argon:~/michael/trunk/src/tides$ ls -l out.txt
>>>    -rw-r----- 1 michael michael 0 Oct 30 08:58 out.txt
>>>
>>> Any help you can give me would be appreciated.  I attached the time.png
>>> file I used above.
>>>
>>> Thanks,
>>>    Michael
>>>
>>> --
>>> You received this message because you are subscribed to the Google
>>> Groups "tesseract-ocr" group.
>>> To unsubscribe from this group and stop receiving emails from it, send
>>> an email to [email protected].
>>> To view this discussion visit
>>> https://groups.google.com/d/msgid/tesseract-ocr/77ac0d2b-7796-4f17-8bc6-0e70a9653adan%40googlegroups.com
>>> <https://groups.google.com/d/msgid/tesseract-ocr/77ac0d2b-7796-4f17-8bc6-0e70a9653adan%40googlegroups.com?utm_medium=email&utm_source=footer>
>>> .
>>>
>> --
>> You received this message because you are subscribed to the Google Groups
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an
>> email to [email protected].
>> To view this discussion visit
>> https://groups.google.com/d/msgid/tesseract-ocr/CADEFw17btz6nKqyhFKd-GXVCu7qtBQQ6gY5AV0pZJusXa4CpXg%40mail.gmail.com
>> <https://groups.google.com/d/msgid/tesseract-ocr/CADEFw17btz6nKqyhFKd-GXVCu7qtBQQ6gY5AV0pZJusXa4CpXg%40mail.gmail.com?utm_medium=email&utm_source=footer>
>> .
>>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To view this discussion visit
> https://groups.google.com/d/msgid/tesseract-ocr/CAFP60fpUCz1LFq_aqk0ea6W8GR7a7mrX5%3DPdZhv6%3Dn6t-1YVrg%40mail.gmail.com
> <https://groups.google.com/d/msgid/tesseract-ocr/CAFP60fpUCz1LFq_aqk0ea6W8GR7a7mrX5%3DPdZhv6%3Dn6t-1YVrg%40mail.gmail.com?utm_medium=email&utm_source=footer>
> .
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAAo-6adqVtsaoEhFxwwiXc%2Brx6uCi2zx4q7viYBZJWJMYVeeQA%40mail.gmail.com.

Reply via email to