[tesseract-ocr] Re: Image pre-processing pipeling for a general image captured from camera.

Andres Thu, 15 Jun 2017 21:54:07 -0700

Some things you can do:

1 - If your image is color, take profit of that, transform it to HSV and 
use H (mainly) to filter by color, with that you will take out most of the 
pixels that you don't want. Before starting programming, try that with 
photoshop or something similar to see how much could that help.
2 - Search for some blobs to figure out the size of your characters and 
then adjust the parameters of your adaptive threshold filter with that
3 - Find the paragraphs using your own method (this is not an easy thing) 
and then segment the characters (not easy either), extract the characters 
by yourself and OCR it separately


I would recommend you to start with 1 and 2. 3 requires a lot of work and 
good knowledge of image processing with libraries like OpenCV.
--

El jueves, 15 de junio de 2017, 15:13:22 (UTC-3), [email protected] 
escribió:
>
> Hey! Thanks for your suggestion
>
> Can you just give me a brief outline of the general pre processing step if 
> I have this kind of images.
>
>
>
> On Thursday, June 15, 2017 at 12:17:24 AM UTC+5:30, Andres wrote:
>>
>> The things that you mentione are not just details, they are subject of 
>> wide study and specialization.
>>
>> From all the books from computer vision / image processing that I have, 
>> the best one that addresses your enquiries is "Algorithms for Image 
>> Processing and Computer Vision", from J.R. Parker, 2nd edition. ISBN 
>> 978-0-470-64385-3
>>
>> Cheers,
>>
>> Andres
>>
>>
>> El domingo, 11 de junio de 2017, 16:06:43 (UTC-3), [email protected] 
>> escribió:
>>>
>>> I am trying to do do OCR using tesseract on images. I am unable to 
>>> figure out a proper pre processing technique for the same.
>>>
>>> the problems I am facing is:
>>>
>>> 1. Low contrast images: The images have different texts with different 
>>> font sizes. So what should be my approach to enhance the contrast of any 
>>> image.
>>>
>>> 2. Problem of touching characters: Sometimes after applying adaptive 
>>> thresholding I am facing the problem of touching characters (in which two 
>>> adjacent characters are touching each other) What is the best way to figure 
>>> out a solution for that.
>>>
>>> 3. Problem of non uniform illumination: How should I proceed if I want 
>>> to solve the problem of non uniform illumination ? 
>>>
>>> How can image segmentation solve my problem ?
>>>
>>> I have added a sample image. Assume that the image is not rotated as it 
>>> is there in the picture. But the variety of font sizes and the text 
>>> segments in the image are exact replica of what I am asking about ? Apart 
>>> from above mentioned steps, I would appreciate any  kind of suggestion for 
>>> pre - processing of the above image. Let me know if you have worked out a 
>>> solution for something related to this. 
>>>
>>> Thanks
>>>
>>>
>>>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/82b1db93-41f4-4e07-84dd-d49630ef4caa%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

[tesseract-ocr] Re: Image pre-processing pipeling for a general image captured from camera.

Reply via email to