Hi David, Can you help me out , how can I use tesseract to OCR numbers of digitanl fonts (7 segment display). Did you figure out how to train tesseract for 7 segment display numeric language? Thanks, Jamil
On Monday, January 28, 2013 2:59:50 AM UTC-5, David Peter Lisin Crespo wrote: > > Good morning Seema, > > as for training tesseract for seven segment ocr, i also asked in the > forum but did not find a reply. In the end i simply used opencv. > > Steps where: > > - Convert image to black and white > - Clean the image (erosion, dilation, etc) > - Contour detection (works very well). > - Once I have countours, i made three lines. One that cuts vertically the > contour by the half. And two horizontal lines at 1/4 of contour height and > 3/4. > - Set point were segments cross the lines to get 7 points (the vertical > lines cut three segments) and the horizontal cut both two (upper and lower > segments) > The check for pixel value. If black consider segment as active. > With resultng segments, you get which number is in use. > > Hope this helps ;) > > PD: On this page you can see similar algorithm used: > http://www.unix-ag.uni-kl.de/~auerswal/ssocr/ > Raj found how to do it with tesseract, but was not able to answer. > https://groups.google.com/forum/?fromgroups=#!topic/tesseract-ocr/elnIngFJvQs > Good paper describing similar process to what i had to do: > http://morgoth.zemris.fer.hr/people/Marko.Cupic/files/2009-SP-MIPRO.pdf > > Hope it works out for you ;) > > Kind regards, > > David Lisin > > > 2013/1/27 Seema Shettar <[email protected] <javascript:>> > >> >> >> On Thursday, March 5, 2009 3:03:03 AM UTC-4, Raj wrote: >>> >>> Hi All.... >>> >>> i'm newbie and want 2 use tesseract ocr for detecting 7-segment >>> display. >>> >>> for this i'm using c#.net 2005 and a image processing open source >>> (opencv) and c# wrapper emgucv to achieve the task. >>> >>> i have removed noise from the image before passing it to tesseract >>> ocr engine. >>> >>> but i'm getting mixed results. like for digit ' 0' -- detecting as 11. >>> digit '6' as 5 >>> >>> i read about training the tesseract. Is it possible for me to train >>> the " 7 segment display ". >>> >>> If yes, then please tell me the process how can i get tesseract to >>> train. >>> >>> >>> >>> Thank U. >>> >>> >>> -- >> -- >> You received this message because you are subscribed to the Google >> Groups "tesseract-ocr" group. >> To post to this group, send email to [email protected] >> <javascript:> >> To unsubscribe from this group, send email to >> [email protected] <javascript:> >> For more options, visit this group at >> http://groups.google.com/group/tesseract-ocr?hl=en >> >> >> >> > > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/c24bd163-3512-4b7c-8e8b-c139be772d75%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

