Did you try the "psm" switch (look for it in the forum)? Your own segmentation? Both combined?
Warm regards, Dmitri Silaev www.CustomOCR.com On Tue, Feb 14, 2012 at 1:55 AM, John Williams <jdwilliams1...@gmail.com> wrote: > If I duplicate the column 9 times, so that there's ten columns with the same > numbers, it reads it correctly. Running these results through the training > tools didn't help it recognize the original image, though. Running tesseract > on images with a single digit yielded nothing as well. > > In my program, do I have to programatically duplicate my column of numbers > several times and then figure out what the result was supposed to be... or > can I train tesseract to recognize a single column? I suppose duplicating it > will work, but it seems like a bad hack. > > On Mon, Feb 13, 2012 at 10:42 AM, Chris <cmgreen...@gmail.com> wrote: >> >> I'd try segmenting the numbers out yourself and feeding them into >> tesseract as individual characters. Might work better than feeding it >> the whole image. >> >> Make sure you put some padding around each character. >> >> On Feb 13, 1:56 am, JD <jdwilliams1...@gmail.com> wrote: >> > I'm using v 3.01 on Windows 7 to perform OCR on another program. I >> > don't have access to the fonts the program is using, so I trained >> > tesseract using some screenshots, and so far the text recognition is >> > far better than I expected. However, when I try to process a >> > screenshot that contains only a few numbers, it doesn't match anything >> > at all. If was matching garbage, or the wrong numbers, then I'd just >> > keep working on improving the training... but it doesn't find >> > anything. Does anyone have a suggestion about what I should try? >> > >> > It doesn't look like I can attach a screenshot, but the numbers are in >> > a column... something like this: >> > >> > 10 >> > 13 >> > 14 >> > 15 >> > 17 >> > >> > I pre-process the screenshots so the text is black on white. I also >> > zoom in on the images, so they're slightly blurred (only very >> > slightly)... but the text recognition is near perfect, so I don't >> > think that's an issue. Plus, it seems like it should find SOMETHING. >> >> -- >> You received this message because you are subscribed to the Google >> Groups "tesseract-ocr" group. >> To post to this group, send email to tesseract-ocr@googlegroups.com >> To unsubscribe from this group, send email to >> tesseract-ocr+unsubscr...@googlegroups.com >> For more options, visit this group at >> http://groups.google.com/group/tesseract-ocr?hl=en > > > -- > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > To post to this group, send email to tesseract-ocr@googlegroups.com > To unsubscribe from this group, send email to > tesseract-ocr+unsubscr...@googlegroups.com > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to tesseract-ocr@googlegroups.com To unsubscribe from this group, send email to tesseract-ocr+unsubscr...@googlegroups.com For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en