Hi Jon,
Like each morning, I check my emails and I saw those headstones Images from Graves. I am a God fearing person. So, I was not able to ignore your email. Regarding the preprocessing step, I suggest to apply Local Minima method for background removal. However, you might require to adjust your window size in order to achieve the best results. I did some experiments with the MATLAB code, and I got some good results. Testing on a larger sample set, may improve the step. Please tell me what project you are working on, maybe I will be able to contribute better? Just lemme know if you need any type of help! Best Regards, Vicky From: tesseract-ocr@googlegroups.com [mailto:tesseract-ocr@googlegroups.com] On Behalf Of Jon Andersen Sent: Monday, February 21, 2011 07:32 To: tesseract-ocr@googlegroups.com Subject: Image pre-processing for good OCR results Hi, My project at http://RecordAGrave.com is about recording headstones from graves and posting the text and images on the Net so that people can research their family history. I would appreciate some advice on how to pre-process these headstone images to get the best results from Tesseract OCR. I have thousands of 1-2 MB jpg images of headstones to process. Example images: http://freepages.genealogy.rootsweb.ancestry.com/~janderse/cemeteries/Star%2 0of%20David%20Memorial%20Gardens/Garden%20of%20Haifa%20-%20Raw/IMG_28215.jpg http://freepages.genealogy.rootsweb.ancestry.com/~janderse/cemeteries/Star%2 0of%20David%20Memorial%20Gardens/Garden%20of%20Haifa%20-%20Raw/IMG_28216.jpg http://freepages.genealogy.rootsweb.ancestry.com/~janderse/cemeteries/Star%2 0of%20David%20Memorial%20Gardens/Garden%20of%20Haifa%20-%20Raw/IMG_28217.jpg I am a software developer so I can script up pre-processing steps to prepare the input for Tesseract. Any advice on improving OCR accuracy through pre-processing steps? Thanks so much, -Jon -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to tesseract-ocr@googlegroups.com. To unsubscribe from this group, send email to tesseract-ocr+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to tesseract-ocr@googlegroups.com. To unsubscribe from this group, send email to tesseract-ocr+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.