Re: noise output

2011-03-04 Thread Saurabh Gandhi
Thanks for the prompt response. Will work on these and get back with more specific doubts. -- Regards, Saurabh Gandhi On Sat, Mar 5, 2011 at 11:52 AM, Dmitry Silaev wrote: > There are tons of. And I believe, no ready recipe can be used > universally, this is very task-specific, especially in

Fwd: noise output

2011-03-04 Thread Dmitry Silaev
There are tons of them. I believe no ready recipe can be used universally; this is very task-specific, especially for photographic images. I also believe that to do good text detection your algorithm should to some extent mimic human behavior, so it should probably be multi-stage, gradually refining results a

Re: noise output

2011-03-04 Thread Saurabh Gandhi
Hey, any algorithm / whitepaper suggestions for text extraction, especially if the text is not overlay text but part of the image itself? Most algorithms I saw on the internet are compute-intensive. -- Regards, Saurabh Gandhi On Sat, Mar 5, 2011 at 11:20 AM, Dmitry Silaev wrote: > Zdravko

Re: noise output

2011-03-04 Thread Dmitry Silaev
Zdravko, You should do text detection before passing images to Tesseract. Text detection is the process of determining which image regions contain text. Even if an image contains no text, Tesseract will still treat it as an image of text. Before recognition, Tess applies a so-called binarization a
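A minimal sketch of the kind of pre-check the advice above points at: binarize the image and decide whether it has enough contrast and enough dark pixels to be worth sending to OCR at all. This is not Tesseract's own code and not a full text detector. It uses Otsu's method as one common global binarization, and the names `looks_blank`, `min_contrast` and `min_ink`, along with their default values, are hypothetical choices for illustration. It assumes dark text on a light background and an image already flattened to a list of 0-255 gray values.

```python
def otsu_threshold(pixels):
    """Return Otsu's global threshold for a flat list of 0-255 gray values."""
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    sum_all = sum(i * h for i, h in enumerate(hist))
    best_t, best_var = 0, -1.0
    w_dark = 0      # number of pixels with value <= t
    sum_dark = 0    # sum of those pixel values
    for t in range(256):
        w_dark += hist[t]
        if w_dark == 0:
            continue
        w_light = total - w_dark
        if w_light == 0:
            break
        sum_dark += t * hist[t]
        mean_dark = sum_dark / w_dark
        mean_light = (sum_all - sum_dark) / w_light
        # Between-class variance (up to a constant factor).
        var_between = w_dark * w_light * (mean_dark - mean_light) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def looks_blank(pixels, min_contrast=50, min_ink=0.002):
    """Heuristic gate (assumed thresholds): True if OCR is probably pointless."""
    t = otsu_threshold(pixels)
    dark = [p for p in pixels if p <= t]
    light = [p for p in pixels if p > t]
    if not dark or not light:
        return True  # a single flat gray level
    contrast = sum(light) / len(light) - sum(dark) / len(dark)
    if contrast < min_contrast:
        return True  # the two classes are only sensor noise apart
    return len(dark) / len(pixels) < min_ink  # just a few specks


# Hypothetical inputs: a light page with faint noise, and a page with dark strokes.
noisy_blank = [250, 252, 255, 251] * 250
with_text = [240] * 950 + [30] * 50
print(looks_blank(noisy_blank), looks_blank(with_text))
```

The contrast check matters because Otsu's method always splits the histogram into two classes, even on an almost blank image, which is one way noise can end up looking like ink after binarization.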

Re: can't read frequent_words_list file

2011-03-04 Thread zdenko podobny
Please provide more information: how you are trying to create the dictionary, your platform, and the exact version of Tesseract (and maybe how you got it). Zdenko On Fri, Mar 4, 2011 at 2:50 PM, Sang Đặng Minh wrote: > hi all. my name is Sang. I'm trying to train Tesseract 2.0, everything > is ok, but i can't create DAWG

noise output

2011-03-04 Thread zdravco
Hello, I am using tesseract in my project after some image pre-processing. There are some false negatives I was hoping tesseract would eliminate by producing no output. However, sometimes there is a strange output that I get from almost blank images. Here is the sample image: https://picasaweb.goo

can't read frequent_words_list file

2011-03-04 Thread Sang Đặng Minh
Hi all. My name is Sang. I'm trying to train Tesseract 2.0. Everything is OK, but I can't create DAWG files; the error is: Could not open file frequent_words_list. Please help me! Thanks a lot! -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. T

can't read frequent_words_list file

2011-03-04 Thread Sang Đặng Minh
Hi all, my name is Sang. I am trying to train Tesseract 2.0, but I can't create DAWG files. The error message is: Could not open file: frequent_words_list. Please help me! Thanks a lot! -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to