Re: Is there minimum of letters?

2011-10-24 Thread merve t
Yes, can you explain PSM 8? Is it something like PSM_AUTO, should i change pagesegmode to PSM_8? In this mail group, i was advised to give tesseract characters one by one. Thus i must learn how to make tesseract recognize alone chars in images. Thanks in advance. 2011/10/24 patrickq > What's PSM

Re: Choice Iterator

2011-10-24 Thread merve t
Hello, First of all i want to confirm that i understand choice iterator right. For example if i have a word on an image "scope", choice iterator must give me something like "s" and maybe after Next(), "5". for 3. letter "o" it maybe gives me "0", after Next() "O" and after Next() "o". Do i und

Re: Help with tables

2011-10-24 Thread Sven Pedersen
Tesseract won't handle the lines of the table, but you should be able to remove them through image processing and construct a table from the spaces (perhaps regular expressions would help there). If you post an example image we might be able to help. --Sven On Mon, Oct 24, 2011 at 12:47 PM, Fede

Re: SetImage(pix) vs SetImage(raw...)

2011-10-24 Thread dh
In the event these thoughts might prove useful to others... While I've not yet proven this, I currently theorize that the SetImage(raw) functionality does not correctly handle images where the physical raster storage is longer (rounded to dword) than the logical raster length. Code flow (as resul

required includes for using TessBaseAPI

2011-10-24 Thread exactae
I've downloaded, compiled, and tested tesseract 3.01 from the commandline and would now like to begin testing using TessBaseAPI. (note: build system is Windows MS vs2010 - using the provided .sln file) I think there is some basic configuration problem that I'm missing...I'd sure appreciate a poin

Re: Is there minimum of letters?

2011-10-24 Thread patrickq
What's PSM? Alternative spelling for PMS :-)? On Oct 24, 1:35 pm, Quan Nguyen wrote: > Try with PSM 8 or 10. > > On Oct 24, 9:09 am, "Giuseppe Menga" wrote: > > > > > > > > > That is interesting. I'm recognizing espiration dates from medicines, and I > > found convenient to repeat the date 3 or

Re: Is there minimum of letters?

2011-10-24 Thread patrickq
The basic reason it helps Tesseract to repeat text is because Tesseract makes an initial assumption what kind of letters it is looking at: tall (digits, uppercase letters, tall lowercase) or lowercase letters. Only after it makes that assumption / guess will it try to match the letters against the

Re: Is there minimum of letters?

2011-10-24 Thread Quan Nguyen
Try with PSM 8 or 10. On Oct 24, 9:09 am, "Giuseppe Menga" wrote: > That is interesting. I'm recognizing espiration dates from medicines, and I > found convenient to repeat the date 3 or 4 times, it improves recognition. > Someone can explain the reason. > Giuseppe > > -Messaggio originale---

Help with tables

2011-10-24 Thread Fede
Hello! I'm new but I'm going crazy trying to pass a table to text Agreadeceria a little help and say if this is possible with Tesseract thanks -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to tesseract-ocr@go

Re: Is there minimum of letters?

2011-10-24 Thread Giuseppe Menga
That is interesting. I'm recognizing espiration dates from medicines, and I found convenient to repeat the date 3 or 4 times, it improves recognition. Someone can explain the reason. Giuseppe -Messaggio originale- From: Adlerfalke Sent: Monday, October 24, 2011 3:29 PM To: tesseract-oc

Is there minimum of letters?

2011-10-24 Thread Adlerfalke
Hello, i can't found anything about, how many letters/numbers an tif must have, so that tesseract can find these. For example I have an picture only with the number 3, but tesseract don't detect this number. But if i put 3 3 3 3 in my picture tesseract detect the numbers. So my Question is: What

HELP WITH RECOGNIZE tif image

2011-10-24 Thread Enrico
Hello all, I am trying to convert the TIF image to text using Tesseract OCR. I try to convert with command line, but not all words are convert. Can someone guide me on how to efficiently use Tesseract for the conversion of image to text? And another question, can someone explain me how to r