Jose,

I ran Tesseract revision 549 from the command line under Windows with
no special config and got segmentation that is almost correct.
Which language file do you use? I used the following command line:

tesseract 3.tiff test3 -l eng

both without a pageseg_mode (the -psm argument) and with one, and the
result was satisfactory every time.
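
In case it helps to compare runs side by side, here is a rough Python
sketch that builds the same command with and without -psm. The helper
names (build_command, run_variants) and the output base names are made
up for illustration, and the mode values 3, 4 and 6 are only examples.
It assumes the 3.x-style -psm flag used above; later Tesseract releases
spell it --psm.

```python
import shutil
import subprocess


def build_command(image, output_base, lang="eng", psm=None):
    """Build the argument list for one Tesseract run.

    psm=None leaves the page segmentation mode at Tesseract's default;
    any other value is passed through the 3.x-style -psm flag.
    """
    cmd = ["tesseract", image, output_base, "-l", lang]
    if psm is not None:
        cmd += ["-psm", str(psm)]
    return cmd


def run_variants(image, psm_values=(None, 3, 4, 6)):
    """Run Tesseract once per mode, using a separate output base for each."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    for psm in psm_values:
        # Hypothetical naming scheme: out_default, out_psm3, out_psm4, ...
        base = "out_default" if psm is None else f"out_psm{psm}"
        subprocess.run(build_command(image, base, psm=psm), check=True)


if __name__ == "__main__":
    # Print the commands instead of running them, so no Tesseract is needed.
    for psm in (None, 3, 4, 6):
        print(" ".join(build_command("3.tiff", "test3", psm=psm)))
```

Calling run_variants("3.tiff") would then leave one text file per mode
to diff against each other.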

Let me know the details of your command line and OS.

Warm regards,
Dmitry Silaev





On Sun, Mar 13, 2011 at 11:18 PM, patrickq
<patrick.questemb...@gmail.com> wrote:
> You expect way too much from Tesseract: it's not Tesseract's job to
> slice and dice the text according to various organizational
> requirements of applications - that's for the application to handle.
> You can get the coordinates of all characters and easily determine
> which ones are in what you consider the first column and which are in
> the 2nd column. In ScanBizCards' case considering our target material,
> we treat each line as a single number formed of two sequences - but if
> we wanted to treat the input as columns, it would take us a mere 20
> minutes of coding to organize the results that way. We actually don't
> even pay attention to where Tesseract thinks lines end and start, we
> figure that out ourselves based on coordinates. It's not hard.
>
> Patrick
>
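
The coordinate-based grouping Patrick describes could look roughly like
the Python sketch below. This is not ScanBizCards' code. The CharBox
type, the function names and both pixel thresholds are assumptions made
up for illustration, and the boxes are assumed to use a top-left origin
(y grows downward). It does not call Tesseract; it only post-processes
character boxes you have already obtained.

```python
from typing import List, NamedTuple


class CharBox(NamedTuple):
    """One recognized character with its bounding box (hypothetical type)."""
    char: str
    left: int
    top: int
    right: int
    bottom: int


def group_into_lines(boxes: List[CharBox], y_tolerance: int = 10) -> List[List[CharBox]]:
    """Cluster boxes into text lines by vertical centre, top line first."""
    lines = []
    for box in sorted(boxes, key=lambda b: (b.top + b.bottom) / 2):
        centre = (box.top + box.bottom) / 2
        # Compare against the centre of the first box placed in the current line.
        if lines and abs(centre - lines[-1]["centre"]) <= y_tolerance:
            lines[-1]["boxes"].append(box)
        else:
            lines.append({"centre": centre, "boxes": [box]})
    # Within each line, order the characters left to right.
    return [sorted(line["boxes"], key=lambda b: b.left) for line in lines]


def split_columns(line: List[CharBox], column_gap: int = 40) -> List[str]:
    """Split one line into cells wherever the horizontal gap exceeds column_gap."""
    cells = []
    current = [line[0]]
    for prev, box in zip(line, line[1:]):
        if box.left - prev.right > column_gap:
            cells.append(current)
            current = []
        current.append(box)
    cells.append(current)
    return ["".join(b.char for b in cell) for cell in cells]


def to_table(boxes: List[CharBox]) -> List[List[str]]:
    """Return one list of column strings per text line."""
    return [split_columns(line) for line in group_into_lines(boxes)]


# Made-up sample: two lines, each with two digit groups separated by a wide gap.
sample = [
    CharBox("1", 10, 10, 20, 30), CharBox("2", 22, 10, 32, 30),
    CharBox("3", 100, 10, 110, 30), CharBox("4", 112, 10, 122, 30),
    CharBox("5", 10, 50, 20, 70), CharBox("6", 22, 50, 32, 70),
    CharBox("7", 100, 50, 110, 70), CharBox("8", 112, 50, 122, 70),
]

if __name__ == "__main__":
    for row in to_table(sample):
        print(row)
```

The two thresholds would need tuning to the scan resolution and font
size; a fixed gap is the simplest choice, not necessarily the best one.
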
> On Mar 13, 4:10 pm, Jose <diox...@gmail.com> wrote:
>> Hi Patrick,
>>
>> Yes, the results are correct, but the format of the results is not!
>> That's my trouble.
>
> --
> You received this message because you are subscribed to the Google Groups 
> "tesseract-ocr" group.
> To post to this group, send email to tesseract-ocr@googlegroups.com.
> To unsubscribe from this group, send email to 
> tesseract-ocr+unsubscr...@googlegroups.com.
> For more options, visit this group at 
> http://groups.google.com/group/tesseract-ocr?hl=en.
>
>

