Re: Tess3.01 not recognizing my curly double quotes.

2012-07-23 Thread Galt
That's great news, Nick! I can't wait to try it on the old Irish fonts! -Galt

Re: Tess3.01 not recognizing my curly double quotes.

2012-07-03 Thread Nick White
On Fri, Jun 01, 2012 at 10:16:52AM +0100, Nick White wrote: > On Wed, May 23, 2012 at 05:39:00PM +0100, Nick White wrote: > > On Tue, May 22, 2012 at 05:21:23AM -0700, Galt wrote: > > > On May 21, 2:04 am, Nick White wrote: > > > > I've been suffering a very similar problem with some of the text I

Re: Tess3.01 not recognizing my curly double quotes.

2012-06-01 Thread Nick White
On Wed, May 23, 2012 at 05:39:00PM +0100, Nick White wrote: > On Tue, May 22, 2012 at 05:21:23AM -0700, Galt wrote: > > On May 21, 2:04 am, Nick White wrote: > > > I've been suffering a very similar problem with some of the text I'm > > > training, which has several diacritics above and below glyp

Re: Tess3.01 not recognizing my curly double quotes.

2012-05-23 Thread Nick White
On Tue, May 22, 2012 at 05:21:23AM -0700, Galt wrote: > On May 21, 2:04 am, Nick White wrote: > > I've been suffering a very similar problem with some of the text I'm > > training, which has several diacritics above and below glyphs. It > > isn't infrequent to find quite a few lines of garbage whi

Re: Tess3.01 not recognizing my curly double quotes.

2012-05-22 Thread Galt
On May 21, 2:04 am, Nick White wrote: > Hi Galt, > > I've been suffering a very similar problem with some of the text I'm > training, which has several diacritics above and below glyphs. It > isn't infrequent to find quite a few lines of garbage which are some > of the diacritics taking a line,

Re: Tess3.01 not recognizing my curly double quotes.

2012-05-21 Thread Nick White
Hi Galt, I've been suffering a very similar problem with some of the text I'm training, which has several diacritics above and below glyphs. It isn't infrequent to find quite a few lines of garbage which are some of the diacritics taking a line, which then causes the following and preceding lines

Re: Tess3.01 not recognizing my curly double quotes.

2012-05-20 Thread Stane
Not sure if it helps, but you could try using dictionaries or modifying the unicharambigs file during your training.

Re: Tess3.01 not recognizing my curly double quotes.

2012-05-16 Thread Galt
FOLLOW UP: THIS DOES NOT REALLY WORK; it is only a misleading trick. If I remove the fuzzies that had appeared as anti-aliasing effects of using convert with -resize before -monochrome, then the fuzzies disappear and so does the beneficial effect on quotes. It does not appear that the 300 dpi was re

Re: Tess3.01 not recognizing my curly double quotes.

2012-05-14 Thread Galt
Right now I have been forced by this problem to use 300dpi (instead of 600dpi which is what I actually scanned at). Since the box finder never joins them as double-curlys by itself, I have taken to defining the single curlys left and right, and then will use either ambigs or a post-processing step
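The post-processing idea mentioned above could look roughly like the following. This is only a sketch, not Galt's actual script: it assumes the OCR output contains each double curly quote as two adjacent single curly quotes (optionally separated by one space or tab), and the function name join_curly_quotes is made up for illustration.

```python
import re

# Unicode code points for the curly quotes involved.
LEFT_DOUBLE = "\u201c"   # left double quotation mark
RIGHT_DOUBLE = "\u201d"  # right double quotation mark

# Group 1: two left single quotes; group 2: two right single quotes.
# At most one space or tab is tolerated between the two halves (an assumption).
_PAIR = re.compile("(\u2018[ \t]?\u2018)|(\u2019[ \t]?\u2019)")


def join_curly_quotes(text):
    """Replace pairs of single curly quotes with one double curly quote.

    Hypothetical helper: lone single quotes (e.g. apostrophes) are left alone.
    """
    def _replace(match):
        return LEFT_DOUBLE if match.group(1) else RIGHT_DOUBLE

    return _PAIR.sub(_replace, text)
```

A text that legitimately contains two single quotes in a row (a nested quotation ending at the same point, for instance) would be merged as well, so this would need checking against real output.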

Re: Tess3.01 not recognizing my curly double quotes.

2012-05-14 Thread Galt
I found a message in the forum in which someone recommends scanning Nepali text at 600dpi, which implies that Tess is supposed to work on it. Letters which had a small amount of space between them at 600 dpi will sometimes lose that at 300dpi.

Re: Tess3.01 not recognizing my curly double quotes.

2012-05-13 Thread Galt
I have found that there is a scale-dependency in the curly quotes handling. If I create 300 dpi versions of my scans, then Tess3.01 begins working much better. That is a huge relief and makes tess usable. I wish I could use the 600dpi scans. I have them. Seems like this might be a little bug. May
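For anyone wanting to reproduce the 300 dpi versions, a rough sketch follows. It assumes ImageMagick's convert is on the PATH and uses the -resize then -monochrome order mentioned in the follow-up in this thread; the 50% factor (600 dpi to 300 dpi), the function names, and the file names are illustrative assumptions, not the exact command used. Note that the follow-up reports the improvement came from anti-aliasing side effects rather than from the resolution itself.

```python
import subprocess


def build_downscale_command(src, dst, percent=50):
    """Build the ImageMagick argument list (hypothetical helper).

    -resize runs before -monochrome, matching the order described in the thread.
    """
    return ["convert", src, "-resize", f"{percent}%", "-monochrome", dst]


def downscale(src, dst, percent=50):
    """Run the conversion; raises CalledProcessError if convert fails."""
    subprocess.run(build_downscale_command(src, dst, percent), check=True)
```

For example, downscale("page-600dpi.tif", "page-300dpi.tif") would halve the pixel dimensions of a hypothetical 600 dpi scan.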

Tess3.01 not recognizing my curly double quotes.

2012-05-12 Thread Galt
Tess3.01 has a lot of trouble recognizing my curly double quotes. Unfortunately, my scans have lots of dialog with these in them. My Irish font is one with diacriticals. It has accents over vowels and dots over consonants. In addition, the uppercase letters are just larger versions of the lower ca