Hello,
Go this url (http://code.google.com/p/tesseract-ocr/wiki/FAQ), look
for How do I recognize only digits? That can be modified however before you
try it read the comments on that wiki page as the instructions there are
partly wrong and the comments have the correct comments.
Cheers,
Neil
On 20 April 2010 09:22, Ramon <[email protected]> wrote:
> Hi, i'm using latest version from repository ( v3?)
>
> My ocr training language is catalan. I'm using spanish trainset from
> download page in this google group to train all characters.
>
> My word list is about 500.000 words (in fact there are 250.000
> lowercase and uppercase versions of a word) and ocr works fast in
> recnogtion (with 5 min. creating the dawg file) and with very good
> precision (if the word is in the txt file tesseract will fix any
> misspelling in image).
>
> next step is avoiding ( | > I ) errors, I'm reading how to constrain
> the character set to use in recognition. There is any file to do
> that?.
>
> I miss more v3 training information.
>
> Ramon.
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To post to this group, send email to [email protected].
> To unsubscribe from this group, send email to
> [email protected]<tesseract-ocr%[email protected]>
> .
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en.
>
>
--
--
Neil Benn Msc
Director
Ziath Ltd
Phone :+44 (0)7508 107942
Website - http://www.ziath.com
IMPORTANT NOTICE: This message, including any attached documents, is
intended only for the use of the individual or entity to which it is
addressed, and may contain information that is privileged, confidential and
exempt from disclosure under applicable law. If the reader of this message
is not the intended recipient, or the employee or agent responsible for
delivering the message to the intended recipient, you are hereby notified
that any dissemination, distribution or copying of this communication is
strictly prohibited. If you have received this communication in error,
please notify Ziath Ltd immediately by email at [email protected]. Thank you.
--
You received this message because you are subscribed to the Google Groups
"tesseract-ocr" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to
[email protected].
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en.