Re: [Tesseract 3] English training text

2013-04-19 Thread sdk
I found the following links online. Don't know how applicable they are to 3.02 .. http://michaeljaylissner.com/blog/adding-new-fonts-to-tesseract-3-ocr-engine http://michaeljaylissner.com/files/standard-training-text.txt On Wednesday, February 9, 2011 10:18:32 PM UTC+5:30, Sly_bzh wrote: > > I

Re: [Tesseract 3] English training text

2011-02-22 Thread zdenko podobny
Dmitry, unfortunately I have not enough of time for tests :-(. I still hope Ray will release more info before final 3.01. At the moment I focus on box editor. BR, Zdenko On Tue, Feb 22, 2011 at 9:27 AM, Dmitry Silaev wrote: > Interesting. I was wondering about Cube since its traces began to >

Re: [Tesseract 3] English training text

2011-02-22 Thread Dmitry Silaev
Interesting. I was wondering about Cube since its traces began to appear in the source code but had no enough time to investigate it thorougly Zdenko, would you please kindly share your other findings on Cube? Regards, Dmitry On Tue, Feb 22, 2011 at 11:13 AM, zdenko podobny wrote: > I doubt tha

Re: [Tesseract 3] English training text

2011-02-22 Thread zdenko podobny
I doubt that google will release their (full) training set :-( Have a look at svn to file eng.cube.size [1]. You can see there name of fonts that was training for English in 3.01. As far as I understood there is (unpublished/not released) possibility to train language data directly on font files.

Re: [Tesseract 3] English training text

2011-02-21 Thread Dmitry Silaev
See this http://code.google.com/p/tesseract-ocr/downloads/detail?name=boxtiff-2.01.eng.tar.gz&can=2&q= This box/tiff set would work for 3.0x also On Wed, Feb 9, 2011 at 7:48 PM, Sly_bzh wrote: > I would like to train tesseract for English with some special fonts. > Tesseract training documenta

[Tesseract 3] English training text

2011-02-21 Thread Sly_bzh
I would like to train tesseract for English with some special fonts. Tesseract training documentation says that a text should be prepared and it must follow some important points (see http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3#Generate_Training_Images) Could someone provide to