Merging in new fonts with english

2013-05-21 Thread ajacks
Would anyone be able to point me to the tesseract-ocr 3.02 english training data (the box/tif/.tr files associated with the standard eng.traineddata) as well as the corresponding font_properties file? I didn't find it in the files provided in tesseract-ocr-3.002.eng.tar.gz or in downloads. I did

Re: Non-deterministic results of Tesseract

2013-05-21 Thread zdenko podobny
Try to clear the adaptive classifier. See explanation in forum archive[1]. [1] https://groups.google.com/forum/#!msg/tesseract-ocr/ByGJhocI9qQ/U4wciLYfDDsJ Zdenko On Tue, May 21, 2013 at 5:19 PM, Alexander Chigorin wrote: > Hello! > > I am running the same instance of Tesseract Engine over and

Non-deterministic results of Tesseract

2013-05-21 Thread Alexander Chigorin
Hello! I am running the same instance of Tesseract Engine over and over the same image, and getting slightly different result strings. Is there a way to fix this? P.S. According to Issue 233 (http://code.google.com/p/tesseract-ocr/issues/detail?id=233) non-determinism in line fitting has been f

Re: Tesseract API example

2013-05-21 Thread TP
On Tue, May 21, 2013 at 2:59 AM, Arthur Ozga wrote: > I am looking to use Tesseract as a backend for a web-based OCR app using > visual c++ and the .Net framework, written using Visual Studio 2010/2012. > Unfortunately, I don't really understand how to integrate Tesseract in the > system using th

Tesseract API example

2013-05-21 Thread Arthur Ozga
I am looking to use Tesseract as a backend for a web-based OCR app using visual c++ and the .Net framework, written using Visual Studio 2010/2012. Unfortunately, I don't really understand how to integrate Tesseract in the system using the api calls in baseapi.h Could anyone tell me what #include

Re: jTessBoxEditor 0.6 Beta release

2013-05-21 Thread Shree Devi Kumar
Mamata, Please see https://code.google.com/p/tesseract-ocr/downloads/list for the available language data friles for tesseract 3.02. In case Odia is similar to bangala, you can use the bengali traineddata to bootstrap for odia. Shree Shree Devi Kumar __

Re: jTessBoxEditor 0.6 Beta release

2013-05-21 Thread mamata nayak
Sir Can you please tell me, the recent list of indian languages those are trained the tesseract-ocr engine. Thank you On Sun, May 12, 2013 at 12:23 PM, Shree Devi Kumar wrote: > Are you training Odia language? > > Have you seen > http://tdil-dc.in/tdildcMain/articles/374232Odia%20Script%20Gramm

Re: integrating other spellchecking tools

2013-05-21 Thread francis . tyers
El dilluns 20 de maig de 2013 16:48:12 UTC, zdenop va escriure: > > On Mon, May 20, 2013 at 4:08 PM, >wrote: > >> >> >> El dilluns 20 de maig de 2013 13:29:38 UTC, jimregan va escriure: >> >>> On Saturday, 18 May 2013 12:51:54 UTC+1, franci...@gmail.com wrote: >>> Hi, >>> Hi Fran.

Re: integrating other spellchecking tools

2013-05-21 Thread Nick White
Hi Fran, On Mon, May 20, 2013 at 07:08:35AM -0700, francis.ty...@gmail.com wrote: > Looks like it is this one: > > https://android.googlesource.com/platform/external/tesseract/+/ > d544c9231465999ad600ec13614b4d69d351798d/ > > The date is 3 years and 10 months ago. Have any substantial improve