Re: Modified Tesseract

2011-09-30 Thread Calomer
at my end. However, I'm still concerned on using Tesseract-OCR with my preferred settings (first part of the initial post). Thanks, Cihan On Sep 29, 10:59 am, Calomer wrote: > I understand that you can edit box files with any editor (even text > editor) and check it. Been there, do

Re: Illegal feature parameter spec!

2011-09-30 Thread Calomer
of the box file) from > the box file > or the last character(s.no:54 of the box file), which avoids the crash ?. > > > > > > > > On Fri, Sep 30, 2011 at 1:34 PM, Calomer wrote: > > Merve, > > > Do you receive the error when you train again, or when you

Re: Illegal feature parameter spec!

2011-09-30 Thread Calomer
Merve, Do you receive the error when you train again, or when you are using the trained file? Which version are you using ? Did you try to use your (successful) trained file ? I have bumped into an old problem like yours on http://code.google.com/p/tesseract-ocr/issues/detail?id=490. Official p

Re: user-words

2011-09-30 Thread Calomer
s. Thanks On Sep 29, 10:39 pm, Sven Pedersen wrote: > Thanks Calomer. > > Bonny, is the language you're trying to improve using a different set > of characters (alphabet)? If so, you'll need to do a lot of training > as Calomer described. Otherwise you'll just need

Re: user-words

2011-09-29 Thread Calomer
I'll try my best to answer, tho I'm hardly eligible. According to training instructions (on http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3) and general OCR knowledge, you cannot train solely by new characters. You need training images, you need to create boxes (with any box editor

Modified Tesseract

2011-09-29 Thread Calomer
I understand that you can edit box files with any editor (even text editor) and check it. Been there, done that, awesome feature. I'm curious if it is possible to feed tesseract predefined boxes for it to just use OCR inside ? I'll make sure that all the boxes have only one character inside, promi