Sriranga, Actually I don't understand why one needs to refer to the forum discussion you've just mentioned above, as I managed to build this traineddata file without writing a single line of code and even without a compiler, say Visual C++...
The value I can add is in that any user inexperienced in programming can make this traineddata file himself )) Warm regards, Dmitry Silaev On Thu, Mar 3, 2011 at 5:08 PM, Sriranga(78yrsold) <withblessi...@gmail.com> wrote: > Dmitry, > No I am NOT the first invented but actually credited to spohor...@sjm.com > -who helped me very lot including creating vcproj for combined traineddata > for windows. I am very thankful to him for his help/guidance rendered from > time to time. Without his help I would not succeeded to generate traineddata > file out of old datafiles All credits should go to Steve. Steve has already > explained in detail how to do in the forum discussion are available. > -sriranga(78yrs) > > On Thu, Mar 3, 2011 at 6:36 PM, Dmitry Silaev <daemons2...@gmail.com> wrote: >> >> Sriranga, >> >> Thanks for letting me know. You are the first one then, and I invented >> the bicycle )) >> However an article might be still of use instead of verbose forum >> discussion... >> May be you'd like to write it then? >> >> Warm regards, >> Dmitry Silaev >> >> >> >> >> >> On Thu, Mar 3, 2011 at 3:55 PM, Sriranga(78yrsold) >> <withblessi...@gmail.com> wrote: >> > Dimitry, >> > I had generated traineddata(Kannada) files sucessfully from the old >> > datafiles of 2.xx last year. There is discussion by spohorsky in the >> > forum >> > how to do. >> > sriranga(78) >> > ♫ >> > On Thu, Mar 3, 2011 at 5:42 PM, Dmitry Silaev <daemons2...@gmail.com> >> > wrote: >> >> >> >> Manuel, >> >> >> >> It's quite an interesting question although it may seem to be an >> >> ordinary newbie-like one. >> >> >> >> I was always wondering if 2.xx files can be used with version 3.xx. >> >> The wiki states that "the files in the traineddata file are different >> >> from the list used prior to 3.00, and will most likely change, >> >> possibly dramatically in future revisions." >> >> >> >> I have no time to investigate it in the code so I decided to act >> >> rather than to think. After some tinkering with all those files I >> >> slipped the resulted "por.traineddata" into my Tesseract algo I'm >> >> currently working at, and - guess what? - it worked! )) >> >> >> >> I must say it was tested only with a couple of *very simple* images >> >> and also it absolutely lacks any dictionary-related data. And my test >> >> images don't contain these specific Portuguese letters with >> >> diacritics. So in fact this file may perform poorly. Please test and >> >> report your results. The file is in the attachment. >> >> >> >> It was not difficult at all but also not so straight-forward to make >> >> this training data file, so probably this process deserves a separate >> >> article and later I'd like to post it in my blog. >> >> >> >> Warm regards, >> >> Dmitry Silaev >> >> >> >> >> >> >> >> >> >> >> >> On Wed, Mar 2, 2011 at 8:40 PM, manuelfhp <manuel...@gmail.com> wrote: >> >> > Helo list, >> >> > I can't find a solution for special chars >> >> > >> >> > I installed tesseract 3 in my MacOSX 10.6 >> >> > It is running very well >> >> > >> >> > But I'm having problems with charset. >> >> > I need tesseract working with brazillian portuguese. (ISO8859-1) >> >> > >> >> > I installed the portuguese dictionary but is not working with special >> >> > chars like Ç Ã É é .... (ISO8859-1) >> >> > Is there any solution ? >> >> > >> >> > There is an old dictionary special for brazilian portuguese in >> >> > version >> >> > 2.0.4. Is it possible to use in version 3? How? >> >> > >> >> > >> >> > -- >> >> > You received this message because you are subscribed to the Google >> >> > Groups "tesseract-ocr" group. >> >> > To post to this group, send email to tesseract-ocr@googlegroups.com. >> >> > To unsubscribe from this group, send email to >> >> > tesseract-ocr+unsubscr...@googlegroups.com. >> >> > For more options, visit this group at >> >> > http://groups.google.com/group/tesseract-ocr?hl=en. >> >> > >> >> > >> >> >> >> -- >> >> You received this message because you are subscribed to the Google >> >> Groups >> >> "tesseract-ocr" group. >> >> To post to this group, send email to tesseract-ocr@googlegroups.com. >> >> To unsubscribe from this group, send email to >> >> tesseract-ocr+unsubscr...@googlegroups.com. >> >> For more options, visit this group at >> >> http://groups.google.com/group/tesseract-ocr?hl=en. >> >> >> > >> > -- >> > You received this message because you are subscribed to the Google >> > Groups >> > "tesseract-ocr" group. >> > To post to this group, send email to tesseract-ocr@googlegroups.com. >> > To unsubscribe from this group, send email to >> > tesseract-ocr+unsubscr...@googlegroups.com. >> > For more options, visit this group at >> > http://groups.google.com/group/tesseract-ocr?hl=en. >> > >> >> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To post to this group, send email to tesseract-ocr@googlegroups.com. >> To unsubscribe from this group, send email to >> tesseract-ocr+unsubscr...@googlegroups.com. >> For more options, visit this group at >> http://groups.google.com/group/tesseract-ocr?hl=en. >> > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To post to this group, send email to tesseract-ocr@googlegroups.com. > To unsubscribe from this group, send email to > tesseract-ocr+unsubscr...@googlegroups.com. > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to tesseract-ocr@googlegroups.com. To unsubscribe from this group, send email to tesseract-ocr+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.