Re: Danish fraktur support in r319

Lars Aronsson Mon, 24 May 2010 10:20:35 -0700

Peter Alberti wrote:

I've trained tesseract r319 (3.0) to support Danish texts written in
fraktur. It is not
perfect but good enough that I hope it may be useful to others.

Jimmy O'Regan wrote:

With the current SVN version, you can use combine_tessdata -e
[trainingdata file] [files to extract] to extract the components you
want, and combine_tessdata [path to files] to make a new trainingdata
file.


I tried to compile the current version (svn -r354 up), but failed:

svshowim.cpp: In function ‘void sv_show_sub_image(IMAGE*, inT32, inT32,inT32, inT32, ScrollView*, inT32, inT32)’:svshowim.cpp:37: error: no matching function for call to‘ScrollView::Image(Pix*&, inT32&, int)’../viewer/scrollview.h:266: note: candidates are: voidScrollView::Image(const char*, int, int)


Versions 340, 351, 352, 353 also failed in the same place.

I was successful in compiling -r319, but then combine_tessdata
doesn't have all these flags.

----

Still, I'm not very interested in running the program, but to
understand the data. Is there no documentation for the format?
Should we write some?

Or is that something you keep internally at Google?


--
 Lars Aronsson ([email protected])
 Aronsson Datateknik - http://aronsson.se


--
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/tesseract-ocr?hl=en.

Re: Danish fraktur support in r319

Reply via email to