On 24 May 2010 17:41, Lars Aronsson <[email protected]> wrote:
> Peter Alberti wrote:
>>>>
>>>> I've trained tesseract r319 (3.0) to support Danish texts written in
>>>> fraktur. It is not
>>>> perfect but good enough that I hope it may be useful to others.
>
> Jimmy O'Regan wrote:
>>
>> With the current SVN version, you can use combine_tessdata -e
>> [trainingdata file] [files to extract] to extract the components you
>> want, and combine_tessdata [path to files] to make a new trainingdata
>> file.
>
> I tried to compile the current version (svn -r354 up), but failed:
>
> svshowim.cpp: In function ‘void sv_show_sub_image(IMAGE*, inT32, inT32,
> inT32, inT32, ScrollView*, inT32, inT32)’:
> svshowim.cpp:37: error: no matching function for call to
> ‘ScrollView::Image(Pix*&, inT32&, int)’
> ../viewer/scrollview.h:266: note: candidates are: void
> ScrollView::Image(const char*, int, int)
>
> Versions 340, 351, 352, 353 also failed in the same place.
>

Looks like a pair of missing casts - have you opened an issue?

> I was successful in compiling -r319, but then combine_tessdata
> doesn't have all these flags.
>
> ----
>
> Still, I'm not very interested in running the program, but to
> understand the data. Is there no documentation for the format?
> Should we write some?
>
> Or is that something you keep internally at Google?
>

Err... I have no affiliation with Google. I periodically read the
commits to projects I'm interested in, because I find that it's often
the only documentation available.

>
> --
>  Lars Aronsson ([email protected])
>  Aronsson Datateknik - http://aronsson.se
>
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To post to this group, send email to [email protected].
> To unsubscribe from this group, send email to
> [email protected].
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en.
>
>



-- 
<Leftmost> jimregan, that's because deep inside you, you are evil.
<Leftmost> Also not-so-deep inside you.

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/tesseract-ocr?hl=en.

Reply via email to