On Sat, Jun 29, 2013 at 4:11 AM, matthew christy <[email protected]>wrote:
> Like a lot of people ( > https://groups.google.com/forum/#!topic/tesseract-ocr/-m3WQ6FAbMw) I've been > trying to get word confidence values to appear in the hOCR output with > Tesseract 3.02.02. Despite reading several posts that indicate that it just > works, I am not getting any x_wconf values in the output. > > I did find this > https://code.google.com/p/tesseract-ocr/issues/detail?id=748 and this > https://code.google.com/p/tesseract-ocr/source/detail?r=806 which > indicate that it isn't working anymore in 3.02. These posts say that I can > patch the code to get it working, but there's no indication of how to apply > the patch. I also thought of just changing the code, but I don't see a > baseapi.cpp file anywhere in my tesseract install folders. So, I'm not sure > what to do now. > > Can anyone tell me: > > Whether word confidence is or isn't working in Tesseract 3.02.02? > Yes it is working and you can get it with tesseract-ocr API. For executable it was inplemented after releasing of version 3.02.02. So if you build current svn you will get it in hOCR output as x_wconf. > Is there some parm or value that goes in my config file besides the > "tessedit_create_hocr 1" that I already have? > "tessedit_create_hocr 1" is only one and correct param to create hOCR output How do I apply a patch? > e.g.: patch -p 0 < your.patch > Why don't I have a baseapi.cpp file in my install? > > Did you install source code? > Any help would be appreciated. > Thanks > > -- > -- > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > To post to this group, send email to [email protected] > To unsubscribe from this group, send email to > [email protected] > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en > > --- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > For more options, visit https://groups.google.com/groups/opt_out. > > > -- -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en --- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/groups/opt_out.

