[tesseract-ocr] Re: Tesseract 3.04 Build Error

2015-06-29 Thread supriya Das
Hello Jeff, Thanks for your reply, I Updated Leptonica version 1.71, after tha i am faceing error like. *LNK1104: cannot open file 'liblept171d.lib*'. So, how to get this lib file? which is not available . Please suggested me. On Monday, 29 June 2015 13:16:53 UTC+5:30, Jeff Breidenbach

[tesseract-ocr] how to train tesseract?

2015-06-29 Thread Cenk KIZILDAĞ
Hi, I would like to recognize only digits with the code sample below: try { System.Drawing.Bitmap imagee=new Bitmap(pictureBox1.Image); System.Drawing.Bitmap

Re: [tesseract-ocr] how to train tesseract?

2015-06-29 Thread Dmitri Silaev
As the first mandatory step you need to do perspective correction, e.g. using paper sheet boundaries (is it a lottery ticket?) Then depending on how it goes further with Tesseract you may need either to: - Train for this particular font - Blur a bit to make characters more fleshy - Scale down

[tesseract-ocr] Re: Text output vs. PDF

2015-06-29 Thread Jeff Breidenbach
Unfortunately, I think there is nothing we can do. I've done everything I can to maximize compatibility with various PDF rendering engines, but Preview uses particularly terrible text extraction heuristics. To be fair, the root problem is the design and complexity of the PDF specification

[tesseract-ocr] Re: Tesseract 3.04 Build Error

2015-06-29 Thread Jeff Breidenbach
You need version 1.71 or later. Current leptonica release is 1.72. -- You received this message because you are subscribed to the Google Groups tesseract-ocr group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To

[tesseract-ocr] Re: jbig2 encoding in PDF output file

2015-06-29 Thread Jeff Breidenbach
Not available currently, and pretty major effort required to make it happen, both in Leptonica and Tesseract's PDF output module. No plans to work on this. For other formats we try hard to not re-encode during PDF generation whenever practical. -- You received this message because you are