Re: Is there a way to attach image when posting a discussion?

2011-02-24 Thread Ray Smith
Attachments and files are no longer supported. See http://groups-announcements.blogspot.com/2010/09/notice-about-pages-and-files.html?hl=en The recommendation is to upload to docs or sites and then share with

RE: Wrappers for tessearct3.01?

2011-02-24 Thread Cong Nguyen
Dear devTess, I does not plan to implement delegate event at engine-level, so you should manage your implementations at high-level. Example: string result = _ocrProcessor.Apply(bitmap); List detectedWords = _ocrProcessor.RetriveResultDetail(); // you can raise event/

Issues in using Chinese language in OCR

2011-02-24 Thread devTess
https://lh3.googleusercontent.com/_p-XgqnEJowA/TWZtI-MBneI/ABU/DEWkgFdLUBM/ProblemOCRZh.jpg There are 3 cases in this image. Case 1: a single character is recognized as two overlapping characters. This is why having the character bounding box is important to diagnose the problem case 2:

Is there a way to attach image when posting a discussion?

2011-02-24 Thread devTess
Hi anyone can recommend a way to post a screenshot to discuss problems of OCR? -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to tesseract-ocr@googlegroups.com. To unsubscribe from this group, send email to t

Re: Wrappers for tessearct3.01?

2011-02-24 Thread devTess
Thanks Ray, After getting helps from different people (Dmitry Silaev and Steve Pohorsky (his implementation of tessnet for 3.0), yes, your diagnosis is correct. With a few examples now available, I can see where were my mistakes. Thanks to all. J. -- You received this message because you are s

Re: Wrappers for tessearct3.01?

2011-02-24 Thread devTess
HI Cong Nguyen, Exactly what I need. Would you be implementing delegate event for the monitor class similar to tessnet2. Finally, someone did it. Thanks J. On Feb 21, 10:44 am, Cong Nguyen wrote: > Dear devTess, > > I have just implemented a simple .net wrapper: > > http://code.google.com/p/tess

Re: Customising Tesseract for character recognition

2011-02-24 Thread Jose
Ok I'll try to do that this afternoon. thank you for the help regards, jose -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to tesseract-ocr@googlegroups.com. To unsubscribe from this group, send email to t

Re: Customising Tesseract for character recognition

2011-02-24 Thread Dmitry Silaev
The best way to explain everything would be just to send your source image examples, describe what information you want to get from them and provide the community with the code snippets you use to interface with Tess. And please be as detailed as possible. Warm regards, Dmitry Silaev On Thu,

Re: Customising Tesseract for character recognition

2011-02-24 Thread Jose
In my particular case is just a matter that the first word of each column is in one font and the other is in another so instead of reading column by column it reads all the columns of the first row and then all the columns of the second row! My god is really hard to explain in english. I get an acc

Re: Customising Tesseract for character recognition

2011-02-24 Thread Dmitry Silaev
Unfortunately not only text output order can suffer from Tess's segmentation, but also extents of some text fragments can be identified incorrectly (say one "segmented" row can span over two "real" rows, probably in partial way), and that in turn can lead to *completely* irrelevant recognition resu

Re: Customising Tesseract for character recognition

2011-02-24 Thread Jose
Dmitry the recognition works the only thing is the way it is parsing it... :S I think segmentation of the images would be too much painful! I only won't to change the other that is display or the bounding boxes so I could now the x and y of the word recognized and thereby can organise the results b

Re: Customising Tesseract for character recognition

2011-02-24 Thread Dmitry Silaev
I don't know if it's affordable for you, but imho decent results can only be achieved if you do segmentation yourself and then pass image fragments to Tesseract on a word-by-word basis. Problems may appear when you have words that are too short, however, as I can see, it's not your case. Long time

Re: Customising Tesseract for character recognition

2011-02-24 Thread Jose
Hi, (as you now Saurabh because we talked in private the other day) I tried the PSM_SINGLE_COLUMN and the accuracy drops dramatically... I can't afford to loose that accuracy. Is it possible to change the way the output is display? Looking a the code it seems rather hard to change it... perhaps I c