Hello all. I'm a computational linguistics graduate student, and I'd like to do some work on Tesseract for credit in a software engineering course. My area of interest is word order modelling, and I believe it can be, and has been, used to improve the accuracy of other OCR systems. As far as I can tell, Tesseract currently has nothing similar, so I'm interested in adding it. Any feedback on that idea would be appreciated.
I'm also completely new to open source, so provided that my general goal appeals, I would appreciate any and all advice on how best to get involved with your community, get myself up to speed technically (I am reading the "Hacking Tesseract" manual currently), avoid stepping on toes, and so on. Thanks.