>I see on the linked Debian bug that sikuli has reported worse performance with >this new series. >Is that not a concern or is their usecase no longer as well supported?
Tesseract upstream is in communication with Sikuli upstream. a 10% drop in recognition performance is considered acceptable by Sikuli upstream. Additionally, future releases of Sikuli may remove that penalty now that the two upstreams are in communication. Here is the relevant quote from Sikuli upstream Tsung-Hsiang (Sean) Chang. "The main reason we aren't not switching to tesseract 3 in an official release is that its recognition performance is worse than 2.04 in our dataset. (Not very bad, about 10% worse as I recall.) So I think it's fine to wrap the tesseract 3 branch for Debian sid." >Could you please give an explicit list of all packages to be synced? Appended. >I must admit to being a bit concerned about the way that ocropus was broken >without apparently warning its maintainer too, especially given that there is >no replacement available yet. This is a reasonable concern. I assume you are referring to http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=659597 >From an etiquette perspective, I have been in email communication (8 threads in the last 16 days) with Jeffrey Ratcliffe, who is the first listed maintainer for both Tesseract and Ocropus. I have been in bug tracking communication (7 bugs) with Jakub Wilk over the same period. Jakub has been incredibly helpful by filing those packaging bugs. That said, my strategy was to - with blessing from my co-maintainer - bring Tesseract 3 into Debian unstable, then find and fix problems as quickly as possible. I apologize for causing surprise. However, that leaves the issue of Ocropus. If Ubuntu 12.04 accepts Tesseract 3, it will lose Ocropus. I respect Ubuntu's decision whichever way it goes. Please consider the number of users affected on either side, and also Ocropus upstream Tom Breuel's comments. "the version of OCRopus that has been packaged is completely outdated. OCRopus is now a set of Python libraries with a little bit of C++ in each. The complete final package structure isn't settled yet, but I want different components to be fairly independent of each other. Now, during my sabbatical, I've finally had time to actually work on it more than just a little on the side. The best thing for Debian probably would be to discontinue the current packaging for OCRopus and start over again when the new release is out." Thank you for your consideration. ========= Full list of source packages to remove: ocropus tesseract-ocr-deu-f Full list of non-source packages to remove (maybe this goes away automatically): tesseract-ocr-dev Full list of source packages to sync (note lack of tesseract-lat-lid): sikuli tesseract tesseract-afr tesseract-ara tesseract-aze tesseract-bel tesseract-ben tesseract-bul tesseract-cat tesseract-ces tesseract-chi-sim tesseract-chi-tra tesseract-chr tesseract-dan tesseract-deu tesseract-deu-frak tesseract-ell tesseract-eng tesseract-enm tesseract-epo tesseract-equ tesseract-est tesseract-eus tesseract-fin tesseract-fra tesseract-frk tesseract-frm tesseract-glg tesseract-heb tesseract-hin tesseract-hrv tesseract-hun tesseract-ind tesseract-isl tesseract-ita tesseract-ita-old tesseract-jpn tesseract-kan tesseract-kor tesseract-lav tesseract-lit tesseract-mal tesseract-mkd tesseract-mlt tesseract-msa tesseract-nld tesseract-nor tesseract-osd tesseract-pol tesseract-por tesseract-ron tesseract-rus tesseract-slk tesseract-slk-frak tesseract-slv tesseract-spa tesseract-spa-old tesseract-sqi tesseract-srp tesseract-swa tesseract-swe tesseract-tam tesseract-tel tesseract-tgl tesseract-tha tesseract-tur tesseract-ukr tesseract-vie ** Bug watch added: Debian Bug tracker #659597 http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=659597 -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/933162 Title: Sync tesseract 3.02.01-1 (universe) from Debian sid (main) To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/tesseract/+bug/933162/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs