Accepted tesseract-afr 3.04.00-1 (source all) into unstable

2015-07-12 Thread Jeff Breidenbach
-By: Jeff Breidenbach j...@debian.org Description: tesseract-ocr-afr - tesseract-ocr language files for Afrikaans Changes: tesseract-afr (3.04.00-1) unstable; urgency=medium . * New upstream release Checksums-Sha1: 29f5b28ced0de23a42cec7acc4789060034c1310 1752 tesseract-afr_3.04.00-1.dsc

Accepted tesseract 3.04.00-1 (source amd64 all) into unstable

2015-07-12 Thread Jeff Breidenbach
Ratcliffe jeffrey.ratcli...@gmail.com Changed-By: Jeff Breidenbach j...@debian.org Description: libtesseract-dev - Development files for the tesseract command line OCR tool libtesseract3 - Tesseract OCR library tesseract-ocr - Tesseract command line OCR tool tesseract-ocr-dev - transitional dummy

Accepted tesseract-ces 3.04.00-1 (source all) into unstable

2015-07-12 Thread Jeff Breidenbach
-By: Jeff Breidenbach j...@debian.org Description: tesseract-ocr-ces - tesseract-ocr language files for Czech Changes: tesseract-ces (3.04.00-1) unstable; urgency=medium . * New upstream release Checksums-Sha1: c8cfb17371c139b72a3f6d38a3986ea75d79d632 1752 tesseract-ces_3.04.00-1.dsc

Accepted tesseract-cat 3.04.00-1 (source all) into unstable

2015-07-12 Thread Jeff Breidenbach
-By: Jeff Breidenbach j...@debian.org Description: tesseract-ocr-cat - tesseract-ocr language files for Catalan Changes: tesseract-cat (3.04.00-1) unstable; urgency=medium . * New upstream release Checksums-Sha1: 9a063296fd5ca16f0f92a64c73a99b2578ae9a4c 1752 tesseract-cat_3.04.00-1.dsc

Accepted tesseract-bel 3.04.00-1 (source all) into unstable

2015-07-12 Thread Jeff Breidenbach
-By: Jeff Breidenbach j...@debian.org Description: tesseract-ocr-bel - tesseract-ocr language files for Belarusian Changes: tesseract-bel (3.04.00-1) unstable; urgency=medium . * New upstream release Checksums-Sha1: a2d2c3b5158b239784b091aec2e0b60919c5d34b 1752 tesseract-bel_3.04.00-1.dsc

Accepted tesseract-ben 3.04.00-1 (source all) into unstable

2015-07-12 Thread Jeff Breidenbach
-By: Jeff Breidenbach j...@debian.org Description: tesseract-ocr-ben - tesseract-ocr language files for Bengali Changes: tesseract-ben (3.04.00-1) unstable; urgency=medium . * New upstream release Checksums-Sha1: 4987fddd6f8f4d76e033733035ae358ab3fe7b0a 1752 tesseract-ben_3.04.00-1.dsc

Accepted tesseract-aze 3.04.00-1 (source all) into unstable

2015-07-12 Thread Jeff Breidenbach
-By: Jeff Breidenbach j...@debian.org Description: tesseract-ocr-aze - tesseract-ocr language files for Azerbaijani Changes: tesseract-aze (3.04.00-1) unstable; urgency=medium . * New upstream release Checksums-Sha1: f27f27bdfa101590976d7f894fdbbef3822c72e6 1752 tesseract-aze_3.04.00-1.dsc

Bug#705502: tesseract: Individual language packages should depend on tesseract-ocr

2015-07-11 Thread Jeff Breidenbach
careful, these three languages can't have this or we get a circular dependency tesseract-ocr-eng tesseract-ocr-osd tesseract-ocr-equ

Bug#779477: tesseract-ocr: Lines are lost, when tesseract destination is pdf

2015-07-11 Thread Jeff Breidenbach
That's weird. Just to make my life a little easier, can you also attach non-PDF output, and specifically identify one of the lines that was eaten by PDF?

Bug#699609: tesseract-ocr: please provide source for language files

2015-07-10 Thread Jeff Breidenbach
Making some progress; they are on github in the 'langdata' repository. Much packaging work still required, particularly around tesstrain.sh

Re: [Gossip] archive not updated for board-disc...@documentfoundation.org mailinglist

2015-06-30 Thread Jeff Breidenbach
The spam filtering service we use (SpamHero) quarantined that message along with some others. I've released the messages from quarantine and also adjusted the whitelist to hopefully reduce or prevent this from happening in the future. I'm sorry about this and we would definitely consider

[tesseract-ocr] Re: Text output vs. PDF

2015-06-29 Thread Jeff Breidenbach
Unfortunately, I think there is nothing we can do. I've done everything I can to maximize compatibility with various PDF rendering engines, but Preview uses particularly terrible text extraction heuristics. To be fair, the root problem is the design and complexity of the PDF specification

[tesseract-ocr] Re: Tesseract 3.04 Build Error

2015-06-29 Thread Jeff Breidenbach
You need version 1.71 or later. Current leptonica release is 1.72. -- You received this message because you are subscribed to the Google Groups tesseract-ocr group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To

[tesseract-ocr] Re: jbig2 encoding in PDF output file

2015-06-29 Thread Jeff Breidenbach
Not available currently, and pretty major effort required to make it happen, both in Leptonica and Tesseract's PDF output module. No plans to work on this. For other formats we try hard to not re-encode during PDF generation whenever practical. -- You received this message because you are

Bug#785000: libwebp: FTBFS on mips: Error: opcode not supported on this processor: mips2...

2015-05-11 Thread Jeff Breidenbach
Sorry, I was under the impression upsteam had integrated the patch. NMU acceptable, or I can do it when I find time.

Bug#785000: libwebp: FTBFS on mips: Error: opcode not supported on this processor: mips2...

2015-05-11 Thread Jeff Breidenbach
Sorry, I was under the impression upsteam had integrated the patch. NMU acceptable, or I can do it when I find time.

Bug#783693: libwebp: no symbols; loose shlibs dependency

2015-04-29 Thread Jeff Breidenbach
Thank you for the investigation, and please NMU. I'm not literally underwater right now, but I'm also not that far off from it.

Bug#783693: libwebp: no symbols; loose shlibs dependency

2015-04-29 Thread Jeff Breidenbach
Thank you for the investigation, and please NMU. I'm not literally underwater right now, but I'm also not that far off from it.

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-18 Thread Jeff Breidenbach
Yes, you can safely leave out To, Message-id, and Received. Consequences are what you'd expect, like the inability to do a message-id search and find that particular message. You are correct. Posting address is manually assigned during the bulk import process, and automatically determined from

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-17 Thread Jeff Breidenbach
The only things indexed for search are: message-id, subject, date (usually extracted from the Recieved: header), sender name (extracted from From: header), posting address (for example, gossip@mail-archive.com), archival message number, and message body. Every message is sorted and organized

Re: [Gossip] Porting digested new list archives to mail-archive

2015-04-14 Thread Jeff Breidenbach
Statute of limitations is typically 3 kilomessages on a normal non-import list, but should (I think) be unlimited on bulk import. Conversion to unix newlines is required and is manual; doesn't matter who does it. Still prefer to do whole import at once especially if tricky; less labor, also less

Accepted leptonlib 1.72-1 (source amd64) into unstable

2015-04-09 Thread Jeff Breidenbach
-BEGIN PGP SIGNED MESSAGE- Hash: SHA256 Format: 1.8 Date: Thu, 09 Apr 2015 11:06:12 -0700 Source: leptonlib Binary: libleptonica-dev liblept4 leptonica-progs Architecture: source amd64 Version: 1.72-1 Distribution: unstable Urgency: medium Maintainer: Jeff Breidenbach j...@debian.org

Accepted libwebp 0.4.3-1 (source amd64) into unstable

2015-03-27 Thread Jeff Breidenbach
-BEGIN PGP SIGNED MESSAGE- Hash: SHA256 Format: 1.8 Date: Fri, 27 Mar 2015 09:43:41 -0700 Source: libwebp Binary: libwebp-dev libwebp5 libwebpmux1 libwebpdemux1 webp Architecture: source amd64 Version: 0.4.3-1 Distribution: unstable Urgency: low Maintainer: Jeff Breidenbach j

Bug#779243: numpy should use OpenBLAS, making it up to 150x faster

2015-02-25 Thread Jeff Breidenbach
Package: python-numpy Version: 1:1.8.2-2 (This problem report was written by Yaroslav Bulatov. I've confirmed it on Debian sid chroot. My computer gave a 30X improvement.) Default numpy install uses inferior BLAS, and is very slow. Matrix multiplication benchmark below gets me 1.26 G

[Python-modules-team] Bug#779243: numpy should use OpenBLAS, making it up to 150x faster

2015-02-25 Thread Jeff Breidenbach
Package: python-numpy Version: 1:1.8.2-2 (This problem report was written by Yaroslav Bulatov. I've confirmed it on Debian sid chroot. My computer gave a 30X improvement.) Default numpy install uses inferior BLAS, and is very slow. Matrix multiplication benchmark below gets me 1.26 G

[Python-modules-team] Fwd: numpy should use OpenBLAS, making it up to 150x faster

2015-02-25 Thread Jeff Breidenbach
Hopefully I submitted this bug report correctly. If not, here is a direct report. -- Forwarded message -- From: Jeff Breidenbach j...@jab.org Date: Wed, Feb 25, 2015 at 11:52 AM Subject: numpy should use OpenBLAS, making it up to 150x faster To: Debian Bug Tracking System sub

Re: [Mailman-Developers] Support for X-No-Archive

2015-02-12 Thread Jeff Breidenbach
I checked and these two lists are responsible. One uses 'No' and the other uses 'no' for every message. I can't speak to the intention. There are an impressive number of headers on each message including DKIM but I don't see a clue as to the list server software.

Re: [Mailman-Developers] X-Message-ID-Hash header (was Re: Python 3)

2015-01-04 Thread Jeff Breidenbach
I should probably report the experience of mail-archive.com, which has supported a homebrew message-id hash algorithm for over 6 years. Only one organization has ever used it (LibreOffice). They put it in the Archived-At headers, not message footers. However, links did end up getting manually

Re: mhonarc for Debian

2014-11-08 Thread Jeff Breidenbach
I'm the Debian package maintainer for mhonarc. This is fixed in Jessie and you might want to try installing the package from Jessie. (This usually doesn't work without backporting, but it might work for mhonarc.) https://packages.qa.debian.org/m/mhonarc.html

Bug#767462: libwebp: please raise the priority to optional

2014-11-02 Thread Jeff Breidenbach
Sounds good to me. It is amazing how quickly webp has established itself. As always, zero delay NMU for uncontroversial changes are fine with me. Otherwise I will do this when I have a chance.

Bug#764301: Please provide a backport for Wheezy

2014-10-12 Thread Jeff Breidenbach
I don't know how to do a backport. I guess I could build a wheezy package with pbuilder, but no idea what I'd do with it next. Are backports usually done by the package maintainer, or someone else?

Bug#762675: leptonlib: Build leptonlib against openjpeg 2.1

2014-09-24 Thread Jeff Breidenbach
This is terrific, NMU is acceptable without any delay. However, please confirm that the resulting Leptonica package actually has a dependency on libopenjpeg! I mention this because configure.ac in Leptonica upstream mentions libopenjp2 and I think that still needs to be libopenjpeg. So we may

[tesseract-ocr] Re: compile error under ubuntu 14.04

2014-09-09 Thread Jeff Breidenbach
This error comes from Leptonica 1.70. Tesseract now requires Leptonica 1.71. Leptonica 1.71 can be installed manually (but not so easily) and will ship with Ubuntu for their 14.10 release scheduled for October 23 of this year. -- You received this message because you are subscribed to the

Accepted leptonlib 1.71-2 (source amd64) into unstable

2014-09-08 Thread Jeff Breidenbach
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Format: 1.8 Date: Fri, 05 Sep 2014 17:50:11 -0700 Source: leptonlib Binary: libleptonica-dev liblept4 leptonica-progs Architecture: source amd64 Version: 1.71-2 Distribution: unstable Urgency: low Maintainer: Jeff Breidenbach j...@debian.org Changed

add multithread indexing sample?

2014-08-16 Thread Jeff Breidenbach
Does it make sense to add a multithreaded indexing example to the samples directory? Mike McCandles has such fun graphs, and it would be fun to chase them in python. http://blog.mikemccandless.com/2011/05/265-indexing-speedup-with-lucenes.html

Bug#757785: libwebp FTBFS on arm64, internal compiler errors

2014-08-11 Thread Jeff Breidenbach
Upstream says: GCC 4.9 strikes again. CFLAGS='-O2 -frename-registers' should do [the trick]. I'm generally comfortable with NMU high urgency situations like this one. I prefer to keep the optimizations if possible, because they are substantial. * NEON assembly additions: - ~25% faster

Bug#757785: libwebp FTBFS on arm64, internal compiler errors

2014-08-11 Thread Jeff Breidenbach
I'm quite happy to have NMU, with no delay necessary. It especially makes sense since I'm not set up to test on ARM. Also, upstream has some ideas. I'll see if they are amenable to direct contact. PS. How the heck did webp become bootstrap critical?

[tesseract-ocr] Re: [tesseract-dev] Re: Training tools linking failure, icu_48::*

2014-08-01 Thread Jeff Breidenbach
Done. Bonus points if someone can remember to remove the instructions when they become obsolete in October. -- You received this message because you are subscribed to the Google Groups tesseract-ocr group. To unsubscribe from this group and stop receiving emails from it, send an email to

Accepted libwebp 0.4.1-1 (source amd64) into unstable

2014-07-30 Thread Jeff Breidenbach
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Format: 1.8 Date: Wed, 30 Jul 2014 16:38:48 -0700 Source: libwebp Binary: libwebp-dev libwebp5 libwebpmux1 libwebpdemux1 webp Architecture: source amd64 Version: 0.4.1-1 Distribution: unstable Urgency: medium Maintainer: Jeff Breidenbach j

Accepted leptonlib 1.71-1 (source amd64)

2014-07-24 Thread Jeff Breidenbach
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Format: 1.8 Date: Thu, 24 Jul 2014 13:00:21 -0700 Source: leptonlib Binary: libleptonica-dev liblept4 leptonica-progs Architecture: source amd64 Version: 1.71-1 Distribution: unstable Urgency: low Maintainer: Jeff Breidenbach j...@debian.org Changed

Re: [sane-devel] scanimage / tesseract interoperability

2014-07-15 Thread Jeff Breidenbach
Very much appreciated. -Jeff -- sane-devel mailing list: sane-devel@lists.alioth.debian.org http://lists.alioth.debian.org/cgi-bin/mailman/listinfo/sane-devel Unsubscribe: Send mail with subject unsubscribe your_password to sane-devel-requ...@lists.alioth.debian.org

Re: [sane-devel] scanimage / tesseract interoperability

2014-07-14 Thread Jeff Breidenbach
Revised so printing filenames to stdout is optional and defaults to off. The new option is --batch-print. Please consider applying along with happy-batch-1.1.gz names-to-stdout-1.4.diff.gz Description: GNU Zip compressed data -- sane-devel mailing list: sane-devel@lists.alioth.debian.org

Bug#736036: upgrading to serious: libtiff4-dev is being removed

2014-07-05 Thread Jeff Breidenbach
Thanks for escalating, I will attempt fix well before autoremoval deadline. -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org

Bug#736036: upgrading to serious: libtiff4-dev is being removed

2014-07-05 Thread Jeff Breidenbach
Thanks for escalating, I will attempt fix well before autoremoval deadline. -- To UNSUBSCRIBE, email to debian-bugs-rc-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org

Re: [sane-devel] scanimage / tesseract interoperability

2014-06-17 Thread Jeff Breidenbach
Friendly reminder. -- sane-devel mailing list: sane-devel@lists.alioth.debian.org http://lists.alioth.debian.org/cgi-bin/mailman/listinfo/sane-devel Unsubscribe: Send mail with subject unsubscribe your_password to sane-devel-requ...@lists.alioth.debian.org

Bug#750632: (no subject)

2014-06-06 Thread Jeff Breidenbach
Thanks. NMU is acceptable, otherwise I'll get to it as soon as I get a chance. -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org

Re: [sane-devel] scanimage / tesseract interoperability

2014-06-05 Thread Jeff Breidenbach
Is there anything more that I can do to help? -- sane-devel mailing list: sane-devel@lists.alioth.debian.org http://lists.alioth.debian.org/cgi-bin/mailman/listinfo/sane-devel Unsubscribe: Send mail with subject unsubscribe your_password to sane-devel-requ...@lists.alioth.debian.org

Bug#749953: pkg-config file missing

2014-05-30 Thread Jeff Breidenbach
Will add to next release (coming up over the next month or two) -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org

Re: [sane-devel] scanimage / tesseract interoperability

2014-05-30 Thread Jeff Breidenbach
Thank you for the review, Olaf. I've incorporated both of your suggestions. Jeff On Thu, May 29, 2014 at 4:49 PM, Olaf Meeuwissen olaf.meeuwis...@avasys.jp wrote: Jeff Breidenbach writes: Are these two patches on track for inclusion? What more can I do to help? names-to-stdout-1.2.diff.gz

Re: [sane-devel] scanimage / tesseract interoperability

2014-05-19 Thread Jeff Breidenbach
Implementation was a little intrusive because there is no recovery from calling freopen() on stdout. This preliminary patch follows the recommendations of the C FAQ and introduces an explicit stream variable. I've only done light testing. http://c-faq.com/stdio/undofreopen.html $ scanimage

Re: [sane-devel] scanimage / tesseract interoperability

2014-05-19 Thread Jeff Breidenbach
Implemented and tested. Please consider for inclusion. happy-batch-1.0.diff.gz Description: GNU Zip compressed data -- sane-devel mailing list: sane-devel@lists.alioth.debian.org http://lists.alioth.debian.org/cgi-bin/mailman/listinfo/sane-devel Unsubscribe: Send mail with subject unsubscribe

Re: [sane-devel] scanimage / tesseract interoperability

2014-05-19 Thread Jeff Breidenbach
Testing found an error path with a double fclose. Tiny tweak to make that impossible. - if (0 != fclose(ofp)) + if (!ofp || 0 != fclose(ofp)) names-to-stdout-1.2.diff.gz Description: GNU Zip compressed data -- sane-devel mailing list: sane-devel@lists.alioth.debian.org

Re: [sane-devel] scanimage / tesseract interoperability

2014-05-15 Thread Jeff Breidenbach
When I run scanimage on a Fujitsu S1500, the program is a little unhappy even after normal operation, note the return code. This is not great for pipelines. Should I attempt a fix? This is version 1.0.23-3ubuntu3 on the latest Ubuntu release. Sorry, I haven't yet figured out how to configure a

Re: [sane-devel] scanimage / tesseract interoperability

2014-05-12 Thread Jeff Breidenbach
Thank you, Allan. Tesseract will also accept image data directly on stdin, so single scan mode should work just fine. I think it is cleaner to use stdout as opposed to stderr for filenames. I will work on a patch. There is one possible alternative. Scanimage could emit image data to stdout using

Re: [sane-devel] scanimage / tesseract interoperability

2014-05-11 Thread Jeff Breidenbach
Jeffrey, gscan2pdf is terrific and I expect most users will prefer the friendly graphical user interface. I would love to compare notes with you on PDF generation nuances, and also coordinate with respect to future Tesseract releases. It's also good have the command line programs connect together

Re: [sane-devel] scanimage / tesseract interoperability

2014-05-10 Thread Jeff Breidenbach
Thank you Simon, the --batch-script feature looks very flexible. It almost does the trick: scanimage --batch --batch-script (echo) | tesseract - - Unfortunately, it runs just a little too early. The script executes just before the temporary image file is renamed. Tesseract wants to know about

Accepted mhonarc 2.6.19-1 (source all)

2014-05-06 Thread Jeff Breidenbach
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Format: 1.8 Date: Mon, 05 May 2014 23:08:19 -0700 Source: mhonarc Binary: mhonarc Architecture: source all Version: 2.6.19-1 Distribution: unstable Urgency: low Maintainer: Jeff Breidenbach j...@debian.org Changed-By: Jeff Breidenbach j...@debian.org

[Gossip] experimental search interface, feedback requested

2014-04-27 Thread Jeff Breidenbach
We're experimenting with a new user interface for search. It works a little differently, what do people think? To try this on your own list, just replace search in the URL with searchdev. Cheers, Jeff === OLD http://www.mail-archive.com/search?q=squirrell=cayugabirds-l%40cornell.edu NEW

[Gossip] 2014 spring update

2014-04-13 Thread Jeff Breidenbach
Happy Spring everyone. Here are some updates for The Mail Archive. Search is about 10 times faster than before. This is due to a complete rewrite that shaves off a ton of initialization time. I'm really happy about this. As an experiment, we're changing the way we serve ads. Previously, direct

having trouble suppressing inline images

2014-03-30 Thread Jeff Breidenbach
I'm having trouble suppressing attachments. For example, I'd like all jpeg images to be discarded. My best efforts with MIMEFilters, m2h_null::filter, MIMEArgs, excludeexts are insufficient. I've placed everything needed to reproduce the problem here. Thoughts greatly appreciated.

Accepted tesseract 3.03.03-1 (source all amd64)

2014-03-28 Thread Jeff Breidenbach
Ratcliffe jeffrey.ratcli...@gmail.com Changed-By: Jeff Breidenbach j...@debian.org Description: libtesseract-dev - Development files for the tesseract command line OCR tool libtesseract3 - Command line OCR tool tesseract-ocr - Command line OCR tool tesseract-ocr-dev - transitional dummy package

Bug#742027: tesseract-ocr: tesseract doesn't start

2014-03-24 Thread Jeff Breidenbach
Package: tesseract-ocr Version: 3.03.02-3 I can't reproduce this. Please run ldd and md5sum on /usr/bin/tesseract and report the results. === $ curl http://ftp.us.debian.org/debian/pool/main/t/tesseract/tesseract-ocr_3.03.02-3_i386.deb foo.deb $ ar x foo.deb $ tar xJvf data.tar.xz $ ldd

Bug#742029: tesseract-ocr: Trainingtools missing in SID version (3.03.02-3)

2014-03-24 Thread Jeff Breidenbach
Package: tesseract-ocr Version: 3.03.02-3 Problem confirmed, working on fix.

Bug#742027: tesseract-ocr: tesseract doesn't start

2014-03-24 Thread Jeff Breidenbach
Package: tesseract-ocr Version: 3.03.02-3 I can't reproduce this. Please run ldd and md5sum on /usr/bin/tesseract and report the results. === $ curl http://ftp.us.debian.org/debian/pool/main/t/tesseract/tesseract-ocr_3.03.02-3_i386.deb foo.deb $ ar x foo.deb $ tar xJvf data.tar.xz $ ldd

Bug#742027: tesseract-ocr: tesseract doesn't start

2014-03-19 Thread Jeff Breidenbach
Package: tesseract-ocr Version: 3.03.02-3 This is unexpected. The build dependency is on libleptonica-dev (= 1.70~) which is leptonlib4. I don't see how or where a leptonlib3 could be sneaking in.

Bug#742027: tesseract-ocr: tesseract doesn't start

2014-03-19 Thread Jeff Breidenbach
Package: tesseract-ocr Version: 3.03.02-3 This is unexpected. The build dependency is on libleptonica-dev (= 1.70~) which is leptonlib4. I don't see how or where a leptonlib3 could be sneaking in.

Accepted tesseract 3.03.02-3 (source all amd64)

2014-02-07 Thread Jeff Breidenbach
Ratcliffe jeffrey.ratcli...@gmail.com Changed-By: Jeff Breidenbach j...@debian.org Description: libtesseract-dev - Development files for the tesseract command line OCR tool libtesseract3 - Command line OCR tool tesseract-ocr - Command line OCR tool tesseract-ocr-dev - transitional dummy package

Re: hocr2pdf and arabic language

2014-02-06 Thread Jeff Breidenbach
I've merged Nick White's bugfix into hocr-tools. Thank you, Nick. I expect most people will instead use the native PDF support built into Tesseract henceforth, and I intend to focus most of my time and energy there. However, there is still some use for hocr-pdf, especially when working with

Re: hocr2pdf and arabic language

2014-02-06 Thread Jeff Breidenbach
As for Arabic and other right-to-left scripts, please try using the new native PDF capability in Tesseract instead. It is significantly more sophisticated and I think it should work correctly. -- -- You received this message because you are subscribed to the Google Groups tesseract-ocr group.

Re: hocr2pdf and arabic language

2014-02-06 Thread Jeff Breidenbach
I don't know, it is up to Ray. My guess is quite soon. In any case, I just ran on your example images, noticed a small problem, and fixed it. Thank you for providing them. I should also mention that there is no need to convert your binary images to JPEG when using Tesseract's native PDF

Accepted leptonlib 1.70.1-1 (source amd64)

2014-02-05 Thread Jeff Breidenbach
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Format: 1.8 Date: Wed, 05 Feb 2014 17:16:29 -0800 Source: leptonlib Binary: libleptonica-dev liblept4 leptonica-progs Architecture: source amd64 Version: 1.70.1-1 Distribution: unstable Urgency: low Maintainer: Jeff Breidenbach j...@debian.org Changed

Accepted tesseract 3.03.02-2 (source all amd64)

2014-02-05 Thread Jeff Breidenbach
Ratcliffe jeffrey.ratcli...@gmail.com Changed-By: Jeff Breidenbach j...@debian.org Description: libtesseract-dev - Development files for the tesseract command line OCR tool libtesseract3 - Command line OCR tool tesseract-ocr - Command line OCR tool tesseract-ocr-dev - transitional dummy package

Accepted tesseract 3.03.02-1 (source all amd64)

2014-02-04 Thread Jeff Breidenbach
Ratcliffe jeffrey.ratcli...@gmail.com Changed-By: Jeff Breidenbach j...@debian.org Description: libtesseract-dev - Development files for the tesseract command line OCR tool libtesseract3 - Command line OCR tool tesseract-ocr - Command line OCR tool tesseract-ocr-dev - transitional dummy package

Bug#737481: tesseract: undefined symbol: _Z16tprintf_internalPKcz

2014-02-04 Thread Jeff Breidenbach
Thank you for the problem report. I will adjust the dependency.

Bug#737481: tesseract: undefined symbol: _Z16tprintf_internalPKcz

2014-02-04 Thread Jeff Breidenbach
Thank you for the problem report. I will adjust the dependency.

Accepted leptonlib 1.70-2 (source amd64)

2014-02-03 Thread Jeff Breidenbach
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Format: 1.8 Date: Mon, 03 Feb 2014 10:17:20 -0800 Source: leptonlib Binary: libleptonica-dev liblept4 leptonica-progs Architecture: source amd64 Version: 1.70-2 Distribution: unstable Urgency: low Maintainer: Jeff Breidenbach j...@debian.org Changed

Accepted tesseract 3.03.01-1 (source all amd64)

2014-02-03 Thread Jeff Breidenbach
Ratcliffe jeffrey.ratcli...@gmail.com Changed-By: Jeff Breidenbach j...@debian.org Description: libtesseract-dev - Development files for the tesseract command line OCR tool libtesseract3 - Command line OCR tool tesseract-ocr - Command line OCR tool tesseract-ocr-dev - transitional dummy package

Re: hocr2pdf and arabic language

2014-01-27 Thread Jeff Breidenbach
I am the author of the hocr2pdf utility. Thank you for the patch, I'll merge it some time next week. This week my focus is fixing some problem reports with the new native PDF output capability for Tesseract. Jeff -- -- You received this message because you are subscribed to the Google Groups

Accepted leptonlib 1.70-1 (source amd64)

2014-01-23 Thread Jeff Breidenbach
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Format: 1.8 Date: Thu, 23 Jan 2014 13:29:23 -0800 Source: leptonlib Binary: libleptonica-dev liblept4 leptonica-progs Architecture: source amd64 Version: 1.70-1 Distribution: experimental Urgency: low Maintainer: Jeff Breidenbach j...@debian.org

Accepted leptonlib 1.69-5 (source amd64)

2014-01-23 Thread Jeff Breidenbach
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Format: 1.8 Date: Thu, 23 Jan 2014 14:47:30 -0800 Source: leptonlib Binary: libleptonica-dev liblept3 leptonica-progs Architecture: source amd64 Version: 1.69-5 Distribution: unstable Urgency: low Maintainer: Jeff Breidenbach j...@debian.org Changed

Bug#735510: transition: tesseract

2014-01-23 Thread Jeff Breidenbach
Upstream has looked at compatibility and decided to not bump soname for this package. I'll give things a week to settle down and then this bug is ready for closure.

Bug#735510: transition: tesseract

2014-01-23 Thread Jeff Breidenbach
Upstream has looked at compatibility and decided to not bump soname for this package. I'll give things a week to settle down and then this bug is ready for closure.

Accepted perceptualdiff 1.1.1-2 (source amd64)

2014-01-21 Thread Jeff Breidenbach
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Format: 1.8 Date: Tue, 21 Jan 2014 09:00:14 -0800 Source: perceptualdiff Binary: perceptualdiff Architecture: source amd64 Version: 1.1.1-2 Distribution: unstable Urgency: low Maintainer: Jeff Breidenbach j...@debian.org Changed-By: Jeff Breidenbach j

Bug#673934: perceptualdiff: Consider switching to OpenMP version

2014-01-21 Thread Jeff Breidenbach
I would be happy to cede this package to someone with the energy to make the switch.

Bug#731168: transition: libwebp

2014-01-20 Thread Jeff Breidenbach
Thank you, Julien. I'll talk to upstream and get a few more details. They definitely bumped the soname for their release, and probably for a good reason. The libwebp upstream folks generally have their act together.

Bug#731168: transition: libwebp

2014-01-20 Thread Jeff Breidenbach
Thank you, Julien. I'll talk to upstream and get a few more details. They definitely bumped the soname for their release, and probably for a good reason. The libwebp upstream folks generally have their act together.

Accepted tesseract 3.03.00-1 (source all amd64)

2014-01-17 Thread Jeff Breidenbach
Ratcliffe jeffrey.ratcli...@gmail.com Changed-By: Jeff Breidenbach j...@debian.org Description: libtesseract-dev - Development files for the tesseract command line OCR tool libtesseract3 - Command line OCR tool tesseract-ocr - Command line OCR tool tesseract-ocr-dev - transitional dummy package

Bug#735509: transition: leptonlib

2014-01-15 Thread Jeff Breidenbach
Package: release.debian.org Severity: normal User: release.debian@packages.debian.org Usertags: transition Leptonica upstream is releasing a new version that will have an increased soname (liblept3 - liblept4). No exotic challenges expected. -- System Information: Debian Release: wheezy/sid

Bug#735510: transition: tesseract

2014-01-15 Thread Jeff Breidenbach
Package: release.debian.org Severity: normal User: release.debian@packages.debian.org Usertags: transition Tesseract upstream is releasing a new version that will have an increased soname (libtesseact3 - libtesseract4). No exotic challenges expected. -- System Information: Debian Release:

Bug#735509: transition: leptonlib

2014-01-15 Thread Jeff Breidenbach
Package: release.debian.org Severity: normal User: release.debian@packages.debian.org Usertags: transition Leptonica upstream is releasing a new version that will have an increased soname (liblept3 - liblept4). No exotic challenges expected. -- System Information: Debian Release: wheezy/sid

Bug#735510: transition: tesseract

2014-01-15 Thread Jeff Breidenbach
Package: release.debian.org Severity: normal User: release.debian@packages.debian.org Usertags: transition Tesseract upstream is releasing a new version that will have an increased soname (libtesseact3 - libtesseract4). No exotic challenges expected. -- System Information: Debian Release:

Accepted leptonlib 1.69+1.70rc2-1 (source amd64)

2014-01-14 Thread Jeff Breidenbach
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Format: 1.8 Date: Tue, 14 Jan 2014 12:22:14 -0800 Source: leptonlib Binary: libleptonica-dev liblept4 leptonica-progs Architecture: source amd64 Version: 1.69+1.70rc2-1 Distribution: experimental Urgency: low Maintainer: Jeff Breidenbach j

Accepted leptonlib 1.69+1.70rc1-2 (source amd64)

2014-01-13 Thread Jeff Breidenbach
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Format: 1.8 Date: Mon, 13 Jan 2014 09:06:19 -0800 Source: leptonlib Binary: libleptonica-dev liblept4 leptonica-progs Architecture: source amd64 Version: 1.69+1.70rc1-2 Distribution: experimental Urgency: low Maintainer: Jeff Breidenbach j

Accepted libwebp 0.4.0-4 (source amd64)

2014-01-13 Thread Jeff Breidenbach
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Format: 1.8 Date: Mon, 13 Jan 2014 10:07:43 -0800 Source: libwebp Binary: libwebp-dev libwebp5 libwebpmux1 libwebpdemux1 webp Architecture: source amd64 Version: 0.4.0-4 Distribution: unstable Urgency: low Maintainer: Jeff Breidenbach j...@debian.org

Bug#735103: update of debian/copyright

2014-01-13 Thread Jeff Breidenbach
Thanks. Fixing debian/copyright today. Upstream is cleaning up the oddball headers in three files and that will be part of the version 1.70 final, to be released next week.

Accepted leptonlib 1.69+1.70rc1-1 (source amd64)

2014-01-12 Thread Jeff Breidenbach
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Format: 1.8 Date: Mon, 06 Jan 2014 10:03:38 -0800 Source: leptonlib Binary: libleptonica-dev liblept4 leptonica-progs Architecture: source amd64 Version: 1.69+1.70rc1-1 Distribution: experimental Urgency: low Maintainer: Jeff Breidenbach j

Accepted libwebp 0.4.0-3 (source amd64)

2014-01-09 Thread Jeff Breidenbach
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Format: 1.8 Date: Wed, 08 Jan 2014 10:15:12 -0800 Source: libwebp Binary: libwebp-dev libwebp5 libwebpmux1 libwebpdemux1 webp Architecture: source amd64 Version: 0.4.0-3 Distribution: experimental Urgency: low Maintainer: Jeff Breidenbach j

Re: [Gossip] Temporary archive dysfunction

2014-01-05 Thread Jeff Breidenbach
After investigation, this turned out to be an issue with an X-No-Archive: Yes header on the list itself. Cheers, Jeff ___ Gossip mailing list https://www.mail-archive.com/gossip@mail-archive.com http://mail-archive.com/cgi-bin/mailman/options/gossip

Accepted libwebp 0.4.0-2 (source amd64)

2014-01-04 Thread Jeff Breidenbach
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Format: 1.8 Date: Fri, 03 Jan 2014 13:28:23 -0800 Source: libwebp Binary: libwebp-dev libwebp5 libwebpmux1 webp Architecture: source amd64 Version: 0.4.0-2 Distribution: experimental Urgency: low Maintainer: Jeff Breidenbach j...@debian.org Changed

Bug#697966: leptonica-progs: 'man leptonica' oversells command line help.

2014-01-03 Thread Jeff Breidenbach
Thank you for this report. I talked to upstream. First, the grep you used is incorrect; sometimes the help info is in the library but in main(). Second, we'll tone down the level of confidence in the manpage and increase the ratio of help in the next release, which is coming soon.

<    1   2   3   4   5   6   7   8   9   10   >