[sane-devel] [ANN] Unpaper - post-processing scanned book-pages
Jens Gulden m...@jensgulden.de wrote: Hi Jens, unpaper removes black edges and other photocopy artefacts from scanned images. It also deskews book pages (auto-rotates them to a straight alignment), and centers them on the sheet. Old photocopies can become well-readable PDFs again. Unpaper seems to have endianness issues; running it on my powerpc gives me a white page as the output, no problem on an i386 machine though. I'll see if I can come up with something. I'm interested in adding jpeg support to unpaper, although convert does the trick for now. JB. -- Julien BLACHE http://www.jblache.org j...@jblache.org GPG KeyID 0xF5D65169
[sane-devel] [ANN] Unpaper - post-processing scanned book-pages
On Thu, Mar 10, 2005 at 07:58:39PM +0100, Julien BLACHE wrote: Unpaper seems to have endianness issues; running it on my powerpc gives me a white page as the output, no problem on an i386 machine though. I'll see if I can come up with something. I get white pages as output on my i686 machine unless I use --no-border-scan. I haven't analyzed why - and I'm not sure whether running unpaper with all default arguments is even expected to work. -- Peter Fales Peter at fales-lorenz.net
[sane-devel] [ANN] Unpaper - post-processing scanned book-pages
Hello all, Gerard Klaver wrote: My proposal/idea would be to place it in a sane-tools package. This would be a new package with for the time being only the unpaper program. (...) I had the idea that Jens (the developer of the unpaper program) is also the maintainer for unpaper in SANE CVS. So SANE CVS contains the new version. Before starting the project site on BerliOS, I actually thought of joining the SANE project somehow. But looking at the contents of the current sane package, I found it off-topic to place an application like unpaper there. Actually, Gerard's suggestion is exactly what I originally looked for, but didn't find. Oliver Rauch wrote: It only makes sense to do this when these packages have no other homepages. I would like to close the BerliOS project again, because I feel such a single tool is too fine-grained for a whole open-source project of its own (and even more so for having its own Debian package, as I think would be implied by Johannes Berg's idea of sane-tools as a meta-package). As I already wrote in the announcement: Almost too small for being an open-source project on its own. So, I would like to follow Gerard's suggestion. Maintaining unpaper in the CVS tree will be OK for me, as it is only a single .c source file and a man page. Jens (I won't be in town until Thursday, so please don't expect a quick response in the next few days.)
[sane-devel] [ANN] Unpaper - post-processing scanned book-pages
On Wed, 2005-03-02 at 16:03 +0100, Jens Gulden wrote: Hello, unpaper removes black edges and other photocopy artefacts from scanned images. It also deskews book pages (auto-rotates them to a straight alignment), and centers them on the sheet. Old photocopies can become well-readable PDFs again. Available at http://unpaper.berlios.de/. Hope it's useful. Enjoy, Jens From the Readme: -- unpaper is a post-processing tool for scanned sheets of paper, especially for book pages that have been scanned from previously created photocopies. The main purpose is to make scanned book pages better readable on screen after conversion to PDF. Additionally, unpaper might be useful to enhance the quality of scanned pages before performing optical character recognition (OCR). unpaper tries to clean scanned images by removing dark edges that appeared through scanning or copying on areas outside the actual page content (e.g. dark areas between the left-hand-side and the right-hand-side of a double-sided book-page scan). The program also tries to detect disaligned centering and rotation of pages and will automatically straighten each page by rotating it to the correct angle. This is called deskewing. Note that the automatic processing will sometimes fail. It is always a good idea to manually control the results of unpaper and adjust the parameter settings according to the requirements of the input. Each processing step can also be disabled individually for each sheet. Input and output files can be in either .pbm or .pgm format, as also used by the Linux scanning tools scanimage and scanadf. Conversion to PDF can e.g. be achieved with the Linux tools pgm2tiff, tiffcp and tiff2pdf. -- (It's a small program with 1 single source file only. Almost too small for being an open-source project on its own. If you have ideas to integrate it into other Linux scanning/graphics projects instead, please let me know.) Hello Jens, My proposal/idea would be to place it in a sane-tools package. This would be a new package containing, for the time being, only the unpaper program. Also, if there are other tools related to scanners or webcams, they would have a place to be kept. For example the lprof package, see: gkall.hobby.nl/sane-lprof/sane-lprof.html And maybe a place to keep the ICC profiles from scanners? Other packages? In this way the package could maybe also be used as a plug-in for some scanner frontend programs? This is the first email about this proposal on the SANE mailing list, so if anybody has comments/ideas? -- Kind regards, Gerard Klaver
[sane-devel] [ANN] Unpaper - post-processing scanned book-pages
On Sat, 2005-03-05 at 15:50, gerard klaver wrote: On Wed, 2005-03-02 at 16:03 +0100, Jens Gulden wrote: Hello, unpaper removes black edges and other photocopy artefacts from scanned images. It also deskews book pages (auto-rotates them to a straight alignment), and centers them on the sheet. Old photocopies can become well-readable PDFs again. Hello Jens, My proposal/idea would be to place it in a sane-tools package. Hello. It only makes sense to do this when these packages have no other homepages. I think it is better when a package has its own home and we set a link to such tools, then we can be sure that we don't handle old versions. Oliver
[sane-devel] [ANN] Unpaper - post-processing scanned book-pages
On Sat, 2005-03-05 at 16:18 +0100, Oliver Rauch wrote: On Sat, 2005-03-05 at 15:50, gerard klaver wrote: On Wed, 2005-03-02 at 16:03 +0100, Jens Gulden wrote: Hello, unpaper removes black edges and other photocopy artefacts from scanned images. It also deskews book pages (auto-rotates them to a straight alignment), and centers them on the sheet. Old photocopies can become well-readable PDFs again. Hello Jens, My proposal/idea would be to place it in a sane-tools package. Hello. It only makes sense to do this when these packages have no other homepages. I think it is better when a package has its own home and we set a link to such tools, then we can be sure that we don't handle old versions. Oliver I had the idea that Jens (the developer of the unpaper program) is also the maintainer for unpaper in SANE CVS. So SANE CVS contains the new version. -- Kind regards, Gerard Klaver
[sane-devel] [ANN] Unpaper - post-processing scanned book-pages
On Sat, 2005-03-05 at 16:31 +0100, gerard klaver wrote: I had the idea that Jens (the developer of the unpaper program) is also the maintainer for unpaper in SANE CVS. So SANE CVS contains the new version. Incorporating software like that is almost always a bad idea -- it ties release management and other things too closely together, even if the sources are maintained in a single place. I agree with Oliver that it'd be nice to have a link to this, and Debian (and other systems) could adopt a sane-tools meta-package that would depend on the extra tools. johannes
[sane-devel] [ANN] Unpaper - post-processing scanned book-pages
Hello, unpaper removes black edges and other photocopy artefacts from scanned images. It also deskews book pages (auto-rotates them to a straight alignment), and centers them on the sheet. Old photocopies can become well-readable PDFs again. Available at http://unpaper.berlios.de/. Hope it's useful. Enjoy, Jens From the Readme: -- unpaper is a post-processing tool for scanned sheets of paper, especially for book pages that have been scanned from previously created photocopies. The main purpose is to make scanned book pages better readable on screen after conversion to PDF. Additionally, unpaper might be useful to enhance the quality of scanned pages before performing optical character recognition (OCR). unpaper tries to clean scanned images by removing dark edges that appeared through scanning or copying on areas outside the actual page content (e.g. dark areas between the left-hand-side and the right-hand-side of a double-sided book-page scan). The program also tries to detect disaligned centering and rotation of pages and will automatically straighten each page by rotating it to the correct angle. This is called deskewing. Note that the automatic processing will sometimes fail. It is always a good idea to manually control the results of unpaper and adjust the parameter settings according to the requirements of the input. Each processing step can also be disabled individually for each sheet. Input and output files can be in either .pbm or .pgm format, as also used by the Linux scanning tools scanimage and scanadf. Conversion to PDF can e.g. be achieved with the Linux tools pgm2tiff, tiffcp and tiff2pdf. -- (It's a small program with 1 single source file only. Almost too small for being an open-source project on its own. If you have ideas to integrate it into other Linux scanning/graphics projects instead, please let me know.)
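For readers wondering what "removing dark edges" could look like in practice, here is a minimal Python sketch. It is not unpaper's actual algorithm, which the Readme does not describe. The function name clear_dark_borders, the threshold of 64, and the list-of-rows image representation are all assumptions made for illustration. The idea is to walk inward from each edge of a grayscale image and paint rows and columns white for as long as their average brightness stays below the threshold.

```python
from typing import List

# Hypothetical representation: rows of grayscale values, 0 = black, 255 = white.
Image = List[List[int]]


def _mean(values: List[int]) -> float:
    return sum(values) / len(values)


def clear_dark_borders(img: Image, threshold: int = 64, white: int = 255) -> Image:
    """Return a copy of img with dark edge rows/columns painted white.

    Illustrative only: darkness is judged on the ORIGINAL image so that
    whitening rows does not change the decision made for columns.
    """
    height, width = len(img), len(img[0])
    out = [row[:] for row in img]

    # Rows: scan from the top edge down, then from the bottom edge up.
    for ys in (range(height), range(height - 1, -1, -1)):
        for y in ys:
            if _mean(img[y]) >= threshold:
                break  # first non-dark row reached, stop at this edge
            out[y] = [white] * width

    # Columns: scan from the left edge inward, then from the right edge inward.
    for xs in (range(width), range(width - 1, -1, -1)):
        for x in xs:
            if _mean([img[y][x] for y in range(height)]) >= threshold:
                break  # first non-dark column reached
            for y in range(height):
                out[y][x] = white

    return out


if __name__ == "__main__":
    # A 5x5 page with a one-pixel black frame and one dark pixel of "content".
    page = [
        [0, 0, 0, 0, 0],
        [0, 255, 255, 255, 0],
        [0, 255, 40, 255, 0],
        [0, 255, 255, 255, 0],
        [0, 0, 0, 0, 0],
    ]
    for row in clear_dark_borders(page):
        print(row)
```

A real tool would also have to cope with noisy or partial borders and with dark content that touches the page edge, which is presumably why the Readme warns that automatic processing will sometimes fail.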