[sane-devel] [ANN] Unpaper - post-processing scanned book-pages

2005-03-10 Thread Julien BLACHE
Jens Gulden m...@jensgulden.de wrote:

Hi Jens,

 unpaper removes black edges and other photocopy artefacts from scanned
 images. It also deskews book pages (auto-rotates them to a straight
 alignment), and centers them on the sheet.
 Old photocopies can become well-readable PDFs again.

Unpaper seems to have endianness issues; running it on my powerpc
gives me a white page as the output, no problem on an i386 machine
though. I'll see if I can come up with something.
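
For illustration only (the actual cause in unpaper has not been tracked down at this point in the thread): one common way such a platform difference arises is decoding multi-byte values in the host's native byte order. The PGM format, for example, stores samples with a maximum value above 255 as two bytes, most significant byte first. The Python sketch below contrasts an explicit big-endian decode with a native-order decode; the function names are hypothetical and not taken from the unpaper source.

```python
import struct
import sys


def decode_samples_be(raw):
    """Decode 16-bit samples stored most-significant byte first.

    The explicit ">" prefix makes the result identical on every host.
    """
    count = len(raw) // 2
    return list(struct.unpack(f">{count}H", raw[:count * 2]))


def decode_samples_native(raw):
    """Decode the same bytes in the host's native byte order.

    The "=" prefix means native order, so the result differs between
    little-endian (i386) and big-endian (PowerPC) machines.
    """
    count = len(raw) // 2
    return list(struct.unpack(f"={count}H", raw[:count * 2]))


# Two samples as they would appear in a 16-bit PGM raster: 255 and 256.
raw = bytes([0x00, 0xFF, 0x01, 0x00])
print("explicit big-endian:", decode_samples_be(raw))
print(f"native ({sys.byteorder}):", decode_samples_native(raw))
```

Code that relies on the native decode gives different pixel values depending on the machine, which is the kind of symptom worth checking for first.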

I'm interested in adding JPEG support to unpaper, although convert
does the trick for now.

JB.

-- 
Julien BLACHE   http://www.jblache.org 
j...@jblache.org  GPG KeyID 0xF5D65169



[sane-devel] [ANN] Unpaper - post-processing scanned book-pages

2005-03-10 Thread Peter Fales
On Thu, Mar 10, 2005 at 07:58:39PM +0100, Julien BLACHE wrote:
 Unpaper seems to have endianness issues; running it on my powerpc
 gives me a white page as the output, no problem on an i386 machine
 though. I'll see if I can come up with something.

I get white pages as output on my i686 machine unless I use
--no-border-scan. I haven't analyzed why, and I'm not sure whether
running unpaper with all default arguments is even expected to work.

--
Peter Fales
Peter at fales-lorenz.net




[sane-devel] [ANN] Unpaper - post-processing scanned book-pages

2005-03-06 Thread Jens Gulden
Hello all,

Gerard Klaver wrote:
 My proposal/idea would be to place it in a sane-tools package.
 This would be a new package containing, for the time being, only the
 unpaper program. (...)

 I had the idea that Jens (the developer of the unpaper program) is also
 the maintainer for unpaper in SANE CVS. So SANE CVS contains the new
 version.

Before starting the project-site on Berlios, I actually thought of 
joining the SANE-project somehow. But looking at the contents of the 
current sane-package, I found it to be off-topic to place an application 
like unpaper there. Actually, Gerard's suggestion is exactly what I 
originally looked for, but didn't find.

Oliver Rauch wrote:
 It only makes sense to do this when these packages have no other
 homepages.

I would like to close the BerliOS project again, because I feel such a
single tool is too fine-grained for a whole open-source project of its own
(and even more so for its own Debian package, as I think would be implied
by Johannes Berg's idea of sane-tools as a meta-package). As I already
wrote in the announcement: "Almost too small for being an open-source
project on its own."

So, I would like to follow Gerard's suggestion.
Maintaining unpaper in the CVS tree will be OK for me, as it is only a
single .c source file and a man page.

Jens

(I won't be in town until Thursday, so please don't expect a quick
response in the next few days.)



[sane-devel] [ANN] Unpaper - post-processing scanned book-pages

2005-03-05 Thread gerard klaver
On Wed, 2005-03-02 at 16:03 +0100, Jens Gulden wrote:
 Hello,
 
 unpaper removes black edges and other photocopy artefacts from scanned 
 images. It also deskews book pages (auto-rotates them to a straight 
 alignment), and centers them on the sheet.
 Old photocopies can become well-readable PDFs again.
 
 Available at http://unpaper.berlios.de/.
 
 Hope it's useful. Enjoy,
 Jens
 
  From the Readme:
 --
 unpaper is a post-processing tool for scanned sheets of paper, 
 especially for book pages that have been scanned from previously created 
 photocopies.
 The main purpose is to make scanned book pages more readable on screen
 after conversion to PDF. Additionally, unpaper might be useful to 
 enhance the quality of scanned pages before performing optical character 
 recognition (OCR).
 
 unpaper tries to clean scanned images by removing dark edges that 
 appeared through scanning or copying on areas outside the actual page 
 content (e.g. dark areas between the left-hand-side and the 
 right-hand-side of a double-sided book-page scan).
 The program also tries to detect misaligned centering and rotation of
 pages and will automatically straighten each page by rotating it to the
 correct angle. This is called deskewing.
 Note that the automatic processing will sometimes fail. It is always a
 good idea to manually check the results of unpaper and adjust the
 parameter settings according to the requirements of the input. Each 
 processing step can also be disabled individually for each sheet.
 
 Input and output files can be in either .pbm or .pgm format, as also 
 used by the Linux scanning tools scanimage and scanadf.
 Conversion to PDF can e.g. be achieved with the Linux tools pgm2tiff, 
 tiffcp and tiff2pdf.
 --
 
 (It's a small program with only a single source file. Almost too small
 for being an open-source project on its own. If you have ideas to 
 integrate it into other Linux scanning/graphics projects instead, please 
 let me know.)
 
Hello Jens,

My proposal/idea would be to place it in a sane-tools package.

This would be a new package containing, for the time being, only the
unpaper program.

Other tools related to scanners or webcams would then also have a place
to be kept.

For example the lprof package, see:
gkall.hobby.nl/sane-lprof/sane-lprof.html

And maybe a place to keep the ICC profiles from scanners?

Other packages?

In this way, maybe the package could also be used as a plug-in for some
scanner frontend programs?

This is the first email about this proposal on the SANE mailing list,
so does anybody have comments/ideas?


-- 

m.vr.gr.
Gerard Klaver




[sane-devel] [ANN] Unpaper - post-processing scanned book-pages

2005-03-05 Thread Oliver Rauch
On Sat, 2005-03-05 at 15:50, gerard klaver wrote:
 On Wed, 2005-03-02 at 16:03 +0100, Jens Gulden wrote:
  Hello,

  unpaper removes black edges and other photocopy artefacts from scanned
  images. It also deskews book pages (auto-rotates them to a straight
  alignment), and centers them on the sheet.
  Old photocopies can become well-readable PDFs again.

 Hello Jens,

 My proposal/idea would be to place it in a sane-tools package.

Hello.

It only makes sense to do this when these packages have no other
homepages. I think it is better when a package has its own home and we
set a link to such tools; then we can be sure that we don't handle old
versions.

Oliver




[sane-devel] [ANN] Unpaper - post-processing scanned book-pages

2005-03-05 Thread gerard klaver
On Sat, 2005-03-05 at 16:18 +0100, Oliver Rauch wrote:
 On Sat, 2005-03-05 at 15:50, gerard klaver wrote:
  On Wed, 2005-03-02 at 16:03 +0100, Jens Gulden wrote:
   Hello,

   unpaper removes black edges and other photocopy artefacts from scanned
   images. It also deskews book pages (auto-rotates them to a straight
   alignment), and centers them on the sheet.
   Old photocopies can become well-readable PDFs again.

  Hello Jens,

  My proposal/idea would be to place it in a sane-tools package.

 Hello.

 It only makes sense to do this when these packages have no other
 homepages. I think it is better when a package has its own home and we
 set a link to such tools; then we can be sure that we don't handle old
 versions.

 Oliver

I had the idea that Jens (the developer of the unpaper program) is also
the maintainer for unpaper in SANE CVS. So SANE CVS contains the new
version.

-- 

m.vr.gr.
Gerard Klaver




[sane-devel] [ANN] Unpaper - post-processing scanned book-pages

2005-03-05 Thread Johannes Berg
On Sat, 2005-03-05 at 16:31 +0100, gerard klaver wrote:
 I had the idea that Jens (the developer of the unpaper program) is also
 the maintainer for unpaper in SANE CVS. So SANE CVS contains the new
 version.

Incorporating software like that is almost always a bad idea -- it ties
release management and other things too closely together, even if the
sources are maintained in a single place.
I agree with Oliver that it'd be nice to have a link to this, and
Debian (and other systems) could adopt a sane-tools meta-package that
would depend on the extra tools.

johannes




[sane-devel] [ANN] Unpaper - post-processing scanned book-pages

2005-03-02 Thread Jens Gulden
Hello,

unpaper removes black edges and other photocopy artefacts from scanned 
images. It also deskews book pages (auto-rotates them to a straight 
alignment), and centers them on the sheet.
Old photocopies can become well-readable PDFs again.

Available at http://unpaper.berlios.de/.

Hope it's useful. Enjoy,
Jens

 From the Readme:
--
unpaper is a post-processing tool for scanned sheets of paper, 
especially for book pages that have been scanned from previously created 
photocopies.
The main purpose is to make scanned book pages more readable on screen
after conversion to PDF. Additionally, unpaper might be useful to 
enhance the quality of scanned pages before performing optical character 
recognition (OCR).

unpaper tries to clean scanned images by removing dark edges that 
appeared through scanning or copying on areas outside the actual page 
content (e.g. dark areas between the left-hand-side and the 
right-hand-side of a double-sided book-page scan).
The program also tries to detect misaligned centering and rotation of
pages and will automatically straighten each page by rotating it to the
correct angle. This is called deskewing.
Note that the automatic processing will sometimes fail. It is always a
good idea to manually check the results of unpaper and adjust the
parameter settings according to the requirements of the input. Each 
processing step can also be disabled individually for each sheet.

Input and output files can be in either .pbm or .pgm format, as also 
used by the Linux scanning tools scanimage and scanadf.
Conversion to PDF can e.g. be achieved with the Linux tools pgm2tiff, 
tiffcp and tiff2pdf.
--
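
As a rough illustration of the input format named in the Readme: a binary PGM ("P5") file is a short text header (magic number, width, height, maximum gray value) followed by exactly one whitespace byte and then the raw samples. The sketch below is not taken from unpaper. The function names are hypothetical, and it assumes 8-bit images without comment lines in the header.

```python
def write_pgm(width, height, pixels, maxval=255):
    """Build the bytes of a binary (P5) PGM image from 8-bit gray values."""
    if len(pixels) != width * height:
        raise ValueError("pixel count does not match width * height")
    header = f"P5\n{width} {height}\n{maxval}\n".encode("ascii")
    return header + bytes(pixels)


def read_pgm(data):
    """Parse a binary (P5) PGM image; returns (width, height, maxval, raster).

    Assumption: no '#' comment lines in the header and maxval <= 255.
    """
    pos = 0
    tokens = []
    while len(tokens) < 4:
        # Skip whitespace between header tokens.
        while data[pos:pos + 1].isspace():
            pos += 1
        start = pos
        while pos < len(data) and not data[pos:pos + 1].isspace():
            pos += 1
        tokens.append(data[start:pos])
    # Exactly one whitespace byte separates the header from the raster, so
    # skip only one: the raster itself may begin with whitespace-valued bytes.
    pos += 1
    magic = tokens[0]
    width, height, maxval = (int(t) for t in tokens[1:])
    if magic != b"P5" or maxval > 255:
        raise ValueError("only 8-bit binary PGM is handled in this sketch")
    raster = data[pos:pos + width * height]
    if len(raster) != width * height:
        raise ValueError("truncated raster")
    return width, height, maxval, raster


# Round trip of a 2x2 image whose first pixel value (10) is a newline byte.
image = write_pgm(2, 2, [10, 32, 0, 255])
print(read_pgm(image))
```

Skipping only a single separator byte matters because a pixel value of 10 or 32 is itself an ASCII whitespace character.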

(It's a small program with only a single source file. Almost too small
for being an open-source project on its own. If you have ideas to 
integrate it into other Linux scanning/graphics projects instead, please 
let me know.)