Hi
Am 15.07.2011 23:49, schrieb Sarah:
Hi,
We've encountered some problems rendering pdf files to jpeg; I've
narrowed the problem down to a short set of test cases... which turn
out to have several different issues, depending on which platform we
test them on. I'd like to submit my test case to the issue tracker,
but I'm not sure whether to submit it as one bug or four... do the
developers have a preference?
As the described issues seem to be unrelated to each other I would prefer
four single issues, so that it'll be easier to track the progress.
You can find our tracker at [1]. Please add all needed information and
if possible attach a sample pdf to it.
List of issues, and test code, below... a .zip file with sample code
and pdf files is ready to be submitted.
The original problem we were coping with was the fact that pdf scans
from our departmental networked copier always render to a black page.
The other issues were just encountered while testing.
BTW, it is *entirely* possible that I'm just doing something wrong;
I'm new to PDFBox. Is there something obviously wrong with my test
code?
I had a quick look and everything seems to be ok.
Thanks,
Sarah
Problem #1: The file "ItDoesntWorkScan.pdf" renders to an empty
black page. This file is a copy of "ItDoesntWorkPrinted.pdf"
that has been printed on paper, and then scanned with
a Xerox WorkCentre 5030 scanner, which then emails a pdf file
back to the user.
Tested On:
- Mac OS 10.6
- Windows 7
- Ubuntu 10.10
Unfortunately, the WorkCentre 5030 doesn't appear to have
many user-settable options for scanning to PDF, so we weren't
really able to try scanning with settings other than the defaults.
Are you using the most recent version 1.6.0 of PDFBox?
Problem #2: On MacOS, running the headless tests ("ant run-headless")
generates multiple instances of messages like this:
*** __NSAutoreleaseNoPool(): Object 0x10b60a5a0 of class
NSConcreteMapTableValueEnumerator autoreleased with no pool
in place - just leaking
Sounds familiar to me, but I can't remember the details. Did you search the
mailing list archive [2]
Problem #3: TestRender.pdf adds an odd-looking (different font?)
question mark to the end of every line. These are not present in
the original PDF file. Tested On:
- Mac OS 10.6
- Windows 7
- Ubuntu 10.10
Problem #4: On a plain vanilla Ubuntu 10.10 install, running
run-all failed to render any text, and threw lots of exceptions:
org.apache.pdfbox.pdmodel.font.PDTrueTypeFont.getawtFont(PDTrueTypeFont.java:425)
...however, installing the package "ttf-mscorefonts-installer"
made those exceptions go away.
(ubuntu1010_output.txt shows the exceptions;
ubuntu1010_try2_output.txt is a run after the extra fonts are installed)
Might be able to fix this one by setting UNKNOWN_FONT in
Resources/PDFBox_External_Fonts.properties, but it would seem like
it should choose some reasonable default if it isn't set...
shouldn't it?
SNIP
BR
Andreas Lehmkühler
[1] https://issues.apache.org/jira/browse/PDFBOX
[2] http://pdfbox.apache.org/mail-lists.html