[jira] [Comment Edited] (PDFBOX-2133) Parsing of a Type1 font fails with a NumberFormatException

John Hewson (JIRA) Thu, 12 Jun 2014 16:27:25 -0700

    [ 
https://issues.apache.org/jira/browse/PDFBOX-2133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14029998#comment-14029998
 ]


John Hewson edited comment on PDFBOX-2133 at 6/12/14 11:26 PM:
---------------------------------------------------------------

{quote}
I am not sure whether the parser or the PDF is wrong, but the fact that it 
renders fine in Acrobat and in 1.7.x indicates that the former is true.
{quote}

Ha, I wish! Unfortunately, Acrobat is able to parse all manner of corrupt Type 1 
font files. In this case, as Tilman observed, BlueShift is supposed to be an 
integer, yet in this file it is a float.

However, if Acrobat can read it, then we want PDFBox to be able to read it too. 
I've modified the Type 1 parser to always parse integers as floats and then 
truncate them to ints; that way we avoid similar issues in future. 
I've added this to the trunk in [r1602311|http://svn.apache.org/r1602311].
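
For illustration only, here is a minimal sketch of the "parse as float, then truncate" idea described above. It is not the code committed in r1602311: the class name LenientIntSketch and the method parseLenientInt are hypothetical, and the sketch assumes the numeric token has already been read as a string.

```java
public final class LenientIntSketch {
    private LenientIntSketch() {
    }

    /**
     * Hypothetical helper: reads a numeric token as a float first and then
     * truncates it, so "7" and "7.0" both yield 7. A strict parse such as
     * Integer.parseInt("7.0") would throw a NumberFormatException instead.
     */
    public static int parseLenientInt(String token) {
        float value = Float.parseFloat(token.trim());
        // The narrowing cast drops the fractional part (truncates toward zero).
        return (int) value;
    }

    public static void main(String[] args) {
        System.out.println(parseLenientInt("7"));   // 7
        System.out.println(parseLenientInt("7.0")); // 7
        System.out.println(parseLenientInt("6.9")); // 6
    }
}
```

Truncation rather than rounding matches the wording of the comment; a token that is not numeric at all would still raise a NumberFormatException in this sketch.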



> Parsing of a Type1 font fails with a NumberFormatException
> ----------------------------------------------------------
>
>                 Key: PDFBOX-2133
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2133
>             Project: PDFBox
>          Issue Type: Bug
>          Components: FontBox
>    Affects Versions: 2.0.0
>            Reporter: Petr Slaby
>            Assignee: John Hewson
>            Priority: Minor
>             Fix For: 2.0.0
>
>         Attachments: 000116.pdf, 000304.pdf, testrun.log
>
>
> When rendering the attached PDF, parsing of a font fails with a 
> NumberFormatException. Many NullPointerExceptions and "missing fonts" 
> messages are then reported. The PDF rendered fine in our modified 1.7.x, where 
> fonts were read using AWT. I did not try with current 1.8.x. Stack traces are 
> attached.
> Note: This is just a file from my test suite, not a production problem. I am 
> not sure whether the parser or the PDF is wrong, but the fact that it renders 
> fine in Acrobat and in 1.7.x indicates that the former is true. The offending 
> font is F2; if I catch and ignore the runtime exception in 
> PDResources#getFonts(), it is then reported as missing in the PageDrawer.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
