[ 
https://issues.apache.org/jira/browse/PDFBOX-3471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15434404#comment-15434404
 ] 

Petras commented on PDFBOX-3471:
--------------------------------

[~msahyoun], it seems you have already made the necessary fix :). In fact, I 
didn't notice that method when I was proposing the patch. It appears comment 
removal was already there but, as you noted, it didn't work properly. You 
left a "TODO: verify why this is necessary empty text nodes to be 
deleted". I wonder, are there any cases where empty text nodes should not be 
deleted?
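
To make the question concrete, here is a minimal sketch of the kind of cleanup 
being discussed, written against the standard org.w3c.dom API only. It is not 
the actual DomXmpParser code: the class name XmpDomCleaner and both method 
names are hypothetical, chosen just for illustration.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;

// Hypothetical helper, not part of XmpBox.
final class XmpDomCleaner {

    // Recursively removes comment nodes and whitespace-only text nodes below 'parent'.
    static void removeCommentsAndBlankText(Node parent) {
        Node child = parent.getFirstChild();
        while (child != null) {
            // Fetch the sibling first: removing 'child' detaches it from the tree.
            Node next = child.getNextSibling();
            short type = child.getNodeType();
            if (type == Node.COMMENT_NODE) {
                parent.removeChild(child);
            } else if (type == Node.TEXT_NODE && child.getNodeValue().trim().isEmpty()) {
                parent.removeChild(child);
            } else if (type == Node.ELEMENT_NODE) {
                removeCommentsAndBlankText(child);
            }
            child = next;
        }
    }

    // Parses an XML string into a namespace-aware DOM document.
    static Document parse(String xml) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true);
        DocumentBuilder builder = factory.newDocumentBuilder();
        return builder.parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
    }

    public static void main(String[] args) throws Exception {
        String xml = "<x:xmpmeta xmlns:x=\"adobe:ns:meta/\">\n"
                + "    <!-- a comment -->\n"
                + "    <rdf:RDF xmlns:rdf=\"http://www.w3.org/1999/02/22-rdf-syntax-ns#\"/>\n"
                + "</x:xmpmeta>";
        Document doc = parse(xml);
        Node root = doc.getDocumentElement();
        System.out.println("children before: " + root.getChildNodes().getLength());
        removeCommentsAndBlankText(root);
        System.out.println("children after: " + root.getChildNodes().getLength());
    }
}
```

Note that this sketch also deletes a whitespace-only text node when it is the 
sole content of an element, so a property whose value consists only of spaces 
would lose that value. That may be one case where deleting empty text nodes is 
not wanted.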

> XMP parsing fails if XMP contain comments
> -----------------------------------------
>
>                 Key: PDFBOX-3471
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-3471
>             Project: PDFBox
>          Issue Type: Bug
>          Components: XmpBox
>    Affects Versions: 2.0.2
>            Reporter: Petras
>         Attachments: PDFBOX-3471_XmpParsingIgnoringComments.patch
>
>
> DomXmpParser fails to parse the following valid XMP:
> {code:xml}
> <?xpacket begin="" id="W5M0MpCehiHzreSzNTczkc9d"?>
> <x:xmpmeta xmlns:x="adobe:ns:meta/" x:xmptk="Adobe XMP Core 5.1.0-jc003">
>     <!-- PDF/A standarto versija (1 ar 2) ir suderinamumo lygmuo (A, B ar U) -->
>     <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
>         <rdf:Description rdf:about = ""
>                          xmlns:pdfaid = "http://www.aiim.org/pdfa/ns/id/">
>             <pdfaid:part>1</pdfaid:part>
>             <pdfaid:conformance>B</pdfaid:conformance>
>         </rdf:Description>
>     </rdf:RDF>
> </x:xmpmeta>
> <?xpacket end="w"?>
> {code}
> DomXmpParser finds the comment node and fails:
> {code}
> org.apache.xmpbox.xml.XmpParsingException: More than one element found in x:xmpmeta
>       at org.apache.xmpbox.xml.DomXmpParser.findDescriptionsParent(DomXmpParser.java:750)
>       at org.apache.xmpbox.xml.DomXmpParser.parse(DomXmpParser.java:183)
>       at org.apache.xmpbox.xml.DomXmpParser.parse(DomXmpParser.java:111)
> ...
> {code}


