[
https://issues.apache.org/jira/browse/JENA-2331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17544111#comment-17544111
]
Brian Vosburgh commented on JENA-2331:
--------------------------------------
>From what I can tell from the Woodstox issue I reference in the description,
>the new {{XMLConstants}} settings are redundant with the already-supported
>StAX {{XMLInputFactory}} settings. Do you read the discussion in that issue
>differently?
Yes, the "environment" installed Woodstox. We have another third party library
that pulled it in. And the intent is clearly to use the parser directly, not to
replace Jena's parser.
I have modified the PR to implement option (2).
Yet another workaround that can be used is to temporarily set a Java system
property, forcing Jena to use the JDK parser:
{code:java}
System.setProperty("javax.xml.stream.XMLInputFactory",
"com.sun.xml.internal.stream.XMLInputFactoryImpl");
// initialize Jena etc.
System.clearProperty("javax.xml.stream.XMLInputFactory");
{code}
> JenaXMLInput does not play well with Woodstox
> ---------------------------------------------
>
> Key: JENA-2331
> URL: https://issues.apache.org/jira/browse/JENA-2331
> Project: Apache Jena
> Issue Type: Bug
> Components: Core
> Affects Versions: Jena 4.5.0
> Reporter: Brian Vosburgh
> Priority: Major
>
> New in 4.5.0, {{JenaXMLInput}} initializes the loaded {{XMLInputFactory}}
> with at least one property that is not supported by the Woodstox StAX parser
> (see [https://github.com/FasterXML/woodstox/issues/51]). This results in a
> logged stack trace that looks like this:
> {code}
> ERROR [main] o.a.j.u.JenaXMLInput - Problem setting StAX property
> java.lang.IllegalArgumentException: Unrecognized property
> 'http://javax.xml.XMLConstants/property/accessExternalDTD'
> at
> com.ctc.wstx.api.CommonConfig.reportUnknownProperty(CommonConfig.java:167)
> ~[woodstox-core-6.2.7.jar:6.2.7]
> at com.ctc.wstx.api.CommonConfig.setProperty(CommonConfig.java:158)
> ~[woodstox-core-6.2.7.jar:6.2.7]
> at com.ctc.wstx.api.ReaderConfig.setProperty(ReaderConfig.java:35)
> ~[woodstox-core-6.2.7.jar:6.2.7]
> at
> com.ctc.wstx.stax.WstxInputFactory.setProperty(WstxInputFactory.java:400)
> ~[woodstox-core-6.2.7.jar:6.2.7]
> at
> org.apache.jena.util.JenaXMLInput.initXMLInputFactory(JenaXMLInput.java:81)
> ~[jena-core-4.5.0.jar:4.5.0]
> at org.apache.jena.util.JenaXMLInput.<clinit>(JenaXMLInput.java:90)
> ~[jena-core-4.5.0.jar:4.5.0]
> at
> org.apache.jena.rdfxml.xmlinput.impl.RDFXMLParser.create(RDFXMLParser.java:75)
> ~[jena-core-4.5.0.jar:4.5.0]
> at org.apache.jena.rdfxml.xmlinput.ARP.<init>(ARP.java:76)
> ~[jena-core-4.5.0.jar:4.5.0]
> at
> org.apache.jena.riot.lang.ReaderRIOTRDFXML.<init>(ReaderRIOTRDFXML.java:62)
> ~[jena-arq-4.5.0.jar:4.5.0]
> at
> org.apache.jena.riot.lang.ReaderRIOTRDFXML.lambda$static$0(ReaderRIOTRDFXML.java:59)
> ~[jena-arq-4.5.0.jar:4.5.0]
> at org.apache.jena.riot.RDFParser.createReader(RDFParser.java:486)
> [jena-arq-4.5.0.jar:4.5.0]
> at org.apache.jena.riot.RDFParser.createReader(RDFParser.java:480)
> [jena-arq-4.5.0.jar:4.5.0]
> at org.apache.jena.riot.RDFParser.parseNotUri(RDFParser.java:399)
> [jena-arq-4.5.0.jar:4.5.0]
> at org.apache.jena.riot.RDFParser.parse(RDFParser.java:356)
> [jena-arq-4.5.0.jar:4.5.0]
> at org.apache.jena.riot.RDFParserBuilder.parse(RDFParserBuilder.java:568)
> [jena-arq-4.5.0.jar:4.5.0]
> at
> org.apache.jena.riot.RDFDataMgr.parseFromInputStream(RDFDataMgr.java:718)
> [jena-arq-4.5.0.jar:4.5.0]
> at org.apache.jena.riot.RDFDataMgr.read(RDFDataMgr.java:253)
> [jena-arq-4.5.0.jar:4.5.0]
> at org.apache.jena.riot.RDFDataMgr.read(RDFDataMgr.java:235)
> [jena-arq-4.5.0.jar:4.5.0]
> at
> org.apache.jena.riot.adapters.RDFReaderRIOT.read(RDFReaderRIOT.java:69)
> [jena-arq-4.5.0.jar:4.5.0]
> at org.apache.jena.rdf.model.impl.ModelCom.read(ModelCom.java:253)
> [jena-core-4.5.0.jar:4.5.0]
> at
> org.apache.jena.ontology.OntDocumentManager.findMetadata(OntDocumentManager.java:890)
> [jena-core-4.5.0.jar:4.5.0]
> at
> org.apache.jena.ontology.OntDocumentManager.initialiseMetadata(OntDocumentManager.java:848)
> [jena-core-4.5.0.jar:4.5.0]
> at
> org.apache.jena.ontology.OntDocumentManager.<init>(OntDocumentManager.java:196)
> [jena-core-4.5.0.jar:4.5.0]
> at
> org.apache.jena.ontology.OntDocumentManager.<init>(OntDocumentManager.java:178)
> [jena-core-4.5.0.jar:4.5.0]
> at
> org.apache.jena.ontology.OntDocumentManager.<init>(OntDocumentManager.java:162)
> [jena-core-4.5.0.jar:4.5.0]
> {code}
> Jena should probably do one of the following:
> * Instead of relying on whatever parser is discovered by
> {{{}XMLInputFactory.newInstance(){}}}, Jena can directly load the
> JDK-supplied parser, which supports these properties, by calling
> {{XMLInputFactory.newDefaultFactory()}} instead.
> * Jena can be a bit more robust when dealing with non-JDK parsers by
> wrapping each call to {{XMLInputFactory.setProperty(...)}} with a separate
> {{try-catch}} block and log any exception appropriately.
--
This message was sent by Atlassian Jira
(v8.20.7#820007)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]