Recorded as https://issues.apache.org/jira/browse/JENA-2097
Was the reading from a file using "riot"?
It is different because it installs a different error handler that
reports errors but does not throw an exception.
A java workaround is:
ErrorHandler eh = ErrorHandlerFactory
.errorHandlerTracking(ErrorHandlerFactory.stdLogger,
false, false);
then
RDFParser.source(in).errorHandler(eh).lang(Lang.NT).parse(model);
or set system-wide with:
ErrorHandlerFactory.setDefaultErrorHandler
On 27/04/2021 16:29, Jindřich Mynarz wrote:
If I read the code correctly, the regular expression for checking UUID URNs
is matched to the complete original IRI string (e.g.,
"urn:uuid:3e5baa77-a990-4a34-85b5-b66246829d24") (
https://github.com/apache/jena/blob/main/jena-core/src/main/java/org/apache/jena/irix/IRIProviderJenaIRI.java#L270)
rather than to the fragment trailing after "urn:uuid" that the comment
suggests (
https://github.com/apache/jena/blob/main/jena-core/src/main/java/org/apache/jena/irix/IRIProviderJenaIRI.java#L255
).
Is this correct understanding?
Yes.
(And the regexp is case sensitive which it should not be.)
Thanks for the details and investigation.
Andy
- Jindrich
On Tue, 27 Apr 2021 at 16:59, Jindřich Mynarz <[email protected]>
wrote:
Hi,
when reading data with UUID URNs from InputStreams in Jena 4 I get the
error "Bad IRI: Not a valid UUID string". For example, when I run the
following:
Model model = ModelFactory.createDefaultModel();
String data = "<urn:uuid:4f115b8c-5300-4e4d-84f4-1a7593e5fd57> <
http://ex.org/a> <http://ex.org/b> .";
try (ByteArrayInputStream in = new
ByteArrayInputStream(data.getBytes(StandardCharsets.UTF_8))) {
RDFDataMgr.read(model, in, Lang.NTRIPLES);
}
I get the following error:
org.apache.jena.riot.RiotException: [line: 1, col: 1 ] Bad IRI: Not a
valid UUID string: urn:uuid:4f115b8c-5300-4e4d-84f4-1a7593e5fd57
at
org.apache.jena.riot.system.ErrorHandlerFactory$ErrorHandlerStd.error(ErrorHandlerFactory.java:146)
at
org.apache.jena.riot.system.ParserProfileStd.internalMakeIRI(ParserProfileStd.java:112)
at
org.apache.jena.riot.system.ParserProfileStd.resolveIRI(ParserProfileStd.java:85)
at
org.apache.jena.riot.system.ParserProfileStd.createURI(ParserProfileStd.java:187)
at
org.apache.jena.riot.system.ParserProfileStd.create(ParserProfileStd.java:259)
at org.apache.jena.riot.lang.LangNTriples.tokenAsNode(LangNTriples.java:70)
at org.apache.jena.riot.lang.LangNTuple.parseTriple(LangNTuple.java:92)
at org.apache.jena.riot.lang.LangNTriples.parseOne(LangNTriples.java:61)
at org.apache.jena.riot.lang.LangNTriples.runParser(LangNTriples.java:53)
at org.apache.jena.riot.lang.LangBase.parse(LangBase.java:43)
at
org.apache.jena.riot.RDFParserRegistry$ReaderRIOTLang.read(RDFParserRegistry.java:184)
at org.apache.jena.riot.RDFParser.read(RDFParser.java:357)
at org.apache.jena.riot.RDFParser.parseNotUri(RDFParser.java:347)
at org.apache.jena.riot.RDFParser.parse(RDFParser.java:294)
at org.apache.jena.riot.RDFParserBuilder.parse(RDFParserBuilder.java:550)
at
org.apache.jena.riot.RDFDataMgr.parseFromInputStream(RDFDataMgr.java:721)
at org.apache.jena.riot.RDFDataMgr.read(RDFDataMgr.java:255)
at org.apache.jena.riot.RDFDataMgr.read(RDFDataMgr.java:222)
at org.apache.jena.riot.RDFDataMgr.read(RDFDataMgr.java:208)
This error doesn't appear when the data is read from a file or when I use
the previous version of Jena (i.e. 3.17.0).
Can someone spot if I'm doing something incorrectly? Is this a genuine
regression in Jena 4?
Best regards,
Jindrich