Recorded as https://issues.apache.org/jira/browse/JENA-2097

Was the reading from a file using "riot"?

It is different because it installs a different error handler that reports errors but does not throw an exception.

A java workaround is:

    ErrorHandler eh = ErrorHandlerFactory
        .errorHandlerTracking(ErrorHandlerFactory.stdLogger,
                              false, false);

then

RDFParser.source(in).errorHandler(eh).lang(Lang.NT).parse(model);

or set system-wide with:

ErrorHandlerFactory.setDefaultErrorHandler


On 27/04/2021 16:29, Jindřich Mynarz wrote:
If I read the code correctly, the regular expression for checking UUID URNs
is matched to the complete original IRI string (e.g.,
"urn:uuid:3e5baa77-a990-4a34-85b5-b66246829d24") (
https://github.com/apache/jena/blob/main/jena-core/src/main/java/org/apache/jena/irix/IRIProviderJenaIRI.java#L270)
rather than to the fragment trailing after "urn:uuid" that the comment
suggests (
https://github.com/apache/jena/blob/main/jena-core/src/main/java/org/apache/jena/irix/IRIProviderJenaIRI.java#L255
).

Is this correct understanding?

Yes.

(And the regexp is case sensitive which it should not be.)

Thanks for the details and investigation.

    Andy


- Jindrich

On Tue, 27 Apr 2021 at 16:59, Jindřich Mynarz <[email protected]>
wrote:

Hi,

when reading data with UUID URNs from InputStreams in Jena 4 I get the
error "Bad IRI: Not a valid UUID string". For example, when I run the
following:

Model model = ModelFactory.createDefaultModel();
String data = "<urn:uuid:4f115b8c-5300-4e4d-84f4-1a7593e5fd57> <
http://ex.org/a> <http://ex.org/b> .";
try (ByteArrayInputStream in = new
ByteArrayInputStream(data.getBytes(StandardCharsets.UTF_8))) {
     RDFDataMgr.read(model, in, Lang.NTRIPLES);
}

I get the following error:

org.apache.jena.riot.RiotException: [line: 1, col: 1 ] Bad IRI: Not a
valid UUID string: urn:uuid:4f115b8c-5300-4e4d-84f4-1a7593e5fd57
at
org.apache.jena.riot.system.ErrorHandlerFactory$ErrorHandlerStd.error(ErrorHandlerFactory.java:146)
at
org.apache.jena.riot.system.ParserProfileStd.internalMakeIRI(ParserProfileStd.java:112)
at
org.apache.jena.riot.system.ParserProfileStd.resolveIRI(ParserProfileStd.java:85)
at
org.apache.jena.riot.system.ParserProfileStd.createURI(ParserProfileStd.java:187)
at
org.apache.jena.riot.system.ParserProfileStd.create(ParserProfileStd.java:259)
at org.apache.jena.riot.lang.LangNTriples.tokenAsNode(LangNTriples.java:70)
at org.apache.jena.riot.lang.LangNTuple.parseTriple(LangNTuple.java:92)
at org.apache.jena.riot.lang.LangNTriples.parseOne(LangNTriples.java:61)
at org.apache.jena.riot.lang.LangNTriples.runParser(LangNTriples.java:53)
at org.apache.jena.riot.lang.LangBase.parse(LangBase.java:43)
at
org.apache.jena.riot.RDFParserRegistry$ReaderRIOTLang.read(RDFParserRegistry.java:184)
at org.apache.jena.riot.RDFParser.read(RDFParser.java:357)
at org.apache.jena.riot.RDFParser.parseNotUri(RDFParser.java:347)
at org.apache.jena.riot.RDFParser.parse(RDFParser.java:294)
at org.apache.jena.riot.RDFParserBuilder.parse(RDFParserBuilder.java:550)
at
org.apache.jena.riot.RDFDataMgr.parseFromInputStream(RDFDataMgr.java:721)
at org.apache.jena.riot.RDFDataMgr.read(RDFDataMgr.java:255)
at org.apache.jena.riot.RDFDataMgr.read(RDFDataMgr.java:222)
at org.apache.jena.riot.RDFDataMgr.read(RDFDataMgr.java:208)

This error doesn't appear when the data is read from a file or when I use
the previous version of Jena (i.e. 3.17.0).

Can someone spot if I'm doing something incorrectly? Is this a genuine
regression in Jena 4?

Best regards,

Jindrich


Reply via email to