[ https://issues.apache.org/jira/browse/ANY23-381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16567568#comment-16567568 ]
Hudson commented on ANY23-381:
------------------------------

SUCCESS: Integrated in Jenkins build Any23-trunk #1609 (See [https://builds.apache.org/job/Any23-trunk/1609/])
ANY23-381 fix illegal unescaped characters in JSON-LD (hans: rev 817e744af90d8f3c9bf419e5c395c421e0c3924a)
* (edit) core/src/main/java/org/apache/any23/extractor/rdf/BaseRDFExtractor.java
* (edit) core/src/test/java/org/apache/any23/extractor/html/EmbeddedJSONLDExtractorTest.java
* (add) test-resources/src/test/resources/html/html-jsonld-unescaped-characters.html


> JsonParseException: Illegal unquoted character
> ----------------------------------------------
>
>                 Key: ANY23-381
>                 URL: https://issues.apache.org/jira/browse/ANY23-381
>             Project: Apache Any23
>          Issue Type: Bug
>          Components: extractors
>    Affects Versions: 2.3
>            Reporter: Hans Brende
>            Assignee: Hans Brende
>            Priority: Major
>             Fix For: 2.3
>
>
> While perusing the site http://losangeles.eventful.com/events I stumbled
> across the following exception:
> {noformat}
> org.eclipse.rdf4j.rio.RDFParseException: Could not parse JSONLD
>     at org.eclipse.rdf4j.rio.jsonld.JSONLDParser.parse(JSONLDParser.java:77)
>     at org.apache.any23.extractor.rdf.BaseRDFExtractor.run(BaseRDFExtractor.java:196)
>     ... 36 more
> Caused by: com.fasterxml.jackson.core.JsonParseException: Illegal unquoted character ((CTRL-CHAR, code 10)): has to be escaped using backslash to be included in string value
>  at [Source: (BufferedReader); line: 1, column: 147]
>     at com.fasterxml.jackson.core.JsonParser._constructError(JsonParser.java:1804)
>     at com.fasterxml.jackson.core.base.ParserMinimalBase._reportError(ParserMinimalBase.java:663)
>     at com.fasterxml.jackson.core.base.ParserMinimalBase._throwUnquotedSpace(ParserMinimalBase.java:627)
>     at com.fasterxml.jackson.core.json.ReaderBasedJsonParser._finishString2(ReaderBasedJsonParser.java:2045)
>     at com.fasterxml.jackson.core.json.ReaderBasedJsonParser._finishString(ReaderBasedJsonParser.java:2016)
>     at com.fasterxml.jackson.core.json.ReaderBasedJsonParser.getText(ReaderBasedJsonParser.java:278)
>     at com.fasterxml.jackson.databind.deser.std.UntypedObjectDeserializer$Vanilla.deserialize(UntypedObjectDeserializer.java:672)
>     at com.fasterxml.jackson.databind.deser.std.MapDeserializer._readAndBindStringKeyMap(MapDeserializer.java:527)
>     at com.fasterxml.jackson.databind.deser.std.MapDeserializer.deserialize(MapDeserializer.java:364)
>     at com.fasterxml.jackson.databind.deser.std.MapDeserializer.deserialize(MapDeserializer.java:29)
>     at com.fasterxml.jackson.databind.ObjectMapper._readValue(ObjectMapper.java:3972)
>     at com.fasterxml.jackson.databind.ObjectMapper.readValue(ObjectMapper.java:2264)
>     at com.fasterxml.jackson.core.JsonParser.readValueAs(JsonParser.java:1729)
>     at com.github.jsonldjava.utils.JsonUtils.fromJsonParser(JsonUtils.java:196)
>     at com.github.jsonldjava.utils.JsonUtils.fromReader(JsonUtils.java:173)
>     at com.github.jsonldjava.utils.JsonUtils.fromInputStream(JsonUtils.java:154)
>     at com.github.jsonldjava.utils.JsonUtils.fromInputStream(JsonUtils.java:111)
>     at org.eclipse.rdf4j.rio.jsonld.JSONLDParser.parse(JSONLDParser.java:71)
>     ... 37 more
> {noformat}
> caused by the {{description}} field in the following json spanning multiple
> unescaped newlines:
> {noformat}
> {
>   "@context": "http://schema.org",
>   "@type": "Event",
>   "name": "#1 Magic Show in L.A.",
>   "description": "#1 MAGIC SHOW IN L.A.
> The current WINNER of the CW’s Penn & Teller’s FOOL US, Illusionist
> extraordinaire Ivan Amodei is on a national tour with his show INTIMATE
> ILLUSIONS.
> Currently, on an ei...",
>   "startDate": "Saturday, August 11, 2018 4:00 PM",
>   "image": "//d1marr3m5x4iac.cloudfront.net/images/perspectivecrop373by249/I0-001/040/358/185-9.png_/1-magic-show-la-85.png",
>   "location": {
>     "@type": "Place",
>     "name": "Beverly Wilshire Hotel",
>     "url": "//losangeles.eventful.com/venues/beverly-wilshire-hotel-/V0-001-003541383-4",
>     "address": {
>       "streetAddress": "9500 Wilshire Boulevard",
>       "addressLocality": "Beverly Hills",
>       "addressRegion": "California",
>       "postalCode": "90212"
>     }
>   },
>   "offers": {
>     "@type": "Offer",
>     "url": "//losangeles.eventful.com/events/1-magic-show-la-/E0-001-114704991-1/tickets",
>     "availability": "http://schema.org/InStock"
>   },
>   "performer": [ {
>     "@type": "Person",
>     "name": "Ivan Amodei"
>   } ]
> }
> {noformat}

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
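
For illustration only: the failure quoted above comes from raw control characters (here a line feed, code 10) sitting inside a JSON string value. One way to tolerate such input is to rewrite those characters as JSON escapes before the text reaches the parser. The sketch below is a minimal, standalone version of that idea. The class name JsonControlCharEscaper and its method are hypothetical, and this is not necessarily how the committed change to BaseRDFExtractor works.

```java
// Hypothetical helper, not part of Any23: escapes raw control characters
// that appear inside JSON string literals so a strict parser accepts them.
final class JsonControlCharEscaper {

    private JsonControlCharEscaper() {
    }

    static String escapeControlCharsInStrings(String json) {
        StringBuilder out = new StringBuilder(json.length() + 16);
        boolean inString = false; // true while between an opening and closing quote
        boolean escaped = false;  // true if the previous char was an unescaped backslash

        for (int i = 0; i < json.length(); i++) {
            char c = json.charAt(i);

            if (!inString) {
                // Outside strings, whitespace and structure are copied untouched.
                if (c == '"') {
                    inString = true;
                }
                out.append(c);
            } else if (escaped) {
                // Character following a backslash: copy as-is.
                escaped = false;
                out.append(c);
            } else if (c == '\\') {
                escaped = true;
                out.append(c);
            } else if (c == '"') {
                inString = false;
                out.append(c);
            } else if (c < 0x20) {
                // Raw control character inside a string: replace with an escape.
                switch (c) {
                    case '\n': out.append("\\n"); break;
                    case '\r': out.append("\\r"); break;
                    case '\t': out.append("\\t"); break;
                    default:   out.append(String.format("\\u%04x", (int) c));
                }
            } else {
                out.append(c);
            }
        }
        return out.toString();
    }
}
```

Applied to the {{description}} value in the issue, each raw newline would become the two-character sequence backslash-n, while newlines between JSON members stay as they are. Jackson also has a JsonParser.Feature.ALLOW_UNQUOTED_CONTROL_CHARS setting; whether it can be enabled through the rdf4j JSON-LD parser used here is not established by this message.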