[ https://issues.apache.org/jira/browse/ANY23-328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16374668#comment-16374668 ]
Hudson commented on ANY23-328: ------------------------------ SUCCESS: Integrated in Jenkins build Any23-trunk #1540 (See [https://builds.apache.org/job/Any23-trunk/1540/]) ANY23-328 Strip comments from json-ld content to make parsing more (hans: rev 189bf260e74436860054469fde8192531cce6f14) * (edit) core/src/main/java/org/apache/any23/extractor/html/EmbeddedJSONLDExtractor.java * (edit) core/src/main/java/org/apache/any23/extractor/rdf/BaseRDFExtractor.java * (add) test-resources/src/test/resources/html/html-jsonld-strip-comments.html * (edit) core/src/test/java/org/apache/any23/extractor/html/EmbeddedJSONLDExtractorTest.java > Problem parsing json-ld content surrounded by comments > ------------------------------------------------------ > > Key: ANY23-328 > URL: https://issues.apache.org/jira/browse/ANY23-328 > Project: Apache Any23 > Issue Type: Bug > Components: core > Affects Versions: 2.1 > Reporter: Hans Brende > Assignee: Hans Brende > Priority: Major > Fix For: 2.2 > > > Sometimes in json-ld script blocks (e.g., on https://www.guthriegreen.com), > you will see > /*<![CDATA[*/ > ...json-ld content... > /*]]>*/ > or > //<![CDATA[ > ...json-ld content... > //]]> > > Currently we are stripping CDATA markers, but we are not stripping leading & > trailing comments, which will cause json-ld parsing to fail. This may be > related to issue #17. > -- This message was sent by Atlassian JIRA (v7.6.3#76005)