[ 
https://issues.apache.org/jira/browse/ANY23-328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16374668#comment-16374668
 ] 

Hudson commented on ANY23-328:
------------------------------

SUCCESS: Integrated in Jenkins build Any23-trunk #1540 (See 
[https://builds.apache.org/job/Any23-trunk/1540/])
ANY23-328 Strip comments from json-ld content to make parsing more (hans: rev 
189bf260e74436860054469fde8192531cce6f14)
* (edit) core/src/main/java/org/apache/any23/extractor/html/EmbeddedJSONLDExtractor.java
* (edit) core/src/main/java/org/apache/any23/extractor/rdf/BaseRDFExtractor.java
* (add) test-resources/src/test/resources/html/html-jsonld-strip-comments.html
* (edit) core/src/test/java/org/apache/any23/extractor/html/EmbeddedJSONLDExtractorTest.java


> Problem parsing json-ld content surrounded by comments
> ------------------------------------------------------
>
>                 Key: ANY23-328
>                 URL: https://issues.apache.org/jira/browse/ANY23-328
>             Project: Apache Any23
>          Issue Type: Bug
>          Components: core
>    Affects Versions: 2.1
>            Reporter: Hans Brende
>            Assignee: Hans Brende
>            Priority: Major
>             Fix For: 2.2
>
>
> Sometimes in json-ld script blocks (e.g., on https://www.guthriegreen.com), 
> you will see
> /*<![CDATA[*/
> ...json-ld content...
> /*]]>*/
> or 
> //<![CDATA[
> ...json-ld content...
> //]]>
>  
> Currently we strip the CDATA markers, but not the leading and trailing 
> comments around them, which causes json-ld parsing to fail. This may be 
> related to issue #17. 
>  
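
For illustration, the two wrapper styles described above could be removed with
something like the following. This is only a minimal sketch and not the code
committed to Any23: the class name JsonLdCommentStripper and the method strip
are hypothetical, and it assumes the markers appear only at the very start and
end of the script content.

```java
import java.util.regex.Pattern;

// Hypothetical helper, not part of Any23. It removes an optional leading
// comment opener plus CDATA start marker, and the matching trailing
// CDATA end marker, from the text of a json-ld script block.
public final class JsonLdCommentStripper {

    // Start of input: optional block-comment or line-comment opener, the
    // CDATA start marker, then an optional block-comment closer.
    private static final Pattern LEADING = Pattern.compile(
            "^\\s*(?:/\\*|//)?\\s*<!\\[CDATA\\[\\s*(?:\\*/)?\\s*");

    // End of input: optional comment opener, the CDATA end marker,
    // then an optional block-comment closer.
    private static final Pattern TRAILING = Pattern.compile(
            "\\s*(?:/\\*|//)?\\s*\\]\\]>\\s*(?:\\*/)?\\s*$");

    private JsonLdCommentStripper() {
    }

    // Returns the script content without the surrounding markers.
    // Content that has no markers is returned trimmed but otherwise unchanged.
    public static String strip(String script) {
        String s = LEADING.matcher(script).replaceFirst("");
        s = TRAILING.matcher(s).replaceFirst("");
        return s.trim();
    }
}
```

Anchoring both patterns to the ends of the input means that text inside the
json-ld payload itself is left alone.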



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
