[ 
https://issues.apache.org/jira/browse/ANY23-326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16336452#comment-16336452
 ] 

Lewis John McGibbney commented on ANY23-326:
--------------------------------------------

Correct, okey-doke. When I run with the master branch I get the following:

{code}

<?xml version="1.0" encoding="UTF-8" ?>
<response>
<extractors>
<extractor>html-mf-xfn</extractor>
<extractor>html-rdfa11</extractor>
</extractors>
<report>
<message/>
<error/>
<issueReport>
<extractorIssues extractor="html-rdfa11">
<issue level="WARNING" row="-1" col="-1">Can't resolve term profile</issue>
<issue level="WARNING" row="-1" col="-1">Can't resolve term profile</issue>
<issue level="WARNING" row="-1" col="-1">Can't resolve term authorization_endpoint</issue>
<issue level="WARNING" row="-1" col="-1">Can't resolve term webmention</issue>
<issue level="WARNING" row="-1" col="-1">Can't resolve term token_endpoint</issue>
<issue level="WARNING" row="-1" col="-1">Can't resolve term micropub</issue>
<issue level="WARNING" row="-1" col="-1">Can't resolve term home</issue>
<issue level="ERROR" row="-1" col="-1">The element type "input" must be terminated by the matching end-tag "</input>".</issue>
</extractorIssues>
</issueReport>
<validationReport>
<issues>
</issues>
<ruleActivations>
</ruleActivations>
<errors>
</errors>
</validationReport>
</report>
<data>
<![CDATA[
# OUTPUT FORMAT: Turtle (mimeTypes=text/turtle, application/x-turtle; ext=ttl)
# BEGIN: ExtractionContext(urn:x-any23:html-mf-xfn:root-extraction-result-id:https://ben.thatmustbe.me/note/2017/12/28/1)
@prefix xfn: <http://microformats.org/wiki/xfn/> .
@prefix rdf: <https://www.w3.org/TR/REC-rdf-syntax#> .
@prefix foaf: <http://xmlns.com/foaf/spec/> .

_:node1c3bnm00nx6041 <http://vocab.sindice.com/xfn#mePage> 
<http://www.facebook.com/dissolve333> .

<https://ben.thatmustbe.me/note/2017/12/28/1> 
<http://vocab.sindice.com/xfn#me-hyperlink> 
<http://www.facebook.com/dissolve333> .

_:node1c3bnm00nx6041 <http://vocab.sindice.com/xfn#mePage> 
<http://twitter.com/dissolve333> .

<https://ben.thatmustbe.me/note/2017/12/28/1> 
<http://vocab.sindice.com/xfn#me-hyperlink> <http://twitter.com/dissolve333> .

_:node1c3bnm00nx6041 <http://vocab.sindice.com/xfn#mePage> 
<https://github.com/dissolve/> .

<https://ben.thatmustbe.me/note/2017/12/28/1> 
<http://vocab.sindice.com/xfn#me-hyperlink> <https://github.com/dissolve/> .

_:node1c3bnm00nx6041 a <http://xmlns.com/foaf/0.1/Person> ;
    <http://vocab.sindice.com/xfn#mePage> 
<https://ben.thatmustbe.me/note/2017/12/28/1> .
# BEGIN: ExtractionContext(urn:x-any23:html-rdfa11:root-extraction-result-id:https://ben.thatmustbe.me/note/2017/12/28/1)

<https://ben.thatmustbe.me/note/2017/12/28/1#headBanner> 
<http://www.w3.org/1999/xhtml/vocab#role> 
<http://www.w3.org/1999/xhtml/vocab#banner> .

<https://ben.thatmustbe.me/note/2017/12/28/1#content> 
<http://www.w3.org/1999/xhtml/vocab#role> 
<http://www.w3.org/1999/xhtml/vocab#main> .

<https://ben.thatmustbe.me/note/2017/12/28/1#secondary> 
<http://www.w3.org/1999/xhtml/vocab#role> 
<http://www.w3.org/1999/xhtml/vocab#complementary> .
# END: ExtractionContext(urn:x-any23:html-rdfa11:root-extraction-result-id:https://ben.thatmustbe.me/note/2017/12/28/1)
# END: ExtractionContext(urn:x-any23:html-mf-xfn:root-extraction-result-id:https://ben.thatmustbe.me/note/2017/12/28/1)
]]>
</data>
</response>

{code}
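
The ERROR entry in the report above has the wording a strict XML parser uses when it meets an HTML void element such as an unclosed input tag. The sketch below is only an illustration of that general behaviour using Python's standard library; it is not Any23 code, and the SNIPPET markup and the helper names (strict_xml_error, lenient_html_tags) are hypothetical and exist only for this example.

```python
import xml.etree.ElementTree as ET
from html.parser import HTMLParser

# Hypothetical markup: an HTML form with an unclosed (void) <input> element.
SNIPPET = '<form><input type="text" name="q"></form>'


def strict_xml_error(markup):
    """Return the strict XML parser's error message, or None if well-formed."""
    try:
        ET.fromstring(markup)
    except ET.ParseError as exc:
        return str(exc)
    return None


class TagCollector(HTMLParser):
    """Lenient HTML parser that records every start tag it sees."""

    def __init__(self):
        super().__init__()
        self.tags = []

    def handle_starttag(self, tag, attrs):
        self.tags.append(tag)


def lenient_html_tags(markup):
    """Return the start tags an HTML-aware parser finds in the markup."""
    collector = TagCollector()
    collector.feed(markup)
    collector.close()
    return collector.tags


if __name__ == "__main__":
    # The XML parser rejects the unclosed <input>; the HTML parser accepts it.
    print("strict XML error:", strict_xml_error(SNIPPET))
    print("lenient HTML tags:", lenient_html_tags(SNIPPET))
```

The strict parser reports a well-formedness error for the snippet, while the HTML-aware parser still sees both the form and input tags. Whether this matches the code path inside the html-rdfa11 extractor is an assumption and would need to be confirmed against the Any23 source.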

> parsing unclosed meta and input tags fails
> ------------------------------------------
>
>                 Key: ANY23-326
>                 URL: https://issues.apache.org/jira/browse/ANY23-326
>             Project: Apache Any23
>          Issue Type: Bug
>          Components: CLI
>    Affects Versions: 2.1
>         Environment: ubuntu 17.04
>            Reporter: Ben Roberts
>            Priority: Major
>             Fix For: 2.2
>
>
> Parsing fails as soon as it hits an unclosed input or meta tag. As an example, 
> try
>  ./bin/any23 rover https://ben.thatmustbe.me/note/2017/12/28/1
> [Fatal Error] :170:3: The element type "input" must be terminated by the 
> matching end-tag "</input>".
>  
> It seems like the issue might be that this is using a very old version of 
> jsoup, at least as best I could tell.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
