[ https://issues.apache.org/jira/browse/ANY23-169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16319587#comment-16319587 ]
Lewis John McGibbney commented on ANY23-169:
--------------------------------------------
I can confirm this bug using the current any23.org service, which returns the
following output:
{code}
<?xml version="1.0" encoding="UTF-8" ?>
<response>
<extractors>
<extractor>html-microdata</extractor>
<extractor>html-head-title</extractor>
</extractors>
<report>
<message/>
<error/>
<issueReport>
<extractorIssues extractor="html-rdfa11">
<issue level="ERROR" row="-1" col="-1">Attribute name "itemscope" associated
with an element type "span" must be followed by the ' = ' character.</issue>
</extractorIssues>
</issueReport>
<validationReport>
<issues>
</issues>
<ruleActivations>
</ruleActivations>
<errors>
</errors>
</validationReport>
</report>
<data>
<![CDATA[
# OUTPUT FORMAT: Turtle (mimeTypes=text/turtle, application/x-turtle; ext=ttl)
# BEGIN: ExtractionContext(urn:x-any23:html-microdata:root-extraction-result-id:https://ruben.verborgh.org/tmp/slash-test.html)
@prefix doac: <http://ramonantonio.net/doac/0.1/#> .
@prefix rdf: <https://www.w3.org/TR/REC-rdf-syntax#> .
@prefix foaf: <http://xmlns.com/foaf/spec/> .
# BEGIN: ExtractionContext(urn:x-any23:html-head-title:root-extraction-result-id:https://ruben.verborgh.org/tmp/slash-test.html)
@prefix dcterms: <http://purl.org/dc/terms/> .
<https://ruben.verborgh.org/tmp/slash-test.html> dcterms:title "Title" .
_:nodefea56f8ce507e9a90bac1fc5872b158 a <http://schema.org/Person> ;
<http://schema.org/Person/url>
<https://ruben.verborgh.org/tmp/slash-test.html//> ,
<https://ruben.verborgh.org/tmp/slash-test.html/other.html> .
<https://ruben.verborgh.org/tmp/slash-test.html>
<http://www.w3.org/1999/xhtml/microdata#item>
_:nodefea56f8ce507e9a90bac1fc5872b158 ;
dcterms:title "Title" .
# END: ExtractionContext(urn:x-any23:html-microdata:root-extraction-result-id:https://ruben.verborgh.org/tmp/slash-test.html)
# END: ExtractionContext(urn:x-any23:html-head-title:root-extraction-result-id:https://ruben.verborgh.org/tmp/slash-test.html)
]]>
</data>
</response>
{code}
> Incorrect interpretation of relative and absolute paths with Microdata
> ----------------------------------------------------------------------
>
> Key: ANY23-169
> URL: https://issues.apache.org/jira/browse/ANY23-169
> Project: Apache Any23
> Issue Type: Bug
> Components: microdata
> Reporter: Ruben Verborgh
> Labels: microdata, url, urls
> Fix For: 2.2
>
>
> Parsing the following fragment located at
> http://ruben.verborgh.org/tmp/slash-test.html
> <a itemprop="url" href="/">Homepage</a>
> <a itemprop="url" href="other.html">Other</a>
> results in the URIs
> http://ruben.verborgh.org/tmp/slash-test.html//
> http://ruben.verborgh.org/tmp/slash-test.html/other.html
> instead of the correct
> http://ruben.verborgh.org/tmp/
> http://ruben.verborgh.org/tmp/other.html
> Note that there is no trailing slash in the original.
> Test case:
> http://ruben.verborgh.org/tmp/slash-test.html
> http://any23.org/any23/?format=best&uri=http%3A%2F%2Fruben.verborgh.org%2Ftmp%2Fslash-test.html&validation-mode=none
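
The difference between the reported and the expected URIs can be reproduced
outside Any23. The sketch below is only an illustration, not Any23's code:
`naive_join` is a hypothetical helper that treats the page URL as if it were a
directory, which happens to yield exactly the URIs seen in the output, while
`urljoin` from the Python standard library applies RFC 3986 reference
resolution. Under RFC 3986, `other.html` resolves to
http://ruben.verborgh.org/tmp/other.html as the report expects, while a bare
`/` resolves to the host root http://ruben.verborgh.org/ (the report lists
http://ruben.verborgh.org/tmp/ for it).

```python
from urllib.parse import urljoin

# Page URL from the report; note that it has no trailing slash.
BASE = "http://ruben.verborgh.org/tmp/slash-test.html"


def naive_join(base: str, href: str) -> str:
    """Hypothetical faulty resolver: appends href to the full page URL.

    This is an assumption about how such URIs could arise, not a copy of
    the Any23 implementation.
    """
    return base + "/" + href


def resolve(base: str, href: str) -> str:
    """Resolve href against base following RFC 3986 (via urllib)."""
    return urljoin(base, href)


if __name__ == "__main__":
    for href in ("/", "other.html"):
        print(f"href={href!r}")
        print("  naive   :", naive_join(BASE, href))
        print("  RFC 3986:", resolve(BASE, href))
```

The last path segment of the base (`slash-test.html`) is dropped before a
relative reference is merged, and a reference starting with `/` replaces the
whole path. A resolver that skips either step produces the doubled slash and
the `slash-test.html/other.html` path shown in the service output.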
--
This message was sent by Atlassian JIRA (v6.4.14#64029)