[
https://issues.apache.org/jira/browse/JENA-220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13227415#comment-13227415
]
Paolo Castagna commented on JENA-220:
-------------------------------------
What I would probably do and what makes more sense to me is to aggregate RDF
data crawling the relevant/needed HTML pages using tools available such as
Any23, java-rdfa, etc.
The RDF data crawled that way can be added to a TDB store and that used to
provide a SPARQL endpoint with Fuseki.
This way, you do not depend on external services (which might not be available)
and you have full control on the RDF data you need and you want to publish.
The notion of querying an HTML page with RDFa in it using SPARQL seems to me
not quite useful, since often the amount of data available in a single page is
limited and many use cases requires queries to span across pages anyway.
> General purpose SPARQL query engine uses W3C is uses online RDFa Distiller -
> so not usable behind a firewall
> ------------------------------------------------------------------------------------------------------------
>
> Key: JENA-220
> URL: https://issues.apache.org/jira/browse/JENA-220
> Project: Apache Jena
> Issue Type: Improvement
> Components: Fuseki
> Affects Versions: Fuseki 0.2.1
> Environment: I runnung Fuseki version 0.2.1-incubating-SNAPSHOT
> (Build date: 2012-03-03T05:07:05+0000) under Windows 7 Professional: java
> -jar fuseki-server.jar --mem /dataset
> Reporter: Tobias Trapp
> Priority: Minor
> Fix For: Fuseki 0.2.1
>
>
> When I'm using http://localhost:3030/sparql in Fuseki - I get an wrong
> results when working behind a firewall. The reason is simple: RDFa is
> distilled using the W3C service http://www.w3.org/2007/08/pyRdfa/ which you
> can see in the console or when HTTP sniffing. So the sofwtare is not usable
> in an intranet solution or behind firewalls. Is it possible to use another
> distiller, perhaps http://dev.w3.org/2004/PythonLib-IH/dist/pyRdfa.tar.gz
> using Jython?
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira