Hi Patrick,

Thank you.

Just to specify what I mean by broken IRI support. I know IRIs work in 
Virtuoso quite good, better than in most other RDF Stores and it's just 
the RDF/XML serializer that has a small encoding bug, but RDF/XML seems 
to be the default serialization for SPARQL answers.

People just use a common RDF framework, try to query the endpoint and 
get garbled results, after which they complain about the endpoint not 
working right.

I know you can specify another serialization format like N3 or Turtle or 
use a small hack and get the right encoding, but I found that out the 
hard way as most people who try to query any Internationalized DBpedia 
endpoint will do.

Kind Regards,
Alexandru

On 10/19/2011 05:08 PM, Patrick van Kleef wrote:
> Hi Alexandru,
>
>> It would be quite nice to get an answer about this issue from someone at
>> OpenLink since it seems that they do read this mailing list and this is
>> a known issue.
>> BTW I need to correct the title of this mail. The issue is not with the
>> DBpedia VAD, it is with Virtuoso itself since the SPARQL endpoint
>> returns the same garbled results. So at this time the Virtuoso IRI
>> handling is broken at least when using SPARQL .
>
> I have passed on your observation to the Virtuoso development team and 
> i am awaiting an answer.
>
>
> Patrick
> ---
> OpenLink Software
>
>> On 10/18/2011 09:29 AM, Dimitris Kontokostas wrote:
>>> Hi Alexandru,
>>>
>>> This is a known issue and we reported it to virtuoso ~9 months ago.
>>> Unfortunatelly we use debian packages for our installation which
>>> usually are a little behind from the latest releases, so we can't say
>>> if it is fixed
>>>
>>> But, IRIs cannot be 100% serialized in RDF/XML.
>>> So even if Virtuoso fixes the encoding, the rdf might still be invalid
>>>
>>> Regards,
>>> Dimitris
>>>
>>> On Mon, Oct 17, 2011 at 6:42 PM, Alexandru 
>>> Todor<[email protected]>  wrote:
>>>> Hi,
>>>>
>>>> I've recieved a mail a couple of weeks ago from some users of the 
>>>> German
>>>> DBpedia a few weeks ago who where reporting that they weren't getting
>>>> any results when querying the endpoint for URIs that contained German
>>>> umlauts(or any other utf8 characters). I reported the issue to the 
>>>> Jena
>>>> mailing list and they fixed it, but in the process we also 
>>>> discovered a
>>>> bug with Virtuoso.
>>>>
>>>> There is a problem with the IRI encoding in the DBpedia
>>>> Internationalization VAD. Namely when querying the SPARQL endpoint the
>>>> encoding of the IRIs in RDF/XML is garbled. The issue can be found in
>>>> both Greek and German endpoints.
>>>>
>>>> For example: http://de.dbpedia.org/data/Berlin-Dahlem.rdf , in the 
>>>> first
>>>> XML lines yo you will notice things linke
>>>> http://de.dbpedia.org/resource/Königin-Luise-Stiftung instead of
>>>> http://de.dbpedia.org/resource/Königin-Luise-Stiftung or
>>>> http://de.dbpedia.org/resource/Gernot_Michael_Müller instead of
>>>> http://de.dbpedia.org/resource/Gernot_Michael_Müller. You will notice
>>>> simmilar issues if you look at this resource from the Greek DBpedia:
>>>> http://el.dbpedia.org/data/Αλέξανδρος_ο_Μέγας.rdf .
>>>>
>>>> This problems is that when querying the Internationalization Endpoints
>>>> not only with Jena but with any other SPARQL client, the user is going
>>>> to getting garbled IRIs if they contain UTF8 characters.
>>>>
>>>>
>>>> Kind Regards,
>>>> Alexandru Todor
>>>>
>>>>
>>>> ------------------------------------------------------------------------------
>>>>  
>>>>
>>>> All the data continuously generated in your IT infrastructure 
>>>> contains a
>>>> definitive record of customers, application performance, security
>>>> threats, fraudulent activity and more. Splunk takes this data and 
>>>> makes
>>>> sense of it. Business sense. IT sense. Common sense.
>>>> http://p.sf.net/sfu/splunk-d2d-oct
>>>> _______________________________________________
>>>> Dbpedia-discussion mailing list
>>>> [email protected]
>>>> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>>>>
>>>
>>>
>>
>>
>> ------------------------------------------------------------------------------
>>  
>>
>> All the data continuously generated in your IT infrastructure contains a
>> definitive record of customers, application performance, security
>> threats, fraudulent activity and more. Splunk takes this data and makes
>> sense of it. Business sense. IT sense. Common sense.
>> http://p.sf.net/sfu/splunk-d2d-oct
>> _______________________________________________
>> Dbpedia-discussion mailing list
>> [email protected]
>> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>


------------------------------------------------------------------------------
All the data continuously generated in your IT infrastructure contains a
definitive record of customers, application performance, security
threats, fraudulent activity and more. Splunk takes this data and makes
sense of it. Business sense. IT sense. Common sense.
http://p.sf.net/sfu/splunk-d2d-oct
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to