On 7/23/12 9:53 AM, Yves Raimond wrote:
On Mon, Jul 23, 2012 at 2:19 PM, Dimitris Kontokostas
<kontokos...@informatik.uni-leipzig.de> wrote:

On Mon, Jul 23, 2012 at 3:37 PM, Yves Raimond <yves.raim...@gmail.com>
wrote:
So http://dbpedia.org/URIencoding is deprecated then?

No.
To be more specific, the bug/error only is in this external file
http://downloads.dbpedia.org/3.7/links/yago_links.nt.bz2
which was loaded in virtuoso. This file only contains (some) wrongly encoded
DBpedia resource URIs
Sorry to be a bit pushy, but that dump actually has URIs that are
formatted rightly according the URI encoding guidelines, with brackets
escaped, which is my main point.

Just to sum up:
  * URI encoding guidelines say brackets should be %-escaped
  * Yago dump has them %-escaped
  * DBpedia dump doesn't have them %-escaped

Which I hope explains why I find all that very confusing!

Conclusion, the dump at: http://downloads.dbpedia.org/3.7/links/yago_links.nt.bz2
which is based on Yago contains incorrect mappings, right?

Kingsley

Best,
y

If YAGO change
their URIs, I suppose that in the example below it will move to
http://dbpedia.org/page/Keith_Allen_(actor) which has all the other
information. But that one doesn't have its brackets escaped, which the
URI encoding rules say it should.

If that is the case, would it be possible to update that page to
describe the updated encoding rules?

Best,
y

Best,
Dimitris


On Mon, Jul 23, 2012 at 1:28 PM, Dimitris Kontokostas
<kontokos...@informatik.uni-leipzig.de> wrote:
well, it's a bug, so both :)
If you want to retrieve yago, some or all of them do not decode the '('
/
')'

If you wait a while for the new release this should be resolved

Best,
Dimitris


On Mon, Jul 23, 2012 at 1:56 PM, Yves Raimond <yves.raim...@gmail.com>
wrote:
Hello!

I am even more confused after reading that email :)

<quote>
In the example
   http://dbpedia.org/page/Republican_Party_%28United_States%29
   http://dbpedia.org/page/Republican_Party_(United_States)
the existence of the second is a bug. The URIs used in the YAGO dump
were not properly encoded before loading (as you can see this resource
only has YAGO properties). This will be fixed in the next release.
</quote>

In the example below and for lots of other URIs we're dealing with,
this is exactly the inverse. The %-encoded URI is the one appearing in
the YAGO dataset. The non-encoded URI seems to be the 'real' one.

So which URI should we be using?

Best,
y

On Mon, Jul 23, 2012 at 11:50 AM, Dimitris Kontokostas
<kontokos...@informatik.uni-leipzig.de> wrote:
Hi Yves,

This is a bug from the yago dataset. You can see in [1] for more
info.

Best,
Dimitris

[1] http://sourceforge.net/mailarchive/message.php?msg_id=27618543

On Mon, Jul 23, 2012 at 1:37 PM, Yves Raimond
<yves.raim...@gmail.com>
wrote:
Hello!

We keep hitting various URI-encoding related issues in the last
couple
of weeks. The rules at http://dbpedia.org/URIencoding make it clear
that brackets should be escaped. However for a number of resources
it
doesn't appear to be the case, e.g.

http://dbpedia.org/page/Keith_Allen_%28actor%29 (which has only a
bit
of the information - YAGO types)
vs.
http://dbpedia.org/page/Keith_Allen_(actor) (which has all the rest,
but no YAGO types).

Could it be caused by
http://wiki.dbpedia.org/Internationalization?v=8c8 developments?

Best,
Yves




------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and
threat landscape has changed and how IT managers can respond.
Discussions
will include endpoint security, mobile security and the latest in
malware
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Dbpedia-discussion mailing list
Dbpedia-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and
threat landscape has changed and how IT managers can respond. Discussions
will include endpoint security, mobile security and the latest in malware
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Dbpedia-discussion mailing list
Dbpedia-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion




--

Regards,

Kingsley Idehen 
Founder & CEO
OpenLink Software
Company Web: http://www.openlinksw.com
Personal Weblog: http://www.openlinksw.com/blog/~kidehen
Twitter/Identi.ca handle: @kidehen
Google+ Profile: https://plus.google.com/112399767740508618350/about
LinkedIn Profile: http://www.linkedin.com/in/kidehen





Attachment: smime.p7s
Description: S/MIME Cryptographic Signature

------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Dbpedia-discussion mailing list
Dbpedia-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to