Hi all

I recently discovered that DBpedia unnecessarily encodes '(' and ')'
in URLs as '%28' and '%29'. Both
 RFC 1738  Uniform Resource Locators (URL) and
 RFC 2396  (URI Generic Syntax)
state that "(" and ")" can be used unencoded within URLs.
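
To illustrate the two spellings, here is a minimal Python sketch using only
the standard library. The variable names are mine, and it says nothing about
how DBpedia itself generates its URLs; it only shows that the encoded and
unencoded forms decode to the same title.

```python
from urllib.parse import quote, unquote

title = "Family_(biology)"

# By default quote() percent-encodes parentheses.
encoded = quote(title)

# Listing them as safe characters leaves them as they are.
# Note: passing safe="()" replaces the default safe="/", which is fine
# here because the title contains no slash.
unencoded = quote(title, safe="()")

print(encoded)    # Family_%28biology%29
print(unencoded)  # Family_(biology)

# Both spellings decode to the same string.
print(unquote(encoded) == unquote(unencoded))  # True
```

So a server that decodes the path before looking up the resource would treat
both variants as the same thing.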

In most cases a request with the unencoded variant of the URL
does not return any results,
e.g.
  http://dbpedia.org/resource/Family_%28biology%29
  http://dbpedia.org/resource/Family_(biology) -> not found
Wikipedia allows access with both variants.

In some cases DBpedia returns different information for the two variants
of the URL,
e.g.
  http://dbpedia.org/page/Republican_Party_%28United_States%29
  http://dbpedia.org/page/Republican_Party_(United_States)

As far as I can tell, this affects all URLs that contain '(' and ')'.

Something similar happens with URLs that include a "#",
e.g.
 http://dbpedia.org/resource/Midfielder%23Winger is a resource of its own
 http://dbpedia.org/resource/Midfielder#Winger returns the contents
for Midfielder
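
The "#" case differs from the parentheses case, because an unencoded "#"
starts the fragment part of a URL, and HTTP clients do not send the fragment
to the server. A minimal Python sketch of that split follows; the helper
name request_target is hypothetical and only used for illustration.

```python
from urllib.parse import urlsplit


def request_target(url: str) -> str:
    """Return the path (plus query, if any) of url, without the fragment.

    This approximates what an HTTP client puts on the request line.
    """
    parts = urlsplit(url)
    target = parts.path or "/"
    if parts.query:
        target += "?" + parts.query
    return target


encoded = "http://dbpedia.org/resource/Midfielder%23Winger"
literal = "http://dbpedia.org/resource/Midfielder#Winger"

print(request_target(encoded))     # /resource/Midfielder%23Winger
print(request_target(literal))     # /resource/Midfielder
print(urlsplit(literal).fragment)  # Winger
```

With the literal "#" the server only ever sees the request for Midfielder,
so here the two variants really are different URLs.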

best
Rupert

-- 
| Rupert Westenthaler             [email protected]
| Bodenlehenstraße 11                             ++43-699-11108907
| A-5500 Bischofshofen

_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
