Thanks to all on their comments and info. I wanted to follow up on Rumi's
comment:
"http://live.dbpedia.org/sparql is currently pointing to
http://dbedia-live.openlinksw.com/sparql so contain the same data.
This data is updated regularly with the latest changes from Wikipedia itself."
If the 2 end points for live data contain the same data then how come the query
in my first email return such varying result (one returns a count of ~ 3
million while the other returns a count of ~63 million)?
Based on Juan's comment about, I also tried querying for a smaller subset and
see different results here as well. The query I used is:
PREFIX dbo: <http://dbpedia.org/ontology/>
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX dbpprop: <http://dbpedia.org/property/>
PREFIX dbo: <http://dbpedia.org/ontology/>
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX dbpprop: <http://dbpedia.org/property/>
SELECT ?wikiPageID count(*)
FROM <http://dbpedia.org>
WHERE
{ ?person rdf:type dbo:Person .
?person dbpprop:name ?name .
?person dbo:birthDate ?birthDate .
?person dbo:abstract ?abstract .
?person dbo:wikiPageID ?wikiPageID .
?person dbo:wikiPageRevisionID ?wikiPageRevisionID
OPTIONAL
{ ?person dbo:wikiPageModified ?wikiPageModified }
OPTIONAL
{ ?person dbo:wikiPageExtracted ?wikiPageExtracted }
FILTER langMatches(lang(?abstract), "en")
FILTER (?wikiPageID = 365352 || ?wikiPageID = 39972083)
}
GROUP BY ?wikiPageID
On http://live.dbpedia.org/sparql it shows:
wikiPageID callret-1
39972083 8
On http://dbpedia-live.openlinksw.com/sparql it shows:
wikiPageID callret-1
39972083 2
365352 544
Why is that happening?
~ Shruti
On Friday, June 27, 2014 7:39:39 AM, Kingsley Idehen <kide...@openlinksw.com>
wrote:
>
>
>On 6/27/14 9:52 AM, Rumi wrote:
>
>Hi Juan,
>>
>>On 27-Jun-14 9:06 AM, Juan Lucas Domínguez Rubio wrote:
>>
>charset=unicode" http-equiv=Content-Type>
>Hello,
>Very surprising results. The difference is so big that if you choose a random
>resource from the large result set, then it will probably not exist in the
>small result set. Have you tried?
>
>By the way, how do the live endpoints (http://live.dbpedia.org/sparql,
>http://dbpedia-live.openlinksw.com/sparql) compare to the standard endpoit
>(http://dbpedia.org/sparql) in terms of:
>
>- Amount of data
>- Response time
>- Availability
>
>Anybody knows about that?
>http://live.dbpedia.org/sparql is currently pointing to
>http://dbedia-live.openlinksw.com/sparql so contain the same data.
>This data is updated regularly with the latest changes from
Wikipedia itself.
>
>Statistics on the update process can be found at
>http://dbpedia-live.openlinksw.com/live/
>
>http://dbpedia.org/sparql/ is based on the 3.9 dataset as published by the
>dbpedia team.
>It is a static dataset that is refreshed about once a year ...
>
>So live.dbpedia.org has the newest data but in some cases
dbpedia.org has some extra data from other datasets that live will
not have.
>
>As for Response time and Availability, it is a best effort
project, so we make sure it runs most of the time, but we will not
give guarantees for uptime and availability.
>
>There are rate limiters and ACLs in place to make sure everyone
gets a fair shot at running queries.
>
>Those limits can cause problems for certain kind of analysis as
such queries may timeout before giving meaningful data back.
>In such cases we strongly recommend users to setup a version of
dbpedia in the cloud so they can have the same data but without
competing with queries from other users.
>
>Hope this helps.
>
>
>Best Regards,
>Rumi Kocis
>One little addition to what Rumi stated above, you can also try the
instance at: <http://lod.openlinksw.com> which provides a LOD Cloud cache.
Naturally, this will not be as up to date as the DBpedia-Live instance in
regards to data transformed from Wikipedia documents etc..
>
>
>-- Regards, Kingsley Idehen
Founder & CEO
OpenLink Software
Company Web: http://www.openlinksw.com Personal Weblog:
http://www.openlinksw.com/blog/~kidehen Twitter Profile:
https://twitter.com/kidehen Google+ Profile:
https://plus.google.com/+KingsleyIdehen/about LinkedIn Profile:
http://www.linkedin.com/in/kidehen
>
>------------------------------------------------------------------------------
>Open source business process management suite built on Java and Eclipse
>Turn processes into business applications with Bonita BPM Community Edition
>Quickly connect people, data, and systems into organized workflows
>Winner of BOSSIE, CODIE, OW2 and Gartner awards
>http://p.sf.net/sfu/Bonitasoft
>
>_______________________________________________
>Dbpedia-discussion mailing list
>Dbpedia-discussion@lists.sourceforge.net
>https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>
>
>
------------------------------------------------------------------------------
Open source business process management suite built on Java and Eclipse
Turn processes into business applications with Bonita BPM Community Edition
Quickly connect people, data, and systems into organized workflows
Winner of BOSSIE, CODIE, OW2 and Gartner awards
http://p.sf.net/sfu/Bonitasoft
_______________________________________________
Dbpedia-discussion mailing list
Dbpedia-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion