Thanks to all on their comments and info. I wanted to follow up on Rumi's 
comment:
"http://live.dbpedia.org/sparql is currently pointing to 
http://dbedia-live.openlinksw.com/sparql so contain the same data. 
This data is updated regularly with the latest changes from Wikipedia itself."


If the 2 end points for live data contain the same data then how come the query 
in my first email return such varying result (one returns a count of ~ 3 
million while the other returns a count of ~63 million)?

Based on Juan's comment about, I also tried querying for a smaller subset and 
see different results here as well. The query I used is:

PREFIX  dbo:  <http://dbpedia.org/ontology/>
PREFIX  rdf:  <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX  dbpprop: <http://dbpedia.org/property/>

PREFIX  dbo:  <http://dbpedia.org/ontology/>
PREFIX  rdf:  <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX  dbpprop: <http://dbpedia.org/property/>

SELECT  ?wikiPageID count(*)
FROM <http://dbpedia.org>
WHERE
  { ?person rdf:type dbo:Person .
    ?person dbpprop:name ?name .
    ?person dbo:birthDate ?birthDate .
    ?person dbo:abstract ?abstract .
    ?person dbo:wikiPageID ?wikiPageID .
    ?person dbo:wikiPageRevisionID ?wikiPageRevisionID
    OPTIONAL
      { ?person dbo:wikiPageModified ?wikiPageModified }
    OPTIONAL
      { ?person dbo:wikiPageExtracted ?wikiPageExtracted }
    FILTER langMatches(lang(?abstract), "en")
    FILTER (?wikiPageID = 365352 || ?wikiPageID = 39972083)
  }
GROUP BY ?wikiPageID

On http://live.dbpedia.org/sparql it shows:
wikiPageID        callret-1
39972083          8 

On http://dbpedia-live.openlinksw.com/sparql it shows:
wikiPageID        callret-1
39972083          2
365352            544


Why is that happening?

~ Shruti



On Friday, June 27, 2014 7:39:39 AM, Kingsley Idehen <kide...@openlinksw.com> 
wrote:
 

>
>
>On 6/27/14 9:52 AM, Rumi wrote:
>
>Hi Juan,
>>
>>On 27-Jun-14 9:06 AM, Juan Lucas Domínguez Rubio wrote:
>>
>charset=unicode" http-equiv=Content-Type> 
>Hello,
>Very surprising results. The difference is so big that if you choose a random 
>resource from the large result set, then it will probably not exist in the 
>small result set. Have you tried?
> 
>By the way, how do the live endpoints (http://live.dbpedia.org/sparql, 
>http://dbpedia-live.openlinksw.com/sparql) compare to the standard endpoit 
>(http://dbpedia.org/sparql) in terms of:
> 
>- Amount of data
>- Response time
>- Availability
> 
>Anybody knows about that?
>http://live.dbpedia.org/sparql is currently pointing to 
>http://dbedia-live.openlinksw.com/sparql so contain the same data. 
>This data is updated regularly with the latest changes from
      Wikipedia itself. 
>
>Statistics on the update process can be found at 
>http://dbpedia-live.openlinksw.com/live/
>
>http://dbpedia.org/sparql/ is based on the 3.9 dataset as published by the 
>dbpedia team. 
>It is a static dataset that is refreshed about once a year ...
>
>So live.dbpedia.org has the newest data but in some cases
      dbpedia.org has some extra data from other datasets that live will
      not have.
>
>As for Response time and Availability, it is a best effort
      project, so we make sure it runs most of the time, but we will not
      give guarantees for uptime and availability. 
>
>There are rate limiters and ACLs in place to make sure everyone
      gets a fair shot at running queries. 
>
>Those limits can cause problems for certain kind of analysis as
      such queries may timeout before giving meaningful data back. 
>In such cases we strongly recommend users to setup a version of
      dbpedia in the cloud so they can have the same data but without
      competing with queries from other users.
>
>Hope this helps.
>
>
>Best Regards,
>Rumi Kocis 
>One little addition to what Rumi stated above, you can also try the
    instance at: <http://lod.openlinksw.com> which provides a LOD Cloud cache. 
Naturally, this will not be as up to date as the DBpedia-Live instance in 
regards to data transformed from Wikipedia documents etc.. 
>
>
>--  Regards, Kingsley Idehen         
Founder & CEO 
OpenLink Software     
Company Web: http://www.openlinksw.com Personal Weblog: 
http://www.openlinksw.com/blog/~kidehen Twitter Profile: 
https://twitter.com/kidehen Google+ Profile: 
https://plus.google.com/+KingsleyIdehen/about LinkedIn Profile: 
http://www.linkedin.com/in/kidehen
>
>------------------------------------------------------------------------------
>Open source business process management suite built on Java and Eclipse
>Turn processes into business applications with Bonita BPM Community Edition
>Quickly connect people, data, and systems into organized workflows
>Winner of BOSSIE, CODIE, OW2 and Gartner awards
>http://p.sf.net/sfu/Bonitasoft
>
>_______________________________________________
>Dbpedia-discussion mailing list
>Dbpedia-discussion@lists.sourceforge.net
>https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>
>
>
------------------------------------------------------------------------------
Open source business process management suite built on Java and Eclipse
Turn processes into business applications with Bonita BPM Community Edition
Quickly connect people, data, and systems into organized workflows
Winner of BOSSIE, CODIE, OW2 and Gartner awards
http://p.sf.net/sfu/Bonitasoft
_______________________________________________
Dbpedia-discussion mailing list
Dbpedia-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to