Re: Oracle Uniprot RDF data set and benchmarks

Jim Hendler Wed, 08 Feb 2006 13:30:14 -0800


At 9:26 -0500 2/8/06, Susie Stephens wrote:

I will find out more about the Uniprot subgraph that we used for the VLDB paper, and see if we can make it available.
However, I really like Eric Jain's offer of providing stable datasets of different sizes for benchmarking. It makes sense to me to have an independent organization providing the data sets.
Susie

I love this idea, but I would go a bit further - be even nicer for us non-biologists if it also included some example queries to run (and maybe even the correct answer sets) - I think if that existed, we could push some of the triple store developers to use it as a benchmark, which would help both communities...

Eric Miller wrote:
 On Feb 8, 2006, at 6:22 AM, Eric Jain wrote:
 Ian Wilson wrote:
We will thus want to maintain a local copy of this extract (on the wiki?) so changes in the graph don't change the benchmarking results.
The data in http://www.isb-sib.ch/~ejain/rdf/data/ is indeed updated every two weeks, but I could also provide some more stable data sets for benchmarking if there is interest, perhaps with 1M, 10M and 100M triples?
I think this would be extremely useful for a variety of communities trying to assess issues of scalability; the more "connected" graph subsets for testing, the better.
 thanks in advance!

 -- eric miller                              http://www.w3.org/people/em/
 semantic web activity lead               http://www.w3.org/2001/sw/
 w3c world wide web consortium            http://www.w3.org/


--
Professor James Hendler                   Director
Joint Institute for Knowledge Discovery           301-405-2696
UMIACS, Univ of Maryland                          301-314-9734 (Fax)
College Park, MD 20742                    http://www.cs.umd.edu/~hendler
Web Log: http://www.mindswap.org/blog/author/hendler
