I will find out more about the UniProt subgraph that we used for the
VLDB paper, and see if we can make it available.
However, I really like Eric Jain's offer of providing stable data sets
of different sizes for benchmarking. It makes sense to me to have an
independent organization providing the data sets.
Susie
Eric Miller wrote:
On Feb 8, 2006, at 6:22 AM, Eric Jain wrote:
Ian Wilson wrote:
We will thus want to maintain a local copy of this extract (on the
wiki?) so changes in the graph don't change the benchmarking results.
The data in http://www.isb-sib.ch/~ejain/rdf/data/ is indeed updated
every two weeks, but I could also provide some more stable data sets
for benchmarking if there is interest, perhaps with 1M, 10M and 100M
triples?
I think this would be extremely useful for a variety of communities
trying to assess issues of scalability; the more "connected" graph
subsets available for testing, the better.
thanks in advance!
--
eric miller http://www.w3.org/people/em/
semantic web activity lead http://www.w3.org/2001/sw/
w3c world wide web consortium http://www.w3.org/