Hi all,
The Web of Linked
Data shouldn't be about mass crawling (search engine style) etc...
It has to be. How would you answer a query like "all offers for a book
written by a German author" without crawling the relevant data sets?
First question would be: which dataset has this information? Does amazon
has it, or does it needs to be linked to other people dataset where you
can find such information? (which brings all the question of
disambiguation of entities, etc...)
In any case, there are multiple ways to endup with more or less the same
result. Tell me if I am right, but I think that the current set of
related cartridges only get data from a book URL? So, it is just
converting data about a particular book, for a given URL, using some API
(amazon in this case).
What about search URLs, using search APIs from the same services? I can
certainly think about a cartridge that does just this: searching for
items, and returning the resultsets in RDF using some ontologies. And
then you use the current cartridge to get all the information about the
items you care about in the resultset.
One thing is sure is that the expressiveness of your queries is bound to
the expressiveness of the search API you query. So this is not the
answer to all problems.
But one question: is it realists to think that anyone could query all
amazon and ebay sites (US, CAN, and all the other countries) to convert
everything? And if it endups being the case, how synching and
maintenance could take place?
It really depends on the usecases, but there are much that can be done
by leveraging all APIs in systems such as the Virtuoso sponger. I think
that what you are talking about here will only happen when these
services will want it to happen.
Thanks,
Take care,
Fred