Re: Analyzing the success of LOD

Kingsley Idehen Mon, 02 Mar 2009 05:57:06 -0800

Matthias Samwald wrote:

Andraz:
That the bubbles continue to grown is however a sociological
interesting phenomen :-)
And a good sign that something has gone right :)
Giovanni:
Maybe :-) but people do things for many other reason that "they'reright".
I think the LOD project is a great success. It is a very livelycommunity, there has been significant progress over the last year(amount of data, quality of underlying technologies such as Virtuoso).However, the community should take some time to analyze WHY it issuccessful, and why it is more successful than attempts of usingRDF/OWL before 2007. Some thoughts on this:
* The main ingredient to the success of LOD is that it is relativelycentralized. It would not work without DBpedia serving as the'nucleus' of the cloud. It would not work without someone dedicated todrawing the clould diagram that everyone is happy to show onPowerpoint slides. It would not work without this mailing list thatserves an open platform for the community. However, I have theimpression that some key persons in the LOD community might not behappy about this reason for success at all. For them, the LOD projectis a mere testing ground for the next generation of the entire web,and showing that linked data works in a decentralized way is a crucialaspect of this vision. The fact that the current LOD cloud wasactually produced in a rather centralized process, and that most ofthe valuable data sources in the LOD cloud are actually under thecontrol of a very small number of stakeholders, is seen as a transientblemish, at best.However, I think that this is a problematic situation, and we shouldembrace the semi-centralized nature of the LOD project, rather thanhiding it away. Having a close-knit group of stakeholders thatcontribute to a partly distributed, partly centralized knowledge basemight actually be a very interesting endeavor -- and it might be a wayto provide a clear incentive to participate. LOD could be a novel typeof open-source project, one that is not only concerned with code, butalso with the underlying data. The products of this open sourceproject could then be used in various kinds of projects, some of themwith commercial focus. In such a scenario, being the main stakeholderfor a certain subset of LOD might become profitable, and giveincentive to improve the data provided and controlled by eachstakeholder. This business model could be similar to that ofsuccessful open source content management systems such as Typo3 orDrupal, where the code is free, but providing consulting andcustomization for certain commercial users is based on financial support.I know that this idea of a 'LOD brand' counters the main motivation ofmost people in the community, but it might be the key to creating anincentive structure for providing linked data, improving data qualityand actually getting people to use the data. With the currentphilosophy, I see the danger of LOD staying a permanent 'proof ofconcept'. The concept has been proved by now.

Mattias,

I don't think your point of view is contrarian, it is certainly quite inline with my world view and aspirations re. the LOD effort :-)

* A good point by Giovanni is that mere interlinking of datasets waspossible since 1999 by re-using URIs, and that post-hoc mappingbetween datasets was possible since 2004, when owl:sameAs wasinvented. The linked data movement 'only' added the consensus thatHTTP URIs should be used, and that a HTTP GET request should yield asmall RDF subgraph, listing the RDF triples about the resource.Surely, this is a very practical thing for many reasons, but was itinstrumental for the success of LOD?

No.

At the moment, it seems that most *useful* applications of LOD dataare based on a central triple store created by the aggregation of someor all LOD data sources. In that case, one might ask whether thedereferenceable URIs are really an essential ingredient to the successor LOD, or just a 'good to have', but not essential, feature.

Like most things in the Linked Data realm, there isn't a single factor.The success is inherently connected to recombination and meshing.

DBpedia produced a corpus of "Names" endowed with de-referencable URIs.Thus, in a single project you ended up with a "Linked Data" meme proofof concept based on a familiar knowledgebae (Wikipedia).

Naturally, from DBpedia emerged the LOD cloud, and from the LOD cloud wenow have a much wider corpus of "Names" and a substrate for some seriousinnovation and value delivery.

De-referencable URIs, Negotable Representation of Resource Descriptions,and other elements of the Linked Data Web's FORCE as are simply there tobe tapped by current/next generation of innovators on the Web and/oracross the Enterprise en route to solving real problems. Examples areswould include:

1. Identity (decentralized and non-repudiatable variety via foaf+sslwhich is ultimately going to be sparql+ssl) -- then we can fix mail,commenting and other critical aspects of the Web and Internet2. Data Integration (across the Web, Intranets, and Extranets) --disparate schemas and dirty data are facts of life when dealing with anyDBMS system3. Open Data Access decoupled from Data Representation - for eons manyhad to deal with XDR hell and application specific representations ofDBMS query results.

LOD makes lookups and joins smarter and more powerful. When all is saidan done, beyond storage, DBMS exploitation is about Lookups, Views, andJoins. LOD now enables value delivery based on the aforementionedwithout exposing the intricacies of the LOD mesh. Basically, ourconversation don't have to start from the technical end anymore, westart with demonstrable value etc.

LOD is successful because it is full of pragmatists equipped withtechnical skills and broad industry experience :-)



Kingsley

Giovanni:
An alternative explanation i like is
http://inamidst.com/whits/2008/technobunkum
This is the second time I see this link on this mailing list. He makessome very good points about the importance of focusing on providingsolutions to problems, instead of becoming too tangled up intechnicalities. I also read his other text onhttp://inamidst.com/whits/2008/ambient which gives a lot of insightinto why he has abandoned Semantic Web technologies. I guess theproblems he likes to see solved are too trivial to require aparadigmatic change (such as a global trend towards RDF/OWL andlinked data). However, I would not generalize this experience to yieldthe conclusion that the Semantic Web is a huge case of 'Technobunkum'(what a silly term, by the way). The fact that not every tiny littleproblem on the web might be in need of Semantic Web technologies doesnot mean that these technologies are worthless. There are plenty ofreal use cases in important business segments and companies wherethere is dire need for such new technologies -- life science andhealth care come to mind. I have the feeling that the whole web 2.0hype of the recent years has distorted the perception of webdevelopers about what is actually of societal and economic importance.Creating yet another, slightly improved mashup between your Flickrphotos, Google maps and Wikipedia might actually not be the mostimportant problem of the world today. And it probably doesn't earn youmoney either. End of rant.
Cheers,
Matthias Samwald

DERI Galway, Ireland
http://deri.ie/

Konrad Lorenz Institute for Evolution & Cognition Research, Austria
http://kli.ac.at/



--


Regards,

Kingsley Idehen       Weblog: http://www.openlinksw.com/blog/~kidehen

President & CEOOpenLink Software Web: http://www.openlinksw.com

Re: Analyzing the success of LOD

Reply via email to