Hi,

following the discussion regarding deficiencies of the DBpedia ontology, I 
would like to make you know that a team of collaborators sponsored by 
Structured Dynamics is working on  classification of the Wikipedia articles 
into the Umbel [1] ontology. Yet another ontology, you may think, but Umbel is 
exceptional in the regard that it is entirely based on Cyc [2] - a more than 
20-year effort of building an ontology large enough to capture entire common 
sense knowledge. We address many of the problems that are present both in 
Wikipedia and DBpedia, e.g. things like list and disambiguation pages, 
administrative categories in Wikipedia, compound categories in Wikipedia, etc. 

The good news is that exactly this week we've produced the first "final" result 
of the classification, i.e. around 90% of the Wikipedia articles has a Umbel 
type (we use ~4k different types in the classification) assigned via a mapping 
between Wikipedia categories and Umbel types. I am writing in quotes, since 
that result still has to be validated and curated and we are probably not going 
to publish it just at the moment. Still, I expect some consumable results to be 
produced in May, so stay tuned.

What is more - regarding the problem of integrating the data from different 
data sources - I used Cyc, especially its conflict detection mechanism, in the 
past [3] to integrate classification results produced by various type 
assignment methods. For me it seems to be the best resource applicable in that 
problem, since besides the generalization relation, which is found in all the 
ontologies, most of the classes have disjointWith statements, even on a level 
of Cats and Dogs (i.e. individual species). As a result it is relatively easy 
to keep the data coherent.

Kind regards,
Aleksander

[1] http://umbel.org
[2] http://www.cyc.com/platform/opencyc
[3] http://ceur-ws.org/Vol-906/paper2.pdf

---- Wł. Śr, 09 kwi 2014 23:25:25 +0200 Kingsley 
Idehen<kide...@openlinksw.com> napisał(a) ---- 


On 4/9/14 4:53 PM, Paul Houle wrote: 
> The type assignments in DBpedia are very precise (few false 
> statements) but not accurate in the sense that recall is poor; many 
> things fall through the cracks. The real problem is that the the 
> mappings are the map, not the territory. Wikipedia is an 
> encyclopedia for humans, not for machines, so DBpedia has to parse 
> whatever unsane markup they give us. 
> 
> Systems like Wikidata and Freebase can be edited by machines and 
> human ontologists and get better recall for types. 
> 
> http://basekb.com/ 
> 
> is a conversion from Freebase to industry standard RDF. You could 
> use :BaseKB as a substitute for DBpedia, but DBpedia has advantages 
> too because in addition to the 4 million things important enough to be 
> in DBpedia, there is another 37 million unimportant things in :BaseKB 
> that matter only to librarians, video store clerks and professional 
> discographers. 
> 
> These unimportant things will drive you crazy unless you master 
> them, and the easiest way to turn down the noise is to restrict 
> search to the 4 million things. 
> 
> I could make you an RDF file that has statements such as 
> 
> ?dbpediaTopic a ?freebaseType . 
> 
> you could load that together with the rest of DBpedia. That would 
> get you a long way towards good lists. The trouble at this point is 
> that you don't have the freebase types connected to the DBpedia types 
> so you can't join them against the schema to find properties and such. 
> Mapping the types to the DBpedia types would not be that hard either, 
> since the two systems are well aligned. Then you get something that 
> looks like DBpedia but has more accurate types. 
> 
> Freebase has more accurate and better populated data for things 
> like ticker symbols, geo-coordinates, genders, birth dates and the 
> like. It would not be hard to rewrite Freebase statements to 
> 
> ?dbpediaTopic ?freebasePredicate ?anotherDbpediaTopic . 
> 
> and that would produce something that would be remarkably user friendly. 
 
:baseKB could (and maybe should) pitched as a human-and-machine curated 
bridge between Freebase, DBpedia, and Wikidata (I think). 
 
Have you considered mapping the classes and properties across DBpedia, 
Freebase, and Wikidata? 
 
 
-- 
 
Regards, 
 
Kingsley Idehen     
Founder & CEO 
OpenLink Software 
Company Web: http://www.openlinksw.com 
Personal Weblog: http://www.openlinksw.com/blog/~kidehen 
Twitter Profile: https://twitter.com/kidehen 
Google+ Profile: https://plus.google.com/+KingsleyIdehen/about 
LinkedIn Profile: http://www.linkedin.com/in/kidehen 
 
 
 
 
 
------------------------------------------------------------------------------ 
Put Bad Developers to Shame 
Dominate Development with Jenkins Continuous Integration 
Continuously Automate Build, Test & Deployment 
Start a new project now. Try Jenkins in the cloud. 
http://p.sf.net/sfu/13600_Cloudbees_______________________________________________
 
Dbpedia-discussion mailing list 
Dbpedia-discussion@lists.sourceforge.net 
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion 


------------------------------------------------------------------------------
Put Bad Developers to Shame
Dominate Development with Jenkins Continuous Integration
Continuously Automate Build, Test & Deployment 
Start a new project now. Try Jenkins in the cloud.
http://p.sf.net/sfu/13600_Cloudbees
_______________________________________________
Dbpedia-discussion mailing list
Dbpedia-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to