I have a somewhat mixed feeling regarding the organization of Wikipedia. It is 
true that if you use it as a human, you will find what you need in a reasonable 
time (usually using search box and following direct category links). But from 
the POV of a KB engineer, its organization is very far from being perfect. Let 
me just give you an example: administrative categories in Wikipedia. These are 
categories that should not be normally displayed to the end-user. You would 
expect that there is one, maybe two means of expressing that given category is 
administrative (e.g. a container category and a template). But there are at 
least several ways of stating that a given category is administrative - 
sometimes this information is only stated in the contents of the category.

Another example - eponymous categories. You have Cat_main template, which is 
used to state that there is a corresponding article for a given category. You 
also have Main template, which has similar meaning. But in majority of the 
situations the link between the cat. and article is only provided as a link at 
the beginning of the category contents or as the first item on the list of 
category articles. 

So I have doubts regarding construction of a well-structured ontology that 
would emerge e.g. from the usage of infobox templates in Wikipedia. I am not 
saying that Cyc or Umbel are perfect (they are not), there are also 
duplications and unnecessary classes there. But Cyc has been constructed for 
more than 20 years and I believe that many of the problems we are discussing 
here were already discussed during Cyc creation. Let me just say that you have 
3 concepts in Cyc that correspond to "church":
* #$ChurchService
* #$Church-Building
* #$Church-LocalCongregation

I am not saying that the mapping we will produce in May or June will provide 
the correct classes in all the cases (this is just not feasible), but I am 
saying that the necessary concepts are already present in Cyc (and Umbel). And 
if they are really missing we will provided them and attach to the rich 
structure of Cyc/Umbel.

Kind regards,
Aleksander

---- Wł. Śr, 16 kwi 2014 06:22:44 +0200 Peter F. 
Patel-Schneider<peter.patel-schnei...@nuance.com> napisał(a) ---- 


I agree that it seems harder to crowdsource ontologies. However, Wikipedia 
seems to have a half-decent organization, so maybe it is possible. My view is 
that Wikipedia is succeeding, and not just in overall organization, because 
there are Wikipedia editors that challenge and remove incorrect and incoherent 
information and also spend time oncleanup tasks. I think that any 
crowdsourced artifact needs to have participants that spend considerable time 
on these tasks.

It appears that the DBpedia ontologyhas not been subject to much of this kind 
of activity. Just today I went through the DBpedia class taxnomy and marked 
classes that were arguably misplaced. I found over 50 out of about 500 
non-sport classes(and the sport classes probably all need some attention).

The misplacement of DBpediaclasses is not justa problem with the DBpedia 
ontology, of course, as the DBpedia taxonomy is used to generate type 
statements for all DBpedia resources. The only aspect of DBpedia that makes 
this not quite so severe a problem is that many of the misplaced classes have 
few or no instances. However, some of the misplaced classes, e.g., 
FictionalCharacter, ChessPlayer, PokerPlayer, Saint, Religious, Monarch, 
Medician, Galaxy, Restaurant, Country, Grape, and Venue, have a significant 
number of instances.

peter

On 04/15/2014 07:16 PM, Mike Bergman wrote:
> Hi Peter,
>
> My observation is that crowdsourced knowledge bases (namely, Wikipedia, 
> DBpedia, schema.org, Freebase, etc) can be excellent sources for the 
> description and characterization of things and entities, but the 
structures 
> that may be derived from them will by definition be incoherent at the TBox 
> level.
>
> Exhortations to many contributors to be more coherent at a structural 
level 
> are not likely, I believe, to meet with much success. The motivations of 
> contributors and editors are most often local within the KB space. Thus, 
in 
> microcosm, many parts of these KBs can look pretty good, but when the 
scope 
> extends more broadly across the KB, the coherence breaks down. There 
aren't 
> many advocates for structure-wide coherence.
>
> As an advocate for structure-wide coherence and one who is not afraid to 
> wade into the fray, perhaps you can work some useful magic. I'm dubious, 
but 
> I truly wish you luck.
>
> Our approach, which we have been working on for some years episodically, 
> with another episode due shortly, is to use a coherent structure (UMBEL, 
in 
> our approach, which is a faithful, simplified subset of Cyc) to provide 
the 
> TBox, and then to find defensible ways to map the entity and concept 
> information in crowdsourced KBs to that structure.
>
> We have been talking about this for so long that it is time for us to 
> complete our initial development and put something forward that you and 
> others can similarly scrutinize. We hope to have something useful by this 
> summer.
>
> Thanks, Mike
>
> On 4/15/2014 6:55 PM, Patel-Schneider, Peter wrote:
>> Hmm.
>>
>> Well, perhaps one could argue that there should be no hierarchy at all.
>>
>> Using the DBpedia ontology does commit you to a lot of things, many of 
them 
>> quite questionable. For example, in the DBpedia ontology churches are 
>> buildings, which is not true for many churches, and not even true for 
the 
>> physical location associated with many churches. This is one of the 
things 
>> that I think needs to be changed.
>>
>> peter
>>
>>
>> On Apr 15, 2014, at 4:57 PM, Paul Houle <ontolo...@gmail.com>
>> wrote:
>>
>>> I try not to get hung up on the idea of having one right hierarchy 
but
>>> assume most end users will need to interpret the types that exist 
in
>>> the way that makes sense for what they are doing.
>>>
>>> The idea of foaf:Agent, which is a superclass of both person and
>>> organization, is a powerful concept because of properties shared by
>>> these two "things"; for instance, either can be a party to a
>>> lawsuit. Even in music you could say a brand like "Michael Jackson"
>>> is a team effort.
>>>
>>> On the other hand some people want :Flutist to be a subclass of
>>> :Person and it makes sense to say one person who plays the flute 
is a
>>> :Flutist but you can't say that a trio that all plays the flute is 
a
>>> :Flutist. Everybody has some theorem they expect the system to 
prove
>>> and they won't accept your axiom set unless you can prove their
>>> theorem with it.
>>>
>>> On Tue, Apr 15, 2014 at 6:51 PM, Patel-Schneider, Peter
>>> <peter.patel-schnei...@nuance.com> wrote:
>>>> schema.org is controlled by the schema.org partners, Google, 
Yahoo!, 
>>>> Bing, and Yandex. Contributions from the community are 
accepted, but are 
>>>> vetted before being added to schema.org
>>>>
>>>> See http://schema.org for more information.
>>>>
>>>>
>>>> One problem with alignment to schema.org is that the formal 
meaning of 
>>>> the schema.org ontology is unusual and not fully explained.
>>>>
>>>> peter
>>>>
>>>>
>>>>
>>>> On Apr 14, 2014, at 10:08 PM, 小出 誠二 
<seijikoi...@gmail.com> wrote:
>>>>
>>>>> Dear Peter
>>>>>
>>>>> For eyes of ontologists, it is well known that DBpedia 
ontology is
>>>>> incorrect, but I have never check about the contents of 
schema.org. Thank
>>>>> you for the info.
>>>>>
>>>>> I am planning to correct trustable ontologies like 
schema.org, but I do not
>>>>> know how to revise or advice the contents of schema.org.
>>>>>
>>>>> Does anyone know it? Or does anyone have interest the 
portal sites of
>>>>> trustable ontologies?
>>>>>
>>>>> Seiji Koide
>>>>>
>>>>> -----Original Message-----
>>>>> From: Patel-Schneider, Peter 
[mailto:peter.patel-schnei...@nuance.com]
>>>>> Sent: Tuesday, April 15, 2014 1:50 PM
>>>>> To: dbpedia-discussion@lists.sourceforge.net
>>>>> Subject: [Dbpedia-discussion] probably incorrect mapping 
to schema.org from
>>>>> MusicalArtist
>>>>>
>>>>> The ontology says that MusicalArtist is a subclass of 
>>>>> schema.org:MusicGroup.
>>>>>
>>>>> This seemed very odd to me, but then I looked at 
schema.org and noticed 
>>>>> that
>>>>> schema.org:MusicGroup can also be for *solo* artists. 
However,
>>>>> MusicalArtist is for any musical artist, not just 
soloists. So this mapping
>>>>> still looks incorrect.
>>>>>
>>>>> peter
>>
>> 
------------------------------------------------------------------------------
>> Learn Graph Databases - Download FREE O'Reilly Book
>> "Graph Databases" is the definitive new guide to graph databases and 
their
>> applications. Written by three acclaimed leaders in the field,
>> this first edition is now available. Download your free book today!
>> http://p.sf.net/sfu/NeoTech
>> _______________________________________________
>> Dbpedia-discussion mailing list
>> Dbpedia-discussion@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>>


------------------------------------------------------------------------------
Learn Graph Databases - Download FREE O'Reilly Book
"Graph Databases" is the definitive new guide to graph databases and their
applications. Written by three acclaimed leaders in the field,
this first edition is now available. Download your free book today!
http://p.sf.net/sfu/NeoTech
_______________________________________________
Dbpedia-discussion mailing list
Dbpedia-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion


------------------------------------------------------------------------------
Learn Graph Databases - Download FREE O'Reilly Book
"Graph Databases" is the definitive new guide to graph databases and their
applications. Written by three acclaimed leaders in the field,
this first edition is now available. Download your free book today!
http://p.sf.net/sfu/NeoTech
_______________________________________________
Dbpedia-discussion mailing list
Dbpedia-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to