Hi Peter and Marco, Except for a few minor points, I agree completely with the findings of Peter and Marco's comments. So, if the majority of us agree, let us start to do something about it. Since such an endeavour would be a good starting point for doing things better, how about trying to make the DBpedia ontology a little bit closer to existing classifications? For instance, those that exist in the field of describing scientific disciplines and areas of knowledge, like the UDC Linked Data Summary? (http://udcdata.info/ )
Regards, Gerard ________________________________________ Van: Marco Fossati [hell.j....@gmail.com] Verzonden: donderdag 10 april 2014 15:14 Aan: dbpedia-discussion@lists.sourceforge.net Onderwerp: Re: [Dbpedia-discussion] A quick analysis of the classes in the DBpedia ontology Hi Peter, Thank you for your detailed report. The DBpedia ontology is (a) crowdsourced and (b) follows a data-driven approach. Classes and properties are mainly derived from the actual data coming from different Wikipedia chapters. Those are the main reasons of the issues you mentioned. It would be great if you could contribute a deep analysis and detect the inconsistencies. In this way, we could clean the ontology up and provide rock solid semantics. As you already mention lots of examples, a brand new ontology and exact deltas with the current one would be highly beneficial. Cheers! On 4/10/14, 12:17 AM, Patel-Schneider, Peter wrote: > I did a quick analysis of the classes in the DBpedia ontology and found > quite a few issues that I think need attention. > > - Many classes have no instances. Each of these empty classes > should be examined to see whether they should be removed or modified. > > - The sports-related groupings are differentially populated, differentially > organized, and unaxiomatized. These groupings should be regularized and > minimal axiomatizations provided for them. For example, there would be > classes for Basketball under at least SportsLeague, SportsTeam, Coach, > and SportsEvent each defined as the restriction of the grouping elements > related to Basketball. The sports groupings include Sport (which is > special), SportsLeague, SportsTeam, Athlete, Coach, SportsTeamMember, > SportsManager, SportsEvent, SportFacility, SportCompetitionResult, > SportsSeason, and Tournament. > > - Numerous stated inclusion relationships are not correct when considering > the normal definition of the class names. Each of these should be > examined and either descriptions of the classes that support the > inclusion relationship be provided or the relationship itself modified. > For example, instances of the RecordOffice class do not appear to be > non-profit organizations. Some other examples of questionable or > outright incorrect subclasses here are TermOfOffice, BackScene, > ChessPlayer, PokerPlayer, TeamMember, Saint, FictionalCharacter, > MythologicalFigure, OrganisationMember, Religious, Baronet, Medician, > Professor, Embryology, Lymph, Constellation, Galaxy, ElectionDiagram, > Olympics, OlympicEvent, ControlledDesignationOfOriginWine, > PublicServiceInput, PublicServiceOutput, and ProgrammingLanguage. > > - Some class relationships are missing. For example, TeamMember is > unrelated to SportsTeamMember even though they are both supposed to be > members of athletic teams. Some other examples of missing relationships > are between BullFighter and Bullfighter, between Host and TelevisionHost, > and between Comic and Comics. The missing relationships should be > provided or the classes merged. > > - Place is a rather unnatural union. It should either be removed or > better organized. > > - There are quite a few subclasses of Building that are not truely > buildings, including AmusementParkAttraction, Casino, Factory, Hotel, > MilitaryStructure, Abbey and the other religious places of worship, > Restaurant, ShoppingMall, and Venue. Similarly, there are a number of > subclasses of ArchitecturalStructure that may not be architectural > structures, including Garden, PublicTransitSystem, and Park. There are a > few subclasses of NaturalPlace that are not necessarily natural > places, including Canal, and even Lake. These classes should be > moved up in the ontology. > > - The subclasses of Species are not collections of species. The > subclasses should either be modified or moved elsewhere in the > ontology. > > - The normal definition of PopulatedPlace is much too narrow to encompass > all its subclasses. A new general class should be created to > encompass the subclasses and PopulatedPlace be modified as necessary. > > - There are a number of strange top-level or second-level classes. These > classes should be examined to ensure that they make sense. Many of > these classes appear to be somehow related to measurements, including > Altitude, Area, Blazon, ChartsPlacement, Demographics, Depth, > GrossDomesticProduct, GrossDomesticProductPerCapita, > HumanDevelopmentIndex, Population, Sales, Statistics, and Tax. Other > strange classes include LifeCycleEvent, Imdb, Listen, PenaltyShootOut, > PersonFunction, PoliticalFunction, Profession, TopicalConcept, Type, and > YearInSpaceflight. > > Even if I had editing rights to the ontology I think that the fixes I have > outlined above go beyond what should be done without some discussion. > > Comments? > > peter > > > ------------------------------------------------------------------------------ > Put Bad Developers to Shame > Dominate Development with Jenkins Continuous Integration > Continuously Automate Build, Test & Deployment > Start a new project now. Try Jenkins in the cloud. > http://p.sf.net/sfu/13600_Cloudbees > _______________________________________________ > Dbpedia-discussion mailing list > Dbpedia-discussion@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion > -- Marco Fossati http://about.me/marco.fossati Twitter: @hjfocs Skype: hell_j ------------------------------------------------------------------------------ Put Bad Developers to Shame Dominate Development with Jenkins Continuous Integration Continuously Automate Build, Test & Deployment Start a new project now. Try Jenkins in the cloud. http://p.sf.net/sfu/13600_Cloudbees _______________________________________________ Dbpedia-discussion mailing list Dbpedia-discussion@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion Disclaimer Dit bericht met eventuele bijlagen is vertrouwelijk en uitsluitend bestemd voor de geadresseerde. Indien u niet de bedoelde ontvanger bent, wordt u verzocht de afzender te waarschuwen en dit bericht met eventuele bijlagen direct te verwijderen en/of te vernietigen. Het is niet toegestaan dit bericht en eventuele bijlagen te vermenigvuldigen, door te sturen, openbaar te maken, op te slaan of op andere wijze te gebruiken. Ordina N.V. en/of haar groepsmaatschappijen accepteren geen verantwoordelijkheid of aansprakelijkheid voor schade die voortvloeit uit de inhoud en/of de verzending van dit bericht. This e-mail and any attachments are confidential and are solely intended for the addressee. If you are not the intended recipient, please notify the sender and delete and/or destroy this message and any attachments immediately. It is prohibited to copy, to distribute, to disclose or to use this e-mail and any attachments in any other way. Ordina N.V. and/or its group companies do not accept any responsibility nor liability for any damage resulting from the content of and/or the transmission of this message. ------------------------------------------------------------------------------ Put Bad Developers to Shame Dominate Development with Jenkins Continuous Integration Continuously Automate Build, Test & Deployment Start a new project now. Try Jenkins in the cloud. http://p.sf.net/sfu/13600_Cloudbees _______________________________________________ Dbpedia-discussion mailing list Dbpedia-discussion@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion