Hi Dimitris, Thanks for answering my question!
By "the subcategory link is exposed", I mean that usually subcategories are declared in the template. So that in the dump xml pages, we only see the name of the template. But in this case, if you look at: https://en.wikipedia.org/wiki/Special:Export/Category:Films_shot_in_the_United_States the subcategory links like [[Category:Films shot in Alabama]] is directly exposed in the xml text. Just that. Is it intended to be like this, or is it a bug? I find that this is the case of pages with a clickable map. yang On Wed, May 4, 2016 at 11:36 PM, Dimitris Kontokostas <jimk...@gmail.com> wrote: > > > On Thu, May 5, 2016 at 12:46 AM, Yang Gao <yang....@snapchat.com> wrote: >> >> Hi Dimitris, >> >> Thanks for answering my question! I actually figured it out, that in >> most cases the subcategories are abstracted out into templates, and >> therefore category extractor would not touch the subcategories... >> >> In this case, the subcategory link is exposed, therefore the category >> extractor is confused... > > > Thanks Yang, > > The problem with the categories in templates is known from the beginning of > DBpedia. > btw, what exactly do you mean with "he subcategory link is exposed"? can you > give an example in case we miss something? > > Cheers, > Dimitris > >> >> >> Thanks anyway for your answer! >> >> Yang >> >> On Wed, May 4, 2016 at 12:50 AM, Dimitris Kontokostas <jimk...@gmail.com> >> wrote: >> > Hi, >> > >> > The wikipedia categories is a very big "hierarchy" with many levels (and >> > many cycles) >> > do you want only the very top level (root), exclude the leaf nodes or >> > something else ? >> > >> > On Tue, Apr 26, 2016 at 7:03 PM, Yang Gao <yang....@snapchat.com> wrote: >> >> >> >> Hi, >> >> >> >> When I run the ArticleCategoriesExtractor and the >> >> SkosCategoriesExtractor, I found that the extractor mixes >> >> subcategories as listed in the "Subcategories" section and the >> >> super-categories as listed at the bottom of the page. >> >> >> >> For example, for wiki page "Category:Films shot in the United States" >> >> in the link: >> >> https://en.wikipedia.org/wiki/Category:Films_shot_in_the_United_States >> >> >> >> the extractor puts together subcategory "Category:Films shot in >> >> Alabama‎" and super-category "Category:Films by country of shooting >> >> location". >> >> >> >> If I only want super category, could you kindly suggest me a solution >> >> without too much noise? Has anyone done that before? >> >> >> >> Thanks a lot for your help and support! >> >> >> >> yang >> > >> > >> > >> > >> > -- >> > Kontokostas Dimitris > > > > > -- > Kontokostas Dimitris ------------------------------------------------------------------------------ Find and fix application performance issues faster with Applications Manager Applications Manager provides deep performance insights into multiple tiers of your business applications. It resolves application problems quickly and reduces your MTTR. Get your free trial! https://ad.doubleclick.net/ddm/clk/302982198;130105516;z _______________________________________________ Dbpedia-developers mailing list Dbpedia-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dbpedia-developers