Re: [Dbpedia-discussion] gender extraction from Wikipedia

2014-12-01 Thread Volha Bryl
The GenderExtractor "as is" was run during the last extraction, but the resulting dataset - available at [1] - contains only 4K triples, which seems to suggest that the extractor doesn't work correctly. Volha [1] http://data.dws.informatik.uni-mannheim.de/dbpedia/2014/en/ On 12/1/2014 4:48 PM

Re: [Dbpedia-discussion] gender extraction from Wikipedia

2014-12-01 Thread Ruben Verborgh
Hi Pablo, Maybe first some context: I was asked by the Dutch DBpedia chapter to perform this work. However, not everybody (including me) was aware of this existing code. When I reported back, Dimitris send me a pointer to it. > Do you think this could improve on the current Gender extractor that

Re: [Dbpedia-discussion] gender extraction from Wikipedia

2014-12-01 Thread Pablo N. Mendes
Hi Ruben, Do you think this could improve on the current Gender extractor that Max and I created? We'd love to have it improved. Why don't you send a pull request over there? https://github.com/dbpedia/extraction-framework/blob/master/core/src/main/scala/org/dbpedia/extraction/mappings/GenderExtra

[Dbpedia-discussion] gender extraction from Wikipedia

2014-12-01 Thread Ruben Verborgh
Dear all, This weekend, I quickly experimented with gender extraction from the Dutch Wikipedia. A summary of the approach and results is available here: http://ruben.verborgh.org/blog/2014/11/30/distinguishing-between-frank-and-nancy/ The highlights are: - I extracted 52,686 gender indications w