[Dbpedia-discussion] ImageExtractor issue

2012-03-19 Thread Amit Kumar
Hi, We have been trying to set up an instance of DBpedia to continuously extract data from Wikipedia dumps/updates. While going through the output we observed that the image extractor was only picking up the first image for any page. I can see commented-out code present in the ImageExtractor

Re: [Dbpedia-discussion] ImageExtractor issue

2012-03-19 Thread Pablo Mendes
Hi Amit, "We have been trying to set up an instance of DBpedia to continuously extract data from Wikipedia dumps/updates. While ..." We would like to do the same for DBpedia Portuguese. If you can share any code, it would be much appreciated. Cheers, Pablo On Mar 19, 2012 10:38 AM, Amit Kumar

Re: [Dbpedia-discussion] ImageExtractor issue

2012-03-19 Thread Amit Kumar
Hi Pablo, For the continuous extraction we are trying to set up a pipeline which polls and downloads the Wikipedia data, passes it through DEF (DBpedia Extraction Framework), and then creates knowledge bases. Much of the plumbing is handled by Yahoo! internal tools and platform, but there are some

Re: [Dbpedia-discussion] ImageExtractor issue

2012-03-19 Thread Dimitris Kontokostas
Hi Pablo, Amit, Although I didn't write the image extractor, I think that this is more a semantic matter than a technical one, and it was left this way intentionally. The first picture is usually the most representative of the article, and thus we use foaf:depiction. Other pictures might not be about

Re: [Dbpedia-discussion] ImageExtractor issue

2012-03-19 Thread Pablo Mendes
Amit, Both sound great! We'd love to have them contributed to the project. Cheers, Pablo On Mon, Mar 19, 2012 at 11:45 AM, Amit Kumar amitk...@yahoo-inc.com wrote: Hi Pablo, For the continuous extraction we are trying to set up a pipeline which polls and downloads the Wikipedia data, passes

Re: [Dbpedia-discussion] ImageExtractor issue

2012-03-19 Thread Jona Christopher Sahnwaldt
Hi all, we also wrote bash scripts that download the latest Wikipedia dumps [1][2] and import them into a database [3]. I wasn't around when we switched from bash to Scala, but I guess it was because we wanted code that can also run on Windows. Regards, JC [1]