Dear all, 

I recently tried to extract data from a French Wiktionary dump with the
community's extractor on GitHub.

But this creates strange data like the following, for every word.
There is very little data about each word, and words from other languages are
parsed too. I don't know if this is normal.

<http://wiktionary.dbpedia.org/resource/encyclopédie> 
<http://usefulinc.com/ns/doap#creator> 
<http://de.wiktionary.org/w/index.php?title=encyclopédie&action=history> . 
<http://wiktionary.dbpedia.org/resource/encyclopédie> 
<http://www.monnet-project.eu/lemon#sense> 
<http://wiktionary.dbpedia.org/resource/encyclopédie> . 
<http://wiktionary.dbpedia.org/resource/encyclopédie> 
<http://www.w3.org/2000/01/rdf-schema#label> 
"encyclopédie"^^<http://www.w3.org/2001/XMLSchema#string> . 
<http://wiktionary.dbpedia.org/resource/encyclopédie> 
<http://www.w3.org/2000/01/rdf-schema#seeAlso> 
<http://de.wiktionary.org/wiki/encyclopédie> . 
<http://wiktionary.dbpedia.org/resource/encyclopédie> 
<http://www.w3.org/1999/02/22-rdf-syntax-ns#type> 
<http://wiktionary.dbpedia.org/terms/LexicalEntity> . 
<http://wiktionary.dbpedia.org/resource/encyclopédie> 
<http://www.w3.org/1999/02/22-rdf-syntax-ns#type> 
<http://www.monnet-project.eu/lemon#LexicalSense> . 
<http://wiktionary.dbpedia.org/resource/encyclopédie> 
<http://www.w3.org/1999/02/22-rdf-syntax-ns#type> 
<http://www.monnet-project.eu/lemon#LexicalEntry> . 
<http://wiktionary.dbpedia.org/resource/encyclopédie> 
<http://wiktionary.dbpedia.org/terms/statistics> 
"7-139"^^<http://www.w3.org/2001/XMLSchema#string> . 

<http://wiktionary.dbpedia.org/resource/accueil> 
<http://usefulinc.com/ns/doap#creator> 
<http://de.wiktionary.org/w/index.php?title=accueil&action=history> . 
<http://wiktionary.dbpedia.org/resource/accueil> 
<http://www.monnet-project.eu/lemon#sense> 
<http://wiktionary.dbpedia.org/resource/accueil> . 
<http://wiktionary.dbpedia.org/resource/accueil> 
<http://www.w3.org/2000/01/rdf-schema#label> 
"accueil"^^<http://www.w3.org/2001/XMLSchema#string> . 
<http://wiktionary.dbpedia.org/resource/accueil> 
<http://www.w3.org/2000/01/rdf-schema#seeAlso> 
<http://de.wiktionary.org/wiki/accueil> . 
<http://wiktionary.dbpedia.org/resource/accueil> 
<http://www.w3.org/1999/02/22-rdf-syntax-ns#type> 
<http://www.monnet-project.eu/lemon#LexicalEntry> . 
<http://wiktionary.dbpedia.org/resource/accueil> 
<http://www.w3.org/1999/02/22-rdf-syntax-ns#type> 
<http://www.monnet-project.eu/lemon#LexicalSense> . 
<http://wiktionary.dbpedia.org/resource/accueil> 
<http://www.w3.org/1999/02/22-rdf-syntax-ns#type> 
<http://wiktionary.dbpedia.org/terms/LexicalEntity> . 
<http://wiktionary.dbpedia.org/resource/accueil> 
<http://wiktionary.dbpedia.org/terms/statistics> 
"7-112"^^<http://www.w3.org/2001/XMLSchema#string> . 

Please let me know if you have any idea of what is happening.

With regards 

Raphaël Boyer 
WIMMICS TEAM 
INRIA France 
_______________________________________________
Dbpedia-discussion mailing list
Dbpedia-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
