Dear all, I tried recently to extract data from a french Wiktionary dump with the extractor of the community on github.
But this create strange data like this, for every word. Few data about every word, word from other languages are parsed too. I don't know if it's normal. <http://wiktionary.dbpedia.org/resource/encyclopédie> <http://usefulinc.com/ns/doap#creator> <http://de.wiktionary.org/w/index.php?title=encyclopédie&action=history> . <http://wiktionary.dbpedia.org/resource/encyclopédie> <http://www.monnet-project.eu/lemon#sense> <http://wiktionary.dbpedia.org/resource/encyclopédie> . <http://wiktionary.dbpedia.org/resource/encyclopédie> <http://www.w3.org/2000/01/rdf-schema#label> "encyclopédie"^^<http://www.w3.org/2001/XMLSchema#string> . <http://wiktionary.dbpedia.org/resource/encyclopédie> <http://www.w3.org/2000/01/rdf-schema#seeAlso> <http://de.wiktionary.org/wiki/encyclopédie> . <http://wiktionary.dbpedia.org/resource/encyclopédie> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://wiktionary.dbpedia.org/terms/LexicalEntity> . <http://wiktionary.dbpedia.org/resource/encyclopédie> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.monnet-project.eu/lemon#LexicalSense> . <http://wiktionary.dbpedia.org/resource/encyclopédie> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.monnet-project.eu/lemon#LexicalEntry> . <http://wiktionary.dbpedia.org/resource/encyclopédie> <http://wiktionary.dbpedia.org/terms/statistics> "7-139"^^<http://www.w3.org/2001/XMLSchema#string> . <http://wiktionary.dbpedia.org/resource/accueil> <http://usefulinc.com/ns/doap#creator> <http://de.wiktionary.org/w/index.php?title=accueil&action=history> . <http://wiktionary.dbpedia.org/resource/accueil> <http://www.monnet-project.eu/lemon#sense> <http://wiktionary.dbpedia.org/resource/accueil> . <http://wiktionary.dbpedia.org/resource/accueil> <http://www.w3.org/2000/01/rdf-schema#label> "accueil"^^<http://www.w3.org/2001/XMLSchema#string> . <http://wiktionary.dbpedia.org/resource/accueil> <http://www.w3.org/2000/01/rdf-schema#seeAlso> <http://de.wiktionary.org/wiki/accueil> . <http://wiktionary.dbpedia.org/resource/accueil> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.monnet-project.eu/lemon#LexicalEntry> . <http://wiktionary.dbpedia.org/resource/accueil> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.monnet-project.eu/lemon#LexicalSense> . <http://wiktionary.dbpedia.org/resource/accueil> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://wiktionary.dbpedia.org/terms/LexicalEntity> . <http://wiktionary.dbpedia.org/resource/accueil> <http://wiktionary.dbpedia.org/terms/statistics> "7-112"^^<http://www.w3.org/2001/XMLSchema#string> . If you have any idea of what is happening. With regards Raphaël Boyer WIMMICS TEAM INRIA France
------------------------------------------------------------------------------ Presto, an open source distributed SQL query engine for big data, initially developed by Facebook, enables you to easily query your data on Hadoop in a more interactive manner. Teradata is also now providing full enterprise support for Presto. Download a free open source copy now. http://pubads.g.doubleclick.net/gampad/clk?id=250295911&iu=/4140
_______________________________________________ Dbpedia-discussion mailing list Dbpedia-discussion@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion