This is the result for German Wikipedia: I ran the bot on German Wikipedia and wanted to add P31:5, but it seems more than 90% of Wikidata items already have a P31 statement (how?), so there was nothing I could do there. Instead, I got a list of the German Wikipedia articles that don't have an item in Wikidata. There were 16K articles, and the bot's output for each of them is here: <https://tools.wmflabs.org/dexbot/kian_res2.txt>. If you plot it, you get this: <https://tools.wmflabs.org/dexbot/kian2.png>. When the score is below 0.50, it is obvious that the article is not about a human. Between 0.50 and 0.61 there are 78 articles for which the bot can't determine whether the subject is a human or not [1], and articles scoring above 0.61 are definitely about humans. I used 0.62 as the threshold just to be sure and created 3600 items with P31:5 in them.
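To make the cut-offs above concrete, here is a minimal sketch of the thresholding step only. The 0.50 and 0.62 values come from this message; the function names, the labels, and the sample scores are hypothetical, and this is not the bot's actual code. Scores between 0.61 and 0.62 are treated as undetermined here, on the assumption that 0.62 was the effective cut-off.

```python
# Thresholds quoted in the message; everything else is illustrative.
NOT_HUMAN_BELOW = 0.50  # below this the article is clearly not about a human
HUMAN_FROM = 0.62       # conservative cut-off used before creating items


def classify(score):
    """Map a score in [0, 1] to one of three hypothetical labels."""
    if score < NOT_HUMAN_BELOW:
        return "not human"
    if score >= HUMAN_FROM:
        return "human"
    # The band in between is where the bot can't decide.
    return "undetermined"


def select_for_p31_5(scores):
    """Return the titles whose score is high enough to get P31:5."""
    return [title for title, score in scores.items()
            if classify(score) == "human"]


if __name__ == "__main__":
    # Made-up titles and scores, purely for illustration.
    sample = {"Article A": 0.91, "Article B": 0.12, "Article C": 0.57}
    for title, score in sample.items():
        print(title, score, classify(score))
    print(select_for_p31_5(sample))
```

Only the articles returned by `select_for_p31_5` would get a new item; the undetermined ones would be left for manual review.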
Imagine if I do something like that for English Wikipedia.

[1]: They are probably about a cat or tree with categories of humans in them.

Best

On Sun, Mar 8, 2015 at 3:07 AM, Amir Ladsgroup <ladsgr...@gmail.com> wrote:

> On Sat, Mar 7, 2015 at 9:19 PM, Jeroen De Dauw <jeroended...@gmail.com> wrote:
>
>> Hey,
>>
>> Yay, neural nets are definitely fun! Am I right in understanding this is
>> a software you created for the specific purpose of doing tasks in Wikidata?
>
> Yes, in Wikidata and Wikipedia.
>
>> > Congratulations for this bold step towards the Singularity :-)
>>
>> Don't worry, it'll be some time before AI can actually ingest Wikidata,
>> see https://dl.dropboxusercontent.com/u/7313450/entropy/aitraining.png
>>
>> Cheers
>>
>> --
>> Jeroen De Dauw - http://www.bn2vs.com
>> Software craftsmanship advocate
>> Evil software architect at Wikimedia Germany
>> ~=[,,_,,]:3
>
> --
> Amir

--
Amir
_______________________________________________ Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l