On Sun, Feb 21, 2016 at 4:25 PM, Markus Krötzsch <
mar...@semantic-mediawiki.org> wrote:

> On 21.02.2016 20:37, Tom Morris wrote:
>
>> On Sun, Feb 21, 2016 at 11:41 AM, Markus Krötzsch
>> <mar...@semantic-mediawiki.org <mailto:mar...@semantic-mediawiki.org>>
>> wrote:
>>
>>     On 18.02.2016 15:59, Lydia Pintscher wrote:
>>
>>         Thomas, Denny, Sebastian, Thomas, and I have published a paper
>>         which was
>>         accepted for the industry track at WWW 2016. It covers the
>> migration
>>         from Freebase to Wikidata. You can now read it here:
>>         http://research.google.com/pubs/archive/44818.pdf
>>
>>     Is it possible that you have actually used the flawed statistics
>>     from the Wikidata main page regarding the size of the project? 14.5M
>>     items in Aug 2015 seems far too low a number. Our RDF exports from
>>     mid August already contained more than 18.4M items. It would be nice
>>     to get this fixed at some point. There are currently almost 20M
>>     items, and the main page still shows only 16.5M.
>>
>> Numbers are off throughout the paper.  They also quote 48M instead of
>> 58M topics for Freebase and mischaracterize some other key points. They
>> key number is that 3.2 billion facts for 58 million topics has generated
>> 106,220 new statements for Wikidata. If my calculator had more decimal
>> places, I could tell you what percentage that is.
>>
>
> Obviously, any tool can only import statements for which we have items and
> properties at all, so the number of importable facts is much lower.


Obviously, but "much lower" from 3.2B is probably something like 50M-300M,
not 0.1M.

Tom
_______________________________________________
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata

Reply via email to