On Sun, Feb 21, 2016 at 4:25 PM, Markus Krötzsch <markus@semantic-mediawiki.org> wrote:
On 21.02.2016 20:37, Tom Morris wrote:
On Sun, Feb 21, 2016 at 11:41 AM, Markus Krötzsch <markus@semantic-mediawiki.org> wrote:

    On 18.02.2016 15:59, Lydia Pintscher wrote:

        Thomas, Denny, Sebastian, Thomas, and I have published a paper
        which was accepted for the industry track at WWW 2016. It covers
        the migration from Freebase to Wikidata. You can now read it here:
        http://research.google.com/pubs/archive/44818.pdf

    Is it possible that you have actually used the flawed statistics
    from the Wikidata main page regarding the size of the project? 14.5M
    items in Aug 2015 seems far too low a number. Our RDF exports from
    mid August already contained more than 18.4M items. It would be nice
    to get this fixed at some point. There are currently almost 20M
    items, and the main page still shows only 16.5M.

Numbers are off throughout the paper.  They also quote 48M instead of
58M topics for Freebase and mischaracterize some other key points. The
key number is that 3.2 billion facts for 58 million topics have generated
106,220 new statements for Wikidata. If my calculator had more decimal
places, I could tell you what percentage that is.

Obviously, any tool can only import statements for which we have items and properties at all, so the number of importable facts is much lower.

Obviously, but "much lower" from 3.2B is probably something like 50M-300M, not 0.1M.
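
For anyone who wants the percentage spelled out, here is a quick sketch in Python. It uses only the figures quoted in this thread (3.2 billion Freebase facts, 106,220 new Wikidata statements, and the rough 50M-300M guess above, which is an estimate and not a measurement). The variable and function names are made up for illustration.

```python
# Figures as quoted in this thread; none of them are independently verified here.
FREEBASE_FACTS = 3_200_000_000   # "3.2 billion facts"
NEW_STATEMENTS = 106_220         # new Wikidata statements reported
ESTIMATE_LOW = 50_000_000        # lower end of the rough "importable" guess
ESTIMATE_HIGH = 300_000_000      # upper end of the rough "importable" guess


def percent(part, whole):
    """Return part as a percentage of whole."""
    return 100.0 * part / whole


actual_pct = percent(NEW_STATEMENTS, FREEBASE_FACTS)
low_pct = percent(ESTIMATE_LOW, FREEBASE_FACTS)
high_pct = percent(ESTIMATE_HIGH, FREEBASE_FACTS)

print(f"actual yield:       {actual_pct:.4f}% of Freebase facts")
print(f"rough guess (low):  {low_pct:.4f}%")
print(f"rough guess (high): {high_pct:.4f}%")

# How far the reported yield falls short of the guessed range.
print(f"shortfall factor:   {ESTIMATE_LOW / NEW_STATEMENTS:.0f}x to "
      f"{ESTIMATE_HIGH / NEW_STATEMENTS:.0f}x")
```

So the reported yield is about 0.0033% of the Freebase facts, against roughly 1.6% to 9.4% if the 50M-300M guess is anywhere near right.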

Tom