But wouldn't it be better to keep the dump as it is, for those who don't want triple size (just inventing a number here), and have one separate, or even per-language, dump with just the automated descriptions, for those who want that?
On Mon Feb 09 2015 at 11:21:56 Daniel Kinzler daniel.kinzler@wikimedia.de wrote:
Am 09.02.2015 um 12:08 schrieb Magnus Manske:
Considering that "hardcoded" descriptions (written manually, or generated automatically) for all items in all ~290 languages would likely make up
most of
the data dump file, this seems somewhat impractical :-)
It's entirely practical, and apparently what at least some consumers of our dumps expect and desire.
-- Daniel Kinzler Senior Software Developer
Wikimedia Deutschland Gesellschaft zur Förderung Freien Wissens e.V.
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l