the dumps do not contain any images, just the description text that goes along with them. Platonides got you a raw copy of the actual files
On Fri, Mar 18, 2016 at 6:31 AM, D. Hansen sammelaccount@tageskurier.de wrote:
Hi Platonides
First let me thank you very much!
99 GByte, how could that be? On https://dumps.wikimedia.org/commonswiki/20160305/ there are dumps like commonswiki-20160305-pages-articles-multistream.xml.bz2 with 6.7 GByte and that file should contain "Articles, templates, media/file descriptions, and primary meta-pages, in multiple bz2 streams, 100 pages per stream". Doesn't this include the svg-files, too? (Though I haven't found out how to get the contents out what it actually contains, because my GUI-programs on Win 7 Premium 64bit regularily crash when trying to open it. And trying to download the version splitted in 4 files now crashes my computer.)
If you say 99 GByte is correct, I will start downloading it. (I have downloaded such big files before with the openstreetmap data, and it worked.)
Best regards Dieter
Am 18.03.2016 00:42, schrieb Platonides:
Hello Dieter
That would be similarly hard for me than for you.
I have run the equivalent of:
mkdir dieter-svgs cd dieter-svgs wget
http://tools.wmflabs.org/heritage/commonswiki_svg_list-2016-03-02.txt.xzplat... time wget http://tools.wmflabs.org/heritage/commonswiki_svg_list-2016-03-02.txt.xz xz -d commonswiki_svg_list-2016-03-02.txt.xz time wget --force-directories -i commonswiki_svg_list-2016-03-02.txt cd .. tar -cjf dieter-svgs-2016-03-02.tar.bz2 dieter-svgs/
I have kept the directory structure in place (by using the --force-directories parameter) so that it isn't too stressful for the filesystem. Still, a GUI app will probably choke on it.
And after a quite long wait, here is the result: https://archivos.wikimedia.es/dieter-svgs-2016-03-02.tar.bz2 (99GB)
SHA256 checksum: 48966777cc5f5d733b2a1eaf4d11f86853e9545a56da329929c996e330b38e28 dieter-svgs-2016-03-02.tar.bz2
Best regards
Xmldatadumps-l mailing list Xmldatadumps-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l