Hello Platonides! Thank you for being do kind to make the list. Originally I wanted to download the svg-files whenever my program is using the file, one after another. Turned out, that's just not feasible.
Could I get a dump of all svg files, please? Not just the list of the files with their paths, but the svg-files themselves? Then I just have them locally, and I don't have to handle all the network exceptions that occur on my side too often.
That would be of a great help for me.
Greetings Dieter (not John, as I accidentaly wrote in the last post)
Am 03.03.2016 01:00, schrieb Platonides:
On 02/03/16 22:05, D. Hansen wrote:
Hi!
Last year I was provided very kindly with a list of all svg-files on commons, that is their then *real* http(s)-paths. (Either by John phoenixoverride@gmail.com or by Ariel T. Glenn aglenn@wikimedia.org aglenn@wikimedia.org.)
Could I get a current version of this dump, please? (With the real paths and really existing files.)
Back then the dump was
http://tools.wmflabs.org/betacommand-dev/reports/commonswiki_svg_list.txt.7z
as far as I remember.
(Someone told me I could create such a dump myself with some wiki-tools. Is this really possible?)
Greetings John
You should be able to extract such list with a query such as: SELECT CONCAT('https://upload.wikimedia.org/wikipedia/commons/', SUBSTRING(MD5(img_name), 1,1), '/', SUBSTRING(MD5(img_name), 1,2), '/', img_name) from image where img_media_type = 'DRAWING' AND img_major_mime='image' and img_minor_mime LIKE 'svg%';
which leads to 967065 images.
I've put a copy at http://tools.wmflabs.org/heritage/commonswiki_svg_list-2016-03-02.txt.xz for your convenience.