lists.wikimedia.org
Sign In
Sign Up
Sign In
Sign Up
Manage this list
×
Keyboard Shortcuts
Thread View
j
: Next unread message
k
: Previous unread message
j a
: Jump to all threads
j l
: Jump to MailingList overview
2024
April
March
February
January
2023
December
November
October
September
August
July
June
May
April
March
February
January
2022
December
November
October
September
August
July
June
May
April
March
February
January
2021
December
November
October
September
August
July
June
May
April
March
February
January
2020
December
November
October
September
August
July
June
May
April
March
February
January
2019
December
November
October
September
August
July
June
May
April
March
February
January
2018
December
November
October
September
August
July
June
May
April
March
February
January
2017
December
November
October
September
August
July
June
May
April
March
February
January
2016
December
November
October
September
August
July
June
May
April
March
February
January
2015
December
November
October
September
August
July
June
May
April
March
February
January
2014
December
November
October
September
August
July
June
May
April
March
February
January
2013
December
November
October
September
August
July
June
May
April
March
February
January
2012
December
November
October
September
August
July
June
May
April
March
February
January
2011
December
November
October
September
August
July
June
May
April
March
February
January
2010
December
November
October
September
August
July
June
May
April
March
February
January
2009
December
November
October
September
August
July
June
May
List overview
Download
thread
[Xmldatadumps-l] Fwd: Divide XML dumps by page.page_namespace (and figure out what to do with the "pages-articles" dump)
Federico Leva (Nemo)
18 Jan 2017
18 Jan '17
5:52 a.m.
Input requested:
https://lists.wikimedia.org/pipermail/wikitech-l/2017-January/087393.html
,
https://phabricator.wikimedia.org/T99483
Personally I think that the main issue is the slowness of some of the tools people use (including
dumps.wikimedia.org
itself), so I tried to improve the docs a bit with what I learnt: *
https://meta.wikimedia.org/wiki/Data_dumps/Download_tools#Downloading_the_X…
*
https://meta.wikimedia.org/wiki/Data_dumps#Faster_archives_and_servers
Nemo P.s.: In the last few weeks I archived 20+ TiB of Wikimedia Commons files on Internet Archive and I'm continuing, see
http://archiveteam.org/index.php?title=Wikimedia_Commons
.
0
0
Reply
Back to the thread
Back to the list