Hi Gonzalo,
I believe that yesterday we had to perform some maintenance tasks causing the issue that you were experiencing, they should be gone now, can you double check? There are no issues in consuming the data, but please be a good citizen avoiding to send tons of requests to our servers at the same time :)
Regards,
Luca
On Wed, Mar 2, 2016 at 8:20 PM, Nuria Ruiz nuria@wikimedia.org wrote:
cc-ing Analytics list and Ariel who maintains dumps.
On Wed, Mar 2, 2016 at 8:31 AM, Gonzalo Diaz gonzalo.diaz@cs.ox.ac.uk wrote:
Dear Nuria Ruiz,
My name is Gonzalo Diaz, and I am a PhD student of Computer Science at the University of Oxford. You can see my profile here: https://www.cs.ox.ac.uk/people/gonzalo.diaz/
I am writing because I am currently working on a research project which would benefit from processing Wikipedia pagecount files.
On Monday, 29 February 2016, we began downloading pagecount files from http://dumps.wikimedia.org/other/pagecounts-raw/. For the next 48 hours we managed to download ~15 months of raw pagecount files, using 3 different computers, and 3 instances of "wget" on each computer (for a total of 9 concurrent downloads at any given moment).
Since this morning, however, we are no longer able to download the pagecount files. Furthermore, the site dumps.wikimedia.org seems down.
Hopefully, our downloads are not responsible for this. If they are, however, we would like to apologise for the inconvenience.
In any case, we would like to request permission to continue downloading the raw pagecount files, as soon as the site is back online.
I thank you very much for your time!
Kindest regards, Gonzalo Diaz John Mittermeier
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics