Cc-ing the Analytics list and Ariel, who maintains the dumps.

On Wed, Mar 2, 2016 at 8:31 AM, Gonzalo Diaz <gonzalo.diaz@cs.ox.ac.uk> wrote:
Dear Nuria Ruiz,

My name is Gonzalo Diaz, and I am a PhD student in Computer Science at the University of Oxford. You can see my profile here: https://www.cs.ox.ac.uk/people/gonzalo.diaz/

I am writing because I am currently working on a research project which would benefit from processing Wikipedia pagecount files.

On Monday, 29 February 2016, we began downloading pagecount files from http://dumps.wikimedia.org/other/pagecounts-raw/. Over the following 48 hours we downloaded ~15 months of raw pagecount files, using 3 different computers with 3 instances of "wget" on each (a total of 9 concurrent downloads at any given moment).

Since this morning, however, we have no longer been able to download the pagecount files. Furthermore, the site dumps.wikimedia.org seems to be down.

Hopefully, our downloads are not responsible for this. If they are, however, we would like to apologise for the inconvenience.

In any case, we would like to request permission to continue downloading the raw pagecount files as soon as the site is back online.

I thank you very much for your time!

Kindest regards,
Gonzalo Diaz
John Mittermeier