cc-ing Analytics list and Ariel who maintains dumps.
On Wed, Mar 2, 2016 at 8:31 AM, Gonzalo Diaz gonzalo.diaz@cs.ox.ac.uk wrote:
Dear Nuria Ruiz,
My name is Gonzalo Diaz, and I am a PhD student of Computer Science at the University of Oxford. You can see my profile here: https://www.cs.ox.ac.uk/people/gonzalo.diaz/
I am writing because I am currently working on a research project which would benefit from processing Wikipedia pagecount files.
On Monday, 29 February 2016, we began downloading pagecount files from http://dumps.wikimedia.org/other/pagecounts-raw/. For the next 48 hours we managed to download ~15 months of raw pagecount files, using 3 different computers, and 3 instances of "wget" on each computer (for a total of 9 concurrent downloads at any given moment).
Since this morning, however, we are no longer able to download the pagecount files. Furthermore, the site dumps.wikimedia.org seems down.
Hopefully, our downloads are not responsible for this. If they are, however, we would like to apologise for the inconvenience.
In any case, we would like to request permission to continue downloading the raw pagecount files, as soon as the site is back online.
I thank you very much for your time!
Kindest regards, Gonzalo Diaz John Mittermeier
I believe the dumps server was undergoing maintenance today. Last email I saw was it was up and running again.
Ariel -- maybe cc this list (analytics-l) when you are announcing a maintenance window?
thanks,
-Toby
On Wed, Mar 2, 2016 at 11:20 AM, Nuria Ruiz nuria@wikimedia.org wrote:
cc-ing Analytics list and Ariel who maintains dumps.
On Wed, Mar 2, 2016 at 8:31 AM, Gonzalo Diaz gonzalo.diaz@cs.ox.ac.uk wrote:
Dear Nuria Ruiz,
My name is Gonzalo Diaz, and I am a PhD student of Computer Science at the University of Oxford. You can see my profile here: https://www.cs.ox.ac.uk/people/gonzalo.diaz/
I am writing because I am currently working on a research project which would benefit from processing Wikipedia pagecount files.
On Monday, 29 February 2016, we began downloading pagecount files from http://dumps.wikimedia.org/other/pagecounts-raw/. For the next 48 hours we managed to download ~15 months of raw pagecount files, using 3 different computers, and 3 instances of "wget" on each computer (for a total of 9 concurrent downloads at any given moment).
Since this morning, however, we are no longer able to download the pagecount files. Furthermore, the site dumps.wikimedia.org seems down.
Hopefully, our downloads are not responsible for this. If they are, however, we would like to apologise for the inconvenience.
In any case, we would like to request permission to continue downloading the raw pagecount files, as soon as the site is back online.
I thank you very much for your time!
Kindest regards, Gonzalo Diaz John Mittermeier
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Hi,
I noticed the maintenance email was announced at https://lists.wikimedia.org/pipermail/xmldatadumps-l/2016-March/001262.html but it'd be helpful to CC this list as well.
Bo
On Wed, Mar 2, 2016 at 11:26 AM, Toby Negrin tnegrin@wikimedia.org wrote:
I believe the dumps server was undergoing maintenance today. Last email I saw was it was up and running again.
Ariel -- maybe cc this list (analytics-l) when you are announcing a maintenance window?
thanks,
-Toby
On Wed, Mar 2, 2016 at 11:20 AM, Nuria Ruiz nuria@wikimedia.org wrote:
cc-ing Analytics list and Ariel who maintains dumps.
On Wed, Mar 2, 2016 at 8:31 AM, Gonzalo Diaz gonzalo.diaz@cs.ox.ac.uk wrote:
Dear Nuria Ruiz,
My name is Gonzalo Diaz, and I am a PhD student of Computer Science at the University of Oxford. You can see my profile here: https://www.cs.ox.ac.uk/people/gonzalo.diaz/
I am writing because I am currently working on a research project which would benefit from processing Wikipedia pagecount files.
On Monday, 29 February 2016, we began downloading pagecount files from http://dumps.wikimedia.org/other/pagecounts-raw/. For the next 48 hours we managed to download ~15 months of raw pagecount files, using 3 different computers, and 3 instances of "wget" on each computer (for a total of 9 concurrent downloads at any given moment).
Since this morning, however, we are no longer able to download the pagecount files. Furthermore, the site dumps.wikimedia.org seems down.
Hopefully, our downloads are not responsible for this. If they are, however, we would like to apologise for the inconvenience.
In any case, we would like to request permission to continue downloading the raw pagecount files, as soon as the site is back online.
I thank you very much for your time!
Kindest regards, Gonzalo Diaz John Mittermeier
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Just FYI, Ariel is probably not going to see this until the morning. But I can confirm that if there was maintenance being done, the server seems to not have come back up correctly.
Original Message From: Bo Han Sent: Wednesday, March 2, 2016 14:29 To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics. Reply To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics. Cc: Ariel Glenn WMF; John Mittermeier; Gonzalo Diaz Subject: Re: [Analytics] Requesting access to Wikimedia Pageview Dumps for Research
Hi,
I noticed the maintenance email was announced at https://lists.wikimedia.org/pipermail/xmldatadumps-l/2016-March/001262.html but it'd be helpful to CC this list as well.
Bo
On Wed, Mar 2, 2016 at 11:26 AM, Toby Negrin tnegrin@wikimedia.org wrote:
I believe the dumps server was undergoing maintenance today. Last email I saw was it was up and running again.
Ariel -- maybe cc this list (analytics-l) when you are announcing a maintenance window?
thanks,
-Toby
On Wed, Mar 2, 2016 at 11:20 AM, Nuria Ruiz nuria@wikimedia.org wrote:
cc-ing Analytics list and Ariel who maintains dumps.
On Wed, Mar 2, 2016 at 8:31 AM, Gonzalo Diaz gonzalo.diaz@cs.ox.ac.uk wrote:
Dear Nuria Ruiz,
My name is Gonzalo Diaz, and I am a PhD student of Computer Science at the University of Oxford. You can see my profile here: https://www.cs.ox.ac.uk/people/gonzalo.diaz/
I am writing because I am currently working on a research project which would benefit from processing Wikipedia pagecount files.
On Monday, 29 February 2016, we began downloading pagecount files from http://dumps.wikimedia.org/other/pagecounts-raw/. For the next 48 hours we managed to download ~15 months of raw pagecount files, using 3 different computers, and 3 instances of "wget" on each computer (for a total of 9 concurrent downloads at any given moment).
Since this morning, however, we are no longer able to download the pagecount files. Furthermore, the site dumps.wikimedia.org seems down.
Hopefully, our downloads are not responsible for this. If they are, however, we would like to apologise for the inconvenience.
In any case, we would like to request permission to continue downloading the raw pagecount files, as soon as the site is back online.
I thank you very much for your time!
Kindest regards, Gonzalo Diaz John Mittermeier
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
_______________________________________________ Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
It is back up and was announced so via email to the various lists. I'm sorry I didn't think to add analytics on those, I did spam several lists already. As the initial mail said, you can pick up anything a day or more old from one of our mirrors, or from ms1001.wikimedia.org during outages like this.
Ariel
On Wed, Mar 2, 2016 at 9:48 PM, Dan Andreescu dandreescu@wikimedia.org wrote:
Just FYI, Ariel is probably not going to see this until the morning. But I can confirm that if there was maintenance being done, the server seems to not have come back up correctly.
Original Message From: Bo Han Sent: Wednesday, March 2, 2016 14:29 To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics. Reply To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics. Cc: Ariel Glenn WMF; John Mittermeier; Gonzalo Diaz Subject: Re: [Analytics] Requesting access to Wikimedia Pageview Dumps for Research
Hi,
I noticed the maintenance email was announced at https://lists.wikimedia.org/pipermail/xmldatadumps-l/2016-March/001262.html but it'd be helpful to CC this list as well.
Bo
On Wed, Mar 2, 2016 at 11:26 AM, Toby Negrin tnegrin@wikimedia.org wrote:
I believe the dumps server was undergoing maintenance today. Last email I saw was it was up and running again.
Ariel -- maybe cc this list (analytics-l) when you are announcing a maintenance window?
thanks,
-Toby
On Wed, Mar 2, 2016 at 11:20 AM, Nuria Ruiz nuria@wikimedia.org wrote:
cc-ing Analytics list and Ariel who maintains dumps.
On Wed, Mar 2, 2016 at 8:31 AM, Gonzalo Diaz gonzalo.diaz@cs.ox.ac.uk wrote:
Dear Nuria Ruiz,
My name is Gonzalo Diaz, and I am a PhD student of Computer Science at the University of Oxford. You can see my profile here: https://www.cs.ox.ac.uk/people/gonzalo.diaz/
I am writing because I am currently working on a research project which would benefit from processing Wikipedia pagecount files.
On Monday, 29 February 2016, we began downloading pagecount files from http://dumps.wikimedia.org/other/pagecounts-raw/. For the next 48
hours we
managed to download ~15 months of raw pagecount files, using 3
different
computers, and 3 instances of "wget" on each computer (for a total of 9 concurrent downloads at any given moment).
Since this morning, however, we are no longer able to download the pagecount files. Furthermore, the site dumps.wikimedia.org seems down.
Hopefully, our downloads are not responsible for this. If they are, however, we would like to apologise for the inconvenience.
In any case, we would like to request permission to continue
downloading
the raw pagecount files, as soon as the site is back online.
I thank you very much for your time!
Kindest regards, Gonzalo Diaz John Mittermeier
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Thank you Ariel, if you could ping analytics@ in the future also it will be great.
On Wed, Mar 2, 2016 at 12:32 PM, Ariel Glenn WMF ariel@wikimedia.org wrote:
It is back up and was announced so via email to the various lists. I'm sorry I didn't think to add analytics on those, I did spam several lists already. As the initial mail said, you can pick up anything a day or more old from one of our mirrors, or from ms1001.wikimedia.org during outages like this.
Ariel
On Wed, Mar 2, 2016 at 9:48 PM, Dan Andreescu dandreescu@wikimedia.org wrote:
Just FYI, Ariel is probably not going to see this until the morning. But I can confirm that if there was maintenance being done, the server seems to not have come back up correctly.
Original Message From: Bo Han Sent: Wednesday, March 2, 2016 14:29 To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics. Reply To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics. Cc: Ariel Glenn WMF; John Mittermeier; Gonzalo Diaz Subject: Re: [Analytics] Requesting access to Wikimedia Pageview Dumps for Research
Hi,
I noticed the maintenance email was announced at
https://lists.wikimedia.org/pipermail/xmldatadumps-l/2016-March/001262.html but it'd be helpful to CC this list as well.
Bo
On Wed, Mar 2, 2016 at 11:26 AM, Toby Negrin tnegrin@wikimedia.org wrote:
I believe the dumps server was undergoing maintenance today. Last email
I
saw was it was up and running again.
Ariel -- maybe cc this list (analytics-l) when you are announcing a maintenance window?
thanks,
-Toby
On Wed, Mar 2, 2016 at 11:20 AM, Nuria Ruiz nuria@wikimedia.org
wrote:
cc-ing Analytics list and Ariel who maintains dumps.
On Wed, Mar 2, 2016 at 8:31 AM, Gonzalo Diaz <gonzalo.diaz@cs.ox.ac.uk
wrote:
Dear Nuria Ruiz,
My name is Gonzalo Diaz, and I am a PhD student of Computer Science at the University of Oxford. You can see my profile here: https://www.cs.ox.ac.uk/people/gonzalo.diaz/
I am writing because I am currently working on a research project
which
would benefit from processing Wikipedia pagecount files.
On Monday, 29 February 2016, we began downloading pagecount files from http://dumps.wikimedia.org/other/pagecounts-raw/. For the next 48
hours we
managed to download ~15 months of raw pagecount files, using 3
different
computers, and 3 instances of "wget" on each computer (for a total of
9
concurrent downloads at any given moment).
Since this morning, however, we are no longer able to download the pagecount files. Furthermore, the site dumps.wikimedia.org seems
down.
Hopefully, our downloads are not responsible for this. If they are, however, we would like to apologise for the inconvenience.
In any case, we would like to request permission to continue
downloading
the raw pagecount files, as soon as the site is back online.
I thank you very much for your time!
Kindest regards, Gonzalo Diaz John Mittermeier
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Hi Nuria et al.,
I’m glad to hear that our downloads did not have anything to do with the site going down. Now that things are back up and running, it looks like everything is working again on our end.
Thank you for the help!
John and Gonzalo
John C. Mittermeier St Catherine’s College Manor Road Oxford OX1 3UJ Email: john.mittermeier@gmail.com mailto:john.mittermeier@gmail.com Twitter: jcmittermeier
On Mar 2, 2016, at 8:46 PM, Nuria Ruiz <nuria@wikimedia.org mailto:nuria@wikimedia.org> wrote:
Thank you Ariel, if you could ping analytics@ in the future also it will be great.
On Wed, Mar 2, 2016 at 12:32 PM, Ariel Glenn WMF <ariel@wikimedia.org mailto:ariel@wikimedia.org> wrote: It is back up and was announced so via email to the various lists. I'm sorry I didn't think to add analytics on those, I did spam several lists already. As the initial mail said, you can pick up anything a day or more old from one of our mirrors, or from ms1001.wikimedia.org http://ms1001.wikimedia.org/ during outages like this.
Ariel
On Wed, Mar 2, 2016 at 9:48 PM, Dan Andreescu <dandreescu@wikimedia.org mailto:dandreescu@wikimedia.org> wrote: Just FYI, Ariel is probably not going to see this until the morning. But I can confirm that if there was maintenance being done, the server seems to not have come back up correctly.
Original Message From: Bo Han Sent: Wednesday, March 2, 2016 14:29 To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics. Reply To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics. Cc: Ariel Glenn WMF; John Mittermeier; Gonzalo Diaz Subject: Re: [Analytics] Requesting access to Wikimedia Pageview Dumps for Research
Hi,
I noticed the maintenance email was announced at https://lists.wikimedia.org/pipermail/xmldatadumps-l/2016-March/001262.html https://lists.wikimedia.org/pipermail/xmldatadumps-l/2016-March/001262.html but it'd be helpful to CC this list as well.
Bo
On Wed, Mar 2, 2016 at 11:26 AM, Toby Negrin <tnegrin@wikimedia.org mailto:tnegrin@wikimedia.org> wrote:
I believe the dumps server was undergoing maintenance today. Last email I saw was it was up and running again.
Ariel -- maybe cc this list (analytics-l) when you are announcing a maintenance window?
thanks,
-Toby
On Wed, Mar 2, 2016 at 11:20 AM, Nuria Ruiz <nuria@wikimedia.org mailto:nuria@wikimedia.org> wrote:
cc-ing Analytics list and Ariel who maintains dumps.
On Wed, Mar 2, 2016 at 8:31 AM, Gonzalo Diaz <gonzalo.diaz@cs.ox.ac.uk mailto:gonzalo.diaz@cs.ox.ac.uk> wrote:
Dear Nuria Ruiz,
My name is Gonzalo Diaz, and I am a PhD student of Computer Science at the University of Oxford. You can see my profile here: https://www.cs.ox.ac.uk/people/gonzalo.diaz/ https://www.cs.ox.ac.uk/people/gonzalo.diaz/
I am writing because I am currently working on a research project which would benefit from processing Wikipedia pagecount files.
On Monday, 29 February 2016, we began downloading pagecount files from http://dumps.wikimedia.org/other/pagecounts-raw/ http://dumps.wikimedia.org/other/pagecounts-raw/. For the next 48 hours we managed to download ~15 months of raw pagecount files, using 3 different computers, and 3 instances of "wget" on each computer (for a total of 9 concurrent downloads at any given moment).
Since this morning, however, we are no longer able to download the pagecount files. Furthermore, the site dumps.wikimedia.org http://dumps.wikimedia.org/ seems down.
Hopefully, our downloads are not responsible for this. If they are, however, we would like to apologise for the inconvenience.
In any case, we would like to request permission to continue downloading the raw pagecount files, as soon as the site is back online.
I thank you very much for your time!
Kindest regards, Gonzalo Diaz John Mittermeier
Analytics mailing list Analytics@lists.wikimedia.org mailto:Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics https://lists.wikimedia.org/mailman/listinfo/analytics
Analytics mailing list Analytics@lists.wikimedia.org mailto:Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics https://lists.wikimedia.org/mailman/listinfo/analytics
Analytics mailing list Analytics@lists.wikimedia.org mailto:Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics https://lists.wikimedia.org/mailman/listinfo/analytics
Analytics mailing list Analytics@lists.wikimedia.org mailto:Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics https://lists.wikimedia.org/mailman/listinfo/analytics
Hi Gonzalo,
I believe that yesterday we had to perform some maintenance tasks causing the issue that you were experiencing, they should be gone now, can you double check? There are no issues in consuming the data, but please be a good citizen avoiding to send tons of requests to our servers at the same time :)
Regards,
Luca
On Wed, Mar 2, 2016 at 8:20 PM, Nuria Ruiz nuria@wikimedia.org wrote:
cc-ing Analytics list and Ariel who maintains dumps.
On Wed, Mar 2, 2016 at 8:31 AM, Gonzalo Diaz gonzalo.diaz@cs.ox.ac.uk wrote:
Dear Nuria Ruiz,
My name is Gonzalo Diaz, and I am a PhD student of Computer Science at the University of Oxford. You can see my profile here: https://www.cs.ox.ac.uk/people/gonzalo.diaz/
I am writing because I am currently working on a research project which would benefit from processing Wikipedia pagecount files.
On Monday, 29 February 2016, we began downloading pagecount files from http://dumps.wikimedia.org/other/pagecounts-raw/. For the next 48 hours we managed to download ~15 months of raw pagecount files, using 3 different computers, and 3 instances of "wget" on each computer (for a total of 9 concurrent downloads at any given moment).
Since this morning, however, we are no longer able to download the pagecount files. Furthermore, the site dumps.wikimedia.org seems down.
Hopefully, our downloads are not responsible for this. If they are, however, we would like to apologise for the inconvenience.
In any case, we would like to request permission to continue downloading the raw pagecount files, as soon as the site is back online.
I thank you very much for your time!
Kindest regards, Gonzalo Diaz John Mittermeier
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics