Greetings,
I am looking to do some year-end statistical summaries. I am aware of the over-reporting incident involving CentralAuth between August and December 2013:
https://docs.google.com/document/d/1kpJrfataS5KAxGXFoygQVhMlzFftjsvX9HktSAAK...
I know that Erik fixed/re-generated the files and fixed the numbers on the "wikistats" report card, but were the corrected "projectcount" files ever dumped anywhere? I'd like to re-run these through my system.
Hi Andrew,
http://dumps.wikimedia.org/other/pagecounts-ez/projectcounts/
'Hourly page views per wiki, corrected for site outages and underreporting. Also repackaged, as one tar file per year.'
Cheers, Erik
-----Original Message----- From: analytics-bounces@lists.wikimedia.org [mailto:analytics-bounces@lists.wikimedia.org] On Behalf Of Andrew G. West Sent: Wednesday, January 08, 2014 21:12 To: analytics@lists.wikimedia.org Subject: [Analytics] Corrected "projectcount" files
Greetings,
I am looking to do some year-end statistical summaries. I am aware of the over-reporting incident involving CentralAuth between August and December 2013:
https://docs.google.com/document/d/1kpJrfataS5KAxGXFoygQVhMlzFftjsvX9HktSAAK...
I know that Erik fixed/re-generated the files and fixed the numbers on the "wikistats" report card, but were the corrected "projectcount" files ever dumped anywhere? I'd like to re-run these through my system.
-- Andrew G. West, PhD Research Scientist Verisign Labs - Reston, VA
_______________________________________________ Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Thanks for the response Erik,
The 2013 file in that directory seemed to last be touched on June 21. Given its file size is ~1/2 the year previous, I imagine the file hasn't been generating since that time? And therefore doesn't speak to the affected period. Thanks, -AW
On 01/08/2014 03:32 PM, Erik Zachte wrote:
Hi Andrew,
http://dumps.wikimedia.org/other/pagecounts-ez/projectcounts/
'Hourly page views per wiki, corrected for site outages and underreporting. Also repackaged, as one tar file per year.'
Cheers, Erik
-----Original Message----- From: analytics-bounces@lists.wikimedia.org [mailto:analytics-bounces@lists.wikimedia.org] On Behalf Of Andrew G. West Sent: Wednesday, January 08, 2014 21:12 To: analytics@lists.wikimedia.org Subject: [Analytics] Corrected "projectcount" files
Greetings,
I am looking to do some year-end statistical summaries. I am aware of the over-reporting incident involving CentralAuth between August and December 2013:
https://docs.google.com/document/d/1kpJrfataS5KAxGXFoygQVhMlzFftjsvX9HktSAAK...
I know that Erik fixed/re-generated the files and fixed the numbers on the "wikistats" report card, but were the corrected "projectcount" files ever dumped anywhere? I'd like to re-run these through my system.
-- Andrew G. West, PhD Research Scientist Verisign Labs - Reston, VA
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Thanks Andrews,
I just fixed the rsync process. You can download up to date files now.
Erik
-----Original Message----- From: Andrew G. West [mailto:west.andrew.g@gmail.com] Sent: Wednesday, January 08, 2014 21:38 To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics. Cc: ezachte@wikimedia.org Subject: Re: [Analytics] Corrected "projectcount" files
Thanks for the response Erik,
The 2013 file in that directory seemed to last be touched on June 21. Given its file size is ~1/2 the year previous, I imagine the file hasn't been generating since that time? And therefore doesn't speak to the affected period. Thanks, -AW
On 01/08/2014 03:32 PM, Erik Zachte wrote:
Hi Andrew,
http://dumps.wikimedia.org/other/pagecounts-ez/projectcounts/
'Hourly page views per wiki, corrected for site outages and underreporting. Also repackaged, as one tar file per year.'
Cheers, Erik
-----Original Message----- From: analytics-bounces@lists.wikimedia.org [mailto:analytics-bounces@lists.wikimedia.org] On Behalf Of Andrew G. West Sent: Wednesday, January 08, 2014 21:12 To: analytics@lists.wikimedia.org Subject: [Analytics] Corrected "projectcount" files
Greetings,
I am looking to do some year-end statistical summaries. I am aware of the over-reporting incident involving CentralAuth between August and December 2013:
https://docs.google.com/document/d/1kpJrfataS5KAxGXFoygQVhMlzFftjsvX9H ktSAAKfrQ/edit
I know that Erik fixed/re-generated the files and fixed the numbers on the "wikistats" report card, but were the corrected "projectcount" files ever dumped anywhere? I'd like to re-run these through my system.
-- Andrew G. West, PhD Research Scientist Verisign Labs - Reston, VA
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
-- Andrew G. West, PhD Research Scientist Verisign Labs - Reston, VA