Just a reminder, we will be deprecating the pagecounts datasets at the end of May, as we mentioned earlier this year [0]. This means these files will remain there to be used by researchers but new files will not be generated in the future.
*Pagecounts datasets that will be deprecated*
pagecounts-raw
pagecounts-all-sites

Options for switching to the new datasets [1]:
pageviews for the same format but better quality data
pagecounts-ez for compressed data

[0] https://lists.wikimedia.org/pipermail/analytics/2016-March/005060.html
[1] https://dumps.wikimedia.org/other/analytics/
Hey Dan, thanks for the reminder! I'm worried there are a lot of community and GLAM tools that rely on these datasets and are not yet transitioned to the new data sources (like WikiProject popular pages https://tools.wmflabs.org/popularpages/). Is there any chance we could get a 1 month reprieve so that Community Tech could track down and fix some of these tools before the deprecation? Sorry we didn't do this earlier. We just hired a new developer though and I think this would be a good on-boarding task for them. Let me know what you think.
On Thu, May 26, 2016 at 11:34 AM, Dan Andreescu dandreescu@wikimedia.org wrote:
Just a reminder, we will be deprecating the pagecounts datasets at the end of May, as we mentioned earlier this year [0]. This means these files will remain there to be used by researchers but new files will not be generated in the future.
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
It's as simple as changing the download URL. The new version is backward compatible.
https://dumps.wikimedia.org/other/pageviews/
Erik Zachte
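As a rough illustration of the URL change described above, the sketch below builds the download URL for one hourly file under either dataset. The directory layout (`<year>/<year>-<month>/`) and the file-name pattern (`<prefix>-YYYYMMDD-HH0000.gz`) are assumptions that are not stated in this thread, and `hourly_dump_url` is a hypothetical helper; check the directory listing before relying on either.

```python
from datetime import datetime

BASE = "https://dumps.wikimedia.org/other"

# Assumed file-name prefix for each dataset directory (not confirmed in this thread).
PREFIXES = {
    "pagecounts-raw": "pagecounts",
    "pageviews": "pageviews",
}


def hourly_dump_url(dataset: str, hour: datetime) -> str:
    """Build the URL of one hourly dump file for the given dataset."""
    prefix = PREFIXES[dataset]
    return (
        f"{BASE}/{dataset}/{hour:%Y}/{hour:%Y-%m}/"
        f"{prefix}-{hour:%Y%m%d-%H}0000.gz"
    )


hour = datetime(2016, 5, 1, 0)
old_url = hourly_dump_url("pagecounts-raw", hour)
new_url = hourly_dump_url("pageviews", hour)
print(old_url)
print(new_url)
```

Under these assumptions, a tool that downloads hourly files would only need to pass a different dataset name; the rest of its download logic stays the same.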
From: Analytics [mailto:analytics-bounces@lists.wikimedia.org] On Behalf Of Ryan Kaldari
Sent: Thursday, May 26, 2016 23:27
To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics.
Subject: Re: [Analytics] Pagecount Datasets to be Deprecated at the end of May
Hey Dan,
thanks for the reminder! I'm worried there are a lot of community and GLAM tools that rely on these datasets and are not yet transitioned to the new data sources (like WikiProject popular pages https://tools.wmflabs.org/popularpages/ ). Is there any chance we could get a 1 month reprieve so that Community Tech could track down and fix some of these tools before the deprecation? Sorry we didn't do this earlier. We just hired a new developer though and I think this would be a good on-boarding task for them. Let me know what you think.
Cool. WikiProject Popular Pages is fixed now, BTW. We'll try to make sure everyone is switched over ASAP. Thanks for the extra time!
On May 26, 2016, at 4:45 PM, Dan Andreescu dandreescu@wikimedia.org wrote:
No problem at all, end of June is fine with us. But Erik's right: it's just a matter of changing the URL you download from. The rest is meant to be compatible, and we can help if you have questions.
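To illustrate what "compatible" could mean for a consumer of these files, here is a minimal sketch of a line parser that would work unchanged on either dataset, assuming each line holds four space-separated fields (project code, page title, view count, byte count). That layout is an assumption not stated in this thread; `parse_line`, `HourlyCount`, and the sample line are hypothetical.

```python
from typing import NamedTuple


class HourlyCount(NamedTuple):
    project: str
    title: str
    views: int
    bytes_sent: int


def parse_line(line: str) -> HourlyCount:
    """Parse one line, assuming 'project title views bytes' separated by single spaces."""
    project, title, views, bytes_sent = line.rstrip("\n").split(" ")
    return HourlyCount(project, title, int(views), int(bytes_sent))


sample = "en Main_Page 42 0"  # made-up sample line, not real data
record = parse_line(sample)
print(record)
```

If the field layout really is shared, a parser like this needs no changes after the switch; only the source URL differs.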
Hey Ryan, just seeing if everyone's switched over and we can deprecate these datasets now. We won't do it until after the long weekend, so no rush getting back to us.
On Fri, May 27, 2016 at 12:45 PM, Ryan Kaldari rkaldari@wikimedia.org wrote:
Cool. WikiProject Popular Pages is fixed now, BTW. We'll try to make sure everyone is switched over ASAP. Thanks for the extra time!
Yes, we've switched over everything we know about. Thanks for the ping!
On Fri, Jul 1, 2016 at 8:59 AM, Dan Andreescu dandreescu@wikimedia.org wrote:
Hey Ryan, just seeing if everyone's switched over and we can deprecate these datasets now. We won't do it until after the long weekend, so no rush getting back to us.
Hi Folks,
As planned a few months ago, *generation of the pagecounts-raw and pagecounts-all-sites datasets has now stopped* (since 2016-08-05T12:00, to be precise). As Dan explained in previous emails, old data will not be removed from the dumps, and the new pageviews dataset is available here: https://dumps.wikimedia.org/other/pageviews/

Cheers
Joseph