Hi Folks,

As planned a few months ago, generation of the pagecounts-raw and pagecounts-all-sites datasets has now stopped (since 2016-08-05T12:00, to be precise).
As explained by Dan in previous emails, old data will not be removed from the dumps, and the new pageview dataset is available here.


On Thu, May 26, 2016 at 8:34 PM, Dan Andreescu <dandreescu@wikimedia.org> wrote:
Just a reminder: we will be deprecating the pagecounts datasets at the end of May, as we mentioned earlier this year [0]. These files will remain available for researchers to use, but no new files will be generated.

Pagecounts datasets that will be deprecated


Options for switching to the new datasets [1]:
  pageviews for the same format but better quality data
  pagecounts-ez for compressed data

[0] https://lists.wikimedia.org/pipermail/analytics/2016-March/005060.html
[1] https://dumps.wikimedia.org/other/analytics/

Joseph Allemandou
Data Engineer @ Wikimedia Foundation
IRC: joal