Thanks for the clarification, Joseph.
Bo
On Tue, Mar 1, 2016 at 2:02 PM, Joseph Allemandou <jallemandou@wikimedia.org> wrote:
Hi again,
@Dan: We will indeed reload the data into Cassandra.
@Bo: Actually the two datasets are fairly different.
The one called pagecounts is slowly being deprecated in favor of the one called pageview, which is defined by the Research team at WMF: https://meta.wikimedia.org/wiki/Research:Page_view
The pageview dumps are actually a 'legacy format' view of the new pageview :)
Code for the legacy extraction: https://github.com/wikimedia/analytics-refinery/blob/master/oozie/pagecounts...
Code for the new pageview definition: https://github.com/wikimedia/analytics-refinery-source/blob/master/refinery-...
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics