Hi,
within the Analytics cluster, the pagecounts-all-sites dataset is still referred to by the legacy name “webstats” in
- the wmf.webstats Hive table, and
- the /wmf/data/archive/webstats HDFS path.
Neither has any known external customers, so renaming them to "pagecounts-all-sites" should not affect anyone.
But just in case: if you use either of them, let us know by 2015-01-09 08:00 UTC.
If no one speaks up, I'll remove the webstats Hive table and the webstats path in HDFS.
The new Hive table
wmf.pagecounts_all_sites
and the new HDFS path
/wmf/data/archive/pagecounts-all-sites
are already available and contain both the old and the new data.
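For anyone who needs to update their own queries or scripts, the rename could be applied with a small helper along these lines. This is only a sketch: the helper name `update_legacy_names` and the sample inputs are made up for illustration, and only the two legacy names and their replacements come from this mail.

```python
import re

# Legacy name -> new name, as listed in this mail.
RENAMES = {
    "wmf.webstats": "wmf.pagecounts_all_sites",
    "/wmf/data/archive/webstats": "/wmf/data/archive/pagecounts-all-sites",
}

# Match a legacy name only when it is not followed by a word character or
# hyphen, so unrelated names such as "wmf.webstats_backup" stay untouched.
_LEGACY = re.compile(
    "(" + "|".join(re.escape(old) for old in RENAMES) + r")(?![\w-])"
)


def update_legacy_names(text: str) -> str:
    """Return text with the legacy webstats names replaced (hypothetical helper)."""
    return _LEGACY.sub(lambda match: RENAMES[match.group(1)], text)


if __name__ == "__main__":
    # Illustrative inputs only; adapt to your own queries and scripts.
    print(update_legacy_names("SELECT * FROM wmf.webstats LIMIT 10"))
    print(update_legacy_names("hdfs dfs -ls /wmf/data/archive/webstats/2015"))
```

Running the text of a query or script through the helper and diffing the result against the original shows where the legacy names were still in use.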
Have fun, Christian
P.S.: This only affects the data in the Analytics cluster. The public URL stays unaffected:
http://dumps.wikimedia.org/other/pagecounts-all-sites/
No changes there.
Hi,
On Tue, Jan 06, 2015 at 02:01:26AM +0100, Christian Aistleitner wrote:
> [...]
> - the wmf.webstats Hive table, and
> - the /wmf/data/archive/webstats HDFS path.
> Both have no known external customers, so renaming them to "pagecounts-all-sites" should not affect anyone.
> But just in case ... if you use any of them, let us know by 2015-01-09 08:00 UTC.
> If no one speaks up, I'll remove the webstats Hive table, and the webstats path in HDFS.
Done.
Have fun, Christian