cu_changes table, but since we regenerate the Data Lake's editing data every month, we instead keep data in mediawiki_private_cu_changes and geoeditors_daily for the two latest calendar months (the month of the latest mediawiki_history snapshot and the previous). Older data may be temporarily available before it is purged, but you should not rely on this.
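As a rough illustration of the retention rule described above, the sketch below works out which two calendar months would be kept for a given snapshot. It assumes the snapshot is labelled as a "YYYY-MM" string, and the function name retained_months is hypothetical; it is not part of any Analytics tooling.

```python
from typing import List


def retained_months(snapshot: str) -> List[str]:
    """Return the two months assumed to be retained for a snapshot label.

    Assumption: `snapshot` is a "YYYY-MM" string naming the month of the
    latest mediawiki_history snapshot.
    """
    year, month = (int(part) for part in snapshot.split("-"))
    # Step back one month, wrapping January to December of the previous year.
    if month == 1:
        prev_year, prev_month = year - 1, 12
    else:
        prev_year, prev_month = year, month - 1
    # Oldest month first, then the snapshot month itself.
    return [f"{prev_year:04d}-{prev_month:02d}", f"{year:04d}-{month:02d}"]


if __name__ == "__main__":
    print(retained_months("2018-10"))
```

Anything older than the first month in the returned list should be treated as already purged, even if it happens to still be queryable.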
That's right, Neil, I just changed the language around a bit, thanks for updating that!

On Tue, Nov 20, 2018 at 3:26 PM Neil Patel Quinn <nquinn@wikimedia.org> wrote:

Hey there!

Could someone from Analytics clarify the purging schedule for geoeditors_daily and add it on Wikitech? I've added some information based on my experience using the dataset, but it may not be fully accurate.

I wrote:

Because these tables contain the countries of individual editors, we only keep the data corresponding to the two most recent full months (the month of the latest mediawiki_history snapshot and the previous).

_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics