Hi Goran,
We'll keep an eye on consistency, but the data extraction is not a fully
automated process.
We identified 3 columns that had slightly different names and we'll fix
them:
- overall SIZE rank (2020) vs. overall size rank (2018, 2019)
- second month editor retention (2020) vs. second-month new editor
  retention (2018, 2019)
- monthly structured discussions messages (2020) vs. monthly structured
  discussions (Flow) messages (2018, 2019)
The "project code" column was duplicated in 2020; the duplicate has now
been removed.
Finally, in 2019 we added 3 new columns that we hadn't tracked in
2018: content pages, cumulative content edits, edits per content page.
Please be aware that we may add or change columns in the future as needs
evolve.
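
For anyone loading the 2018, 2019, and 2020 sheets together in the
meantime, the differences above can be sketched as a small lookup. This
is only an illustration under stated assumptions: it treats the
2018/2019 spellings as the canonical ones (the thread does not say which
spelling will be kept), and the names RENAMES_2020, harmonize_header,
and to_records are hypothetical helpers, not part of any Product
Analytics tooling.

```python
# Hypothetical helpers for illustration only.
# Assumption: the 2018/2019 column spellings are the canonical ones.
RENAMES_2020 = {
    "overall SIZE rank": "overall size rank",
    "second month editor retention": "second-month new editor retention",
    "monthly structured discussions messages":
        "monthly structured discussions (Flow) messages",
}


def harmonize_header(columns):
    """Return (names, keep): canonical names plus the indices of kept columns.

    A column whose canonical name was already seen is dropped, which covers
    the duplicated "project code" column in the 2020 sheet.
    """
    names, keep, seen = [], [], set()
    for index, column in enumerate(columns):
        name = RENAMES_2020.get(column.strip(), column.strip())
        if name in seen:
            continue
        seen.add(name)
        names.append(name)
        keep.append(index)
    return names, keep


def to_records(header, rows):
    """Turn a header row and data rows into dicts keyed by canonical name."""
    names, keep = harmonize_header(header)
    return [dict(zip(names, (row[i] for i in keep))) for row in rows]


# Placeholder values, not real data from the dataset.
header_2020 = ["project code", "overall SIZE rank", "project code"]
records = to_records(header_2020, [["xx", "n/a", "xx"]])
```

Columns that exist only from 2019 onward (content pages, cumulative
content edits, edits per content page) are simply absent from 2018
records, so reading them with records[0].get("content pages") returns
None rather than raising an error.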
Warm regards,
Kate
On Tue, Feb 23, 2021 at 12:37 PM Goran Milovanovic <
goran.milovanovic_ext(a)wikimedia.de> wrote:
Well, it would be desirable to maintain consistent column names across
the years...
Best,
Goran
Goran S. Milovanović, PhD
Data Scientist, Software Department
Wikimedia Deutschland
------------------------------------------------
"It's not the size of the dog in the fight,
it's the size of the fight in the dog."
- Mark Twain
------------------------------------------------
On Tue, Feb 23, 2021 at 2:42 AM Jennifer Wang <jwang(a)wikimedia.org> wrote:
Hi all,
For your reference, we have updated the wiki comparison dataset
<https://www.mediawiki.org/wiki/Product_Analytics/Comparison_datasets>
with 2020 data
<https://docs.google.com/spreadsheets/d/1a-UBqsYtJl6gpauJyanx0nyxuPqRvhzJRN817XpkuS8/edit?usp=sharing>.
If you have any feedback or suggestions, please let us know via
product-analytics(a)wikimedia.org.
Regards,
Jennifer & Product Analytics
_______________________________________________
Analytics mailing list
Analytics(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics