Hello everyone - apologies for cross-posting! *TL;DR*: We would like your
feedback on our Metrics Kit project. Please have a look and comment on
The Wikimedia Foundation's Trust and Safety team, in collaboration with the
Community Health Initiative, is working on a Metrics Kit designed to
measure the relative "health" of various communities that make up the
The ultimate outcome will be a public suite of statistics and data looking
at various aspects of Wikimedia project communities. This could be used
both by community members to make decisions on their community direction and
by Wikimedia Foundation staff to point anti-harassment tool development in the
We have a set of metrics we are thinking about including in the kit,
ranging from the ratio of active users to active administrators, to
administrator confidence levels, to off-wiki factors such as freedom to
participate. It's ambitious, and our methods of collecting such data will
Right now, we'd like to know:
* Which metrics make sense to collect? Which don't? What are we missing?
* Where would such a tool ideally be hosted? Where would you normally look
for statistics like these?
* We are aware of the overlap in scope between this and Wikistats <
https://stats.wikimedia.org/v2/#/all-projects> — how might these tools
Your opinions will help to guide this project going forward. We'll be
reaching out at different stages of this project, so if you're interested
in direct messaging going forward, please feel free to indicate your
interest by signing up on the consultation page.
Looking forward to reading your thoughts.
P.S.: Please feel free to CC me in conversations that might happen on this
 What do we mean by "health"? There is no standard definition of what
makes a Wikimedia community "healthy", but there are many indicators that
highlight where a wiki is doing well, and where it could improve. This
project aims to provide a variety of useful data points to inform
community decisions that would benefit from objective data.
*Joe Sutherland* (he/him or they/them)
Trust and Safety Specialist
Over the course of today we will be updating Turnilo and Superset. The new
Superset version does not have major differences, so hopefully you will not
notice any issues. Turnilo, however, includes several new features, among
them the ability to create heatmaps. The new functionality is still not
perfect, but it is quite useful.
See, for example, self-identified bot requests (bots that tell us they are
bots) per country per hour for yesterday:
[image: Screen Shot 2019-08-21 at 4.38.13 PM.png]
As part of https://phabricator.wikimedia.org/T201165 the Analytics team
wanted to reach out to everybody to make it clear that none of the home
directories on the stat/notebook nodes are backed up periodically. They
run on a software RAID configuration spanning multiple disks, of course, so
we are resilient to a disk failure, but, even if unlikely, it might happen
that a host could lose all its data. Please keep this in mind when working
on important projects and/or handling important data that you care about.
I just added a warning to
If you have really important data that is too big to back up, keep in mind
that you can use your home directory (/user/your-username) on HDFS (which
replicates data three times across multiple nodes).
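
For anyone who wants to script this, here is a minimal sketch in Python that
only builds and prints the two `hdfs dfs` commands involved (create a target
directory, then copy a file into it). The "backups" subdirectory, the example
file name and the helper name are assumptions made for illustration, not an
Analytics team convention; adjust them to your own setup.

```python
import getpass
import shlex


def hdfs_backup_commands(local_path, username=None, subdir="backups"):
    """Build the `hdfs dfs` commands that copy local_path into the user's
    HDFS home directory, /user/<username>/<subdir>.

    Hypothetical helper: the "backups" subdirectory is an assumption.
    """
    user = username or getpass.getuser()
    target_dir = f"/user/{user}/{subdir}"
    return [
        # Create the target directory (and any missing parents).
        ["hdfs", "dfs", "-mkdir", "-p", target_dir],
        # Copy the local file, overwriting an older copy if one exists.
        ["hdfs", "dfs", "-put", "-f", local_path, target_dir],
    ]


if __name__ == "__main__":
    # Print the commands instead of running them, so they can be reviewed first.
    for cmd in hdfs_backup_commands("important_results.tsv",
                                    username="your-username"):
        print(shlex.join(cmd))
```

The printed commands can be pasted into a shell on a stat/notebook host, or
passed to subprocess.run once you are happy with them.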
Please let us know if you have comments, suggestions, etc. in the
Thanks in advance!
Luca (on behalf of the Analytics team)
Hi WMF Analytics,
In my web searches in the past few months I am seeing an increasing number
of websites that have republished Wikimedia content, sometimes in ways that
I suspect are in violation of trademark and/or Creative Commons licensing
rules. (My guess is that these sites make money through advertising that
they place on their sites.) Has WMF observed any negative effects in web
traffic that can be attributed to other websites reusing Wikimedia content
It might be interesting if WMF can obtain statistics from web search
providers regarding how many times users click on search engine links to
sites that reuse Wikimedia content and/or trademarks.
( https://meta.wikimedia.org/wiki/User:Pine )