Hi there,
Asking for a task to volunteer, Sumana encouraged me to look at the topic of community metrics. She pointed to https://wikitech.wikimedia.org/view/Pentaho as a starting point.
After a first look at Pentaho and what some colleagues at the MeeGo project did with it [1], I searched (a bit) for any wiki pages or discussions about community metrics here, but couldn't find any.
http://www.mediawiki.org/wiki/User:Qgil/MediaWiki_Community_Metrics
Edits welcome!
I'm looking for feedback, help, and a first prototype of an automatically refreshed report hopefully sooner than later. Something simple to build upon.
Even if it's too tempting to define the first prototype thinking first on tools or data available, you are encouraged to start by proposing what questions do you want actually answered. What community trends do you want to know?
See http://www.mediawiki.org/wiki/User:Qgil/MediaWiki_Community_Metrics#Trends_t... - in few days we should have agreed on the first and most important trends we want to visualize.
[1] http://wiki.meego.com/Metrics/Dashboard
-- Quim
On 09/28/2012 01:10 PM, Quim Gil wrote:
Even if it's too tempting to define the first prototype thinking first on tools or data available, you are encouraged to start by proposing what questions do you want actually answered. What community trends do you want to know?
A first proposal to stir your pros and cons:
=== Developers ===
* [[Developers]] with [[Gerrit]] access. ** Reviewers. ** Core developers with merge permissions. ** Active in the past week / month / year. ** WMF employees, other MediaWiki professionals, hobbyists. ** Countries where they work from. * New accounts. ** How many requests (approved, declined?) per week / month / year. ** Primary motivation: new or existing project - which projects. ** WMF employees, other MediaWiki professionals, hobbyists. ** Countries where they work from.
=== Software projects ===
* [https://gerrit.wikimedia.org/r/#/admin/projects/ Projects in Gerrit] ** Types of project: MediaWiki core, extensions, mobile, infrastructure... ** Active in the past week / month / year. ** Officially supported. ** Considered stable, beta, experimental. * Data per project: ** Commits (merged, rejected, waiting) and reviews. ** Committers and reviewers. ** WMF employees, other MediaWiki professionals, hobbyists. ** Countries where they work from.
I wonder how much of this can already be extracted with http://gerrit-stats.wmflabs.org/ . /me must look deeper.
See http://www.mediawiki.org/wiki/User:Qgil/MediaWiki_Community_Metrics#Trends_t...
- in few days we should have agreed on the first and most important
trends we want to visualize.
PS: http://www.ohloh.net/p/mediawiki/contributors seems to be stuck in svn?
-- Quim
Hi, update on http://www.mediawiki.org/wiki/User:Qgil/MediaWiki_Community_Metrics
- Amgine proposed to describe the problem first, and now we have http://www.mediawiki.org/wiki/User:Qgil/MediaWiki_Community_Metrics#Problems...
- ^demon, MarkTraceur and Diederik provided more feedback on what exists and what can't be provided nowadays. Very useful to focus the goals of the first prototype. For instance, even if would be useful to know where are the contributors located, the facts is that we don't have this data.
- After a call with Diederik we both agree that that the best initial approach is to look at what can Gerrit provide, and use/expand gerrit-stats + Limn for the first Dashboard prototype.
Question: what is the best approach to retrieve the number of existing Gerrit accounts?
-- Quim
Hi,
On 10/02/2012 03:12 PM, Diederik van Liere wrote:
Question: what is the best approach to retrieve the number of existing Gerrit accounts?
This number is already stored within gerrit-stats, it is just not being written to a dataset.
Right, is anybody planning to add this? Otherwise I will try - but I will need help with so many new pieces.
I have cloned gerrit-stats, Limn and scorecard to understand how these tools work. I'm going through the files, but it will be helpful if you point me where to look at. :)
While Limn is very well documented I couldn't find any gerrit-stats docs, is that right?
I'm not even sure where to find the code for http://gerrit-stats.wmflabs.org/ . In gerrit I could only find the /analytics/scorecard project.
Also thinking what is the best way to recreate http://gerrit-stats.wmflabs.org/ to play with and test before pushing any changes.
Thank you for your help.
-- Quim
I'm not even sure where to find the code for http://gerrit-stats.wmflabs.* *org/ http://gerrit-stats.wmflabs.org/ . In gerrit I could only find the /analytics/scorecard project.
The repo is available at: https://gerrit.wikimedia.org/r/gitweb?p=analytics%2Fgerrit-stats.git;a=short... As mentioned before, Limn is responsible for visualizing the data, gerrit-stats only pulls data from Gerrit and construct measures. Happy to discuss how to come up with developer centric measures.
Best, Diederik
On 10/15/2012 07:39 AM, Diederik van Liere wrote:
The repo is available at: https://gerrit.wikimedia.org/r/gitweb?p=analytics%2Fgerrit-stats.git;a=short... As mentioned before, Limn is responsible for visualizing the data, gerrit-stats only pulls data from Gerrit and construct measures. Happy to discuss how to come up with developer centric measures.
Alright, if I'm understanding correctly the pieces:
The available types of data available from the Gerrit accounts can be seen at the 'accounts' table at [analytics/gerrit-stats.git] / gerritstats / sql / database_design.sql https://gerrit.wikimedia.org/r/gitweb?p=analytics/gerrit-stats.git;a=blob;f=...
The accounts data actually retrieved is defined at [analytics/gerrit-stats.git] / gerritstats / developer.py using the 'Developer' object. https://gerrit.wikimedia.org/r/gitweb?p=analytics/gerrit-stats.git;a=blob;f=...
Is this data being actually collected? Is it available and accessible anywhere? I couldn't find it at [analytics/gerrit-stats/data.git] / datafiles / analytics / gerrit-stats / https://gerrit.wikimedia.org/r/gitweb?p=analytics/gerrit-stats/data.git;a=tr...
And (finally?) the structure of http://gerrit-stats.wmflabs.org/ is being defined by [analytics/gerrit-stats/data.git] / dashboards / gerrit.json https://gerrit.wikimedia.org/r/gitweb?p=analytics/gerrit-stats/data.git;a=bl...
The developer graphs should be added there once it is clear what data we want show and where can Limn find it, right?
For the sake of simplicity, I propose to start with a simple graph showing how many accounts `registered_on`. If I can get your help connecting all the dots for this then it will be a lot easier for me to come up with more graphs.
-- Quim
wikitech-l@lists.wikimedia.org