Hi Federico,
On Mon, 2013-11-11 at 19:40 +0100, Federico Leva (Nemo) wrote:
Alvaro del Castillo, 11/11/2013 17:51:
You can see it working in the Tech Community Metrics dashboard applied to mediawiki.org:
http://korma.wmflabs.org/browser/mediawiki.html
Right now it shows:
- Total pages created and the evolution of page creation over time.
- The same for edits
- The same for editors
Thanks! This will be interesting for the MediaWiki wikis without regular dumps.
Yes, this is our target.
As for mediawiki.org, in your goals what is this service going to add to the main statistics http://stats.wikimedia.org/wikispecial/EN/TablesWikipediaMEDIAWIKI.htm ?
I am not sure we are going to add anything new, since that report is pretty exhaustive, but we are creating graphs similar to the other panels in Tech Community Metrics.
We are also updating the data daily, as we do for the other data sources.
And we are integrating user identities across the full process with the other data sources, so that, for example, you can see the activity of a developer in git, gerrit, bugzilla, irc, mailing lists and mediawiki.
I think we should link to this report from Tech Community Metrics so that people can get all the details about the statistics.
Some weirdnesses I noted:
1) Total edit counts don't match, compare e.g. Krinkle, Kgh, Jack, Jeroen: http://stats.wikimedia.org/wikispecial/EN/TablesWikipediaMEDIAWIKI.htm#wikip... . This probably means that you are considering the wrong namespaces; please use content namespaces.
OK! I am taking a look at the namespaces. It is easy to filter them using the API. The key is to understand which ones are content namespaces. Any recommendations? I have seen that the "* Talk" namespaces hold discussion pages, so we can exclude them.
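A minimal sketch of one way to find them, assuming the siteinfo query (action=query, meta=siteinfo, siprop=namespaces) marks content namespaces with a "content" key in its JSON output. The function name and the sample response are illustrative only; they are not taken from MediaWikiAnalysis or from real data:

```python
def content_namespace_ids(siteinfo_response):
    """Return the sorted IDs of namespaces flagged as content namespaces."""
    namespaces = siteinfo_response["query"]["namespaces"]
    ids = []
    for ns in namespaces.values():
        # Assumption: the legacy JSON format uses an empty string as a
        # "present" flag, while formatversion=2 uses a boolean.
        # Accept both shapes.
        if "content" in ns and ns["content"] is not False:
            ids.append(int(ns["id"]))
    return sorted(ids)


# Hand-written sample shaped like a siteinfo response (not real data).
SAMPLE = {
    "query": {
        "namespaces": {
            "0": {"id": 0, "*": "", "content": ""},
            "1": {"id": 1, "*": "Talk"},
            "12": {"id": 12, "*": "Help", "content": ""},
        }
    }
}

print(content_namespace_ids(SAMPLE))
```

With this approach the list of namespaces would come from the wiki's own configuration instead of being hard-coded in the tool.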
2) The number of editors is not explained. I hope it matches the metrics definitions (it seems similar) but it's not clear.
3) Number of edits per month is suspiciously low in some old months, even more than (1) would seem to justify.
Yes, by default when querying allpages we are getting just ns=0 (Default: 0). Once this is fixed, we can recheck the data.
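A rough sketch of what the fix could look like: since apnamespace takes a single namespace, the allpages query would be issued once per namespace. The helper name and the namespace IDs below are placeholders, and continuation of long result sets is not handled here:

```python
from urllib.parse import urlencode


def allpages_urls(api_url, namespace_ids, limit=500):
    """Build one list=allpages request URL per namespace ID."""
    urls = []
    for ns in namespace_ids:
        params = {
            "action": "query",
            "list": "allpages",
            "apnamespace": ns,   # one namespace per request
            "aplimit": limit,
            "format": "json",
        }
        urls.append(api_url + "?" + urlencode(params))
    return urls


# Placeholder namespace IDs; the real list depends on the wiki.
for url in allpages_urls("https://www.mediawiki.org/w/api.php", [0, 12]):
    print(url)
```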
For instance, for October 2007 you report 28 pages (new pages, I suppose) and 275 edits (to the unknown namespaces of (1), I guess) while WikiStats says 5 new countable pages per day and 1.8 k (total) edits.
OK, we should look into it. The queries are not that complex [0], but it is my first time with this API, so I could be doing something wrong.
Cheers
Nemo
[0] https://github.com/MetricsGrimoire/MediaWikiAnalysis/blob/master/mediawiki_a...