Sumana Harihareswara, 20/07/2012 23:38:
On 07/19/2012 04:19 PM, Federico Leva (Nemo) wrote:
Sumana Harihareswara, 19/07/2012 22:08:
I noticed the jump in the June engineering report. Where does the big difference compared to the previous month's number come from?
Nemo
Now that we've re-calculated our numbers for the past few months, it's not really that big a jump -- see http://lists.wikimedia.org/pipermail/wikitech-l/2012-July/061649.html .
Sure, I meant: what's the reason for the previous underreporting, if you've found it?
Nemo
The previous numbers were from Ohloh, and I think Ohloh was still only taking into account Subversion statistics!
Is it still doing so, and if it is, how can we get it fixed? It's a nice resource for some things.
Nemo
On Tue, Jul 24, 2012 at 2:31 PM, Federico Leva (Nemo) nemowiki@gmail.com wrote:
Is it still doing so, and if it is, how can we get it fixed? It's a nice resource for some things.
There are, afaik, two scripts that will take a repository (such as the WMF repository) and turn it into a nice database for analysing contributions/commits.
cvsanaly: http://tools.libresoft.es/cvsanaly/
MininGit: https://github.com/SoftwareIntrospectionLab/MininGit
MininGit is a fork of cvsanaly.
I've been running both scripts on the MW repositories but have run into several issues with them. As soon as I have them running consistently, I can put the db on the toolserver or something like that if there's interest (and toolserver ppl don't mind that).
Finne
Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
-- "Maybe you knew early on that your track went from point A to B, but unlike you I wasn't given a map at birth!" Alyssa, "Chasing Amy"
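For readers who only need rough numbers like the ones discussed in this thread (commits and active committers per month), something much smaller than cvsanaly or MininGit may be enough. The following is a minimal sketch, not the method used for the engineering report and not how either tool works: it assumes a local clone, counts only commits reachable from the checked-out branch, and identifies people by author email. The function names and the tab-separated log format are illustrative choices.

```python
import subprocess
from collections import Counter

# Illustrative format: one line per commit, "YYYY-MM-DD<TAB>author email".
LOG_FORMAT = "--pretty=format:%ad%x09%ae"


def read_git_log(repo_path):
    """Return raw `git log` output for the clone at repo_path."""
    result = subprocess.run(
        ["git", "-C", repo_path, "log", "--date=short", LOG_FORMAT],
        capture_output=True, text=True, check=True,
    )
    return result.stdout


def commits_per_month(log_text):
    """Count commits per (month, author) from tab-separated log lines."""
    counts = Counter()
    for line in log_text.splitlines():
        if "\t" not in line:
            continue  # skip blank or malformed lines
        date, author = line.split("\t", 1)
        # Keep "YYYY-MM" and normalise the email's case.
        counts[(date[:7], author.strip().lower())] += 1
    return counts


def active_committers(counts):
    """Number of distinct authors with at least one commit, per month."""
    authors = {}
    for month, author in counts:
        authors.setdefault(month, set()).add(author)
    return {month: len(names) for month, names in sorted(authors.items())}
```

Usage would be along the lines of `active_committers(commits_per_month(read_git_log("/path/to/clone")))`, where the path is a placeholder. People who commit under several email addresses are counted more than once, which is one of the things the dedicated tools try to handle.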
----- Original message -----
From: Finne Boonen hennar@gmail.com
To: Research into Wikimedia content and communities wiki-research-l@lists.wikimedia.org
CC:
Sent: Tuesday, 24 July 2012 15:01
Subject: Re: [Wiki-research-l] request for Git statistics (or, "don't stand back, I don't know regular expressions") (Wiki-research-l Digest, Vol 83, Issue 13)
I've been running both scripts on the MW repositories but have run into several issues with them. As soon as I have them running consistently, I can put the db on the toolserver or something like that if there's interest (and toolserver ppl don't mind that).
Finne,
Please let me know about the issues you are experiencing with cvsanaly. My colleagues at Bitergia (a new spin-off) or I could help you run the tools against these repos.
Best, Felipe.
On 07/24/2012 08:31 AM, Federico Leva (Nemo) wrote:
Is it still doing so, and if it is, how can we get it fixed? It's a nice resource for some things.
Nemo
Ohloh would sure be a nice resource - I'm not sure how to get it fixed exactly, but please feel free to poke around, tell Ohloh where our new repository is, and try to get it fixed. Sorry, it's a low priority for me right now, but you have my authorization to try to get it fixed.
By the way, the WMF analytics team is working on some new analysis tools for our use of Gerrit but it's still very rough. https://gerrit.wikimedia.org/r/gitweb?p=analytics/gerrit-stats.git;a=summary is the repository to follow (https://gerrit.wikimedia.org/r/#/q/status:open+project:mediawiki/core,n,z).
Sumana Harihareswara, 24/07/2012 22:47:
Ohloh would sure be a nice resource - I'm not sure how to get it fixed exactly, but please feel free to poke around, tell Ohloh where our new repository is, and try to get it fixed. Sorry, it's a low priority for me right now, but you have my authorization to try to get it fixed.
The new repo for core was already there, but extensions were missing; I've now added them. https://www.ohloh.net/p/mediawiki (Some seem to partially disagree, by the way.)
By the way, the WMF analytics team is working on some new analysis tools for our use of Gerrit but it's still very rough. https://gerrit.wikimedia.org/r/gitweb?p=analytics/gerrit-stats.git;a=summary is the repository to follow (https://gerrit.wikimedia.org/r/#/q/status:open+project:mediawiki/core,n,z).
Yes, we have big hopes in this! :)
Nemo
On 07/24/2012 08:09 PM, Federico Leva (Nemo) wrote:
The new repo for core was already there, but extensions were missing; I've now added them. https://www.ohloh.net/p/mediawiki (Some seem to partially disagree, by the way.)
Thanks for stepping up and improving our Ohloh listing, Nemo! I hope others can help you in figuring out and resolving any contradictions.
A note from Felipe Ortega that I got permission to forward onlist:
Hi Sumana.
I see that you have solved your request.
I'm not sure if you know the toolset from my current research group for extracting and analyzing data from Git, as well as for other version control systems:
http://git.libresoft.es/cvsanaly
In fact, despite the name it supports CVS, SVN and Git. Here you can find a glimpse of the type of data that it can generate:
http://git.libresoft.es/cvsanaly/tree/db/cvsanaly_model.svg
We also have other tools for analyzing issue tracking systems (Bugzilla, SF.net, Allura, GitHub, JIRA, and Launchpad, so far).
I'm not sure, but they could probably help you monitor project resources and answer these kinds of questions. My current group at URJC has started a spin-off based on these services, and they are already producing some interesting work for clients like Samsung (for their network of partners in Android) or OpenStack. Here is a mockup (they are using envision.js):
http://gsyc.es/~jgb/repro/2012-akademyes-kdevelop/swscopio.html
Well, hope this helps.
Felipe.