Hi, I'm going through the Gerrit repo to pick out the projects that we want to scan for our tech community metrics.
There is a lot of stuff under analytics/:
https://gerrit.wikimedia.org/r/#/admin/projects/?filter=analytics
Should we include all of it or are there repositories that we can ignore?
What we are discarding in general:
* Upstream projects that we simply repackage or fork with a few patches.
* Data (as opposed to code) that would just bloat our metrics.
* Sandboxes and personal experiments.
PS: this will also be useful for updating https://www.ohloh.net/p/wmf-analytics (are you really managing 1M lines of code?)