I don't personally know of any central repository for data on past edit-a-thons.

There might be something out there. You could probably get some information from pinging folks in CE who've worked on Project & Event Grants (Asaf Bartov, Kacie Harold) or Program Evaluation (Amanda Bittaker, Edward Galvez), or search through past grant reports... but I'm guessing the data will be sparse and inconsistent, as it is still collected in a somewhat ad-hoc fashion.

If WMF were to support the development and maintenance of standardized infrastructure for edit-a-thon tracking--something like Harsh Kothari and Jeph Paul's platform for the Indian Wikiwomen edit-a-thons (site, code)--this would be easier. But AFAIK that hasn't happened. If someone takes up that cause I will voice my support.

On Mon, Dec 7, 2015 at 7:34 PM, Maximilian Klein <isalix@gmail.com> wrote:

Researchians,

I have a been collecting data on the gendered biographies of different Wikipedia Languages from Wikidata dumps, with the question of trying to understand the gender gap in content. After reading about Propensity Score Matching[1] today, I see it would be possible to test a (close to) causal link between the genders of Wikipedia Biographies being added to a language, and Editathon activity. Yet we'd need the data for editathon activity. Is it compiled somewhere, or can you think of how it could be compiled?

[1] https://en.wikipedia.org/wiki/Propensity_score_matching The idea in propensity score matching is to pretend a randomized experiment is being conducted, and to find a "control group" - a similar but untreated language, for each "treated group".

Make a great day,
Max Klein ‽ http://notconfusing.com/

_______________________________________________
Wiki-research-l mailing list
Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l

Jonathan T. Morgan

Senior Design Researcher

Wikimedia Foundation

User:Jmorgan (WMF)