It is useful to know there would be a way using the Eventlog. Likewise, I totally understand using mediawiki to this purpose would require a formal collaboration. 

Since this is not an immediate project (I am asking funding for it at the moment), there would be time to arrange it and find the best way for both technical and formal parts.

By now I have the information I need. Thank you everyone.



El dv., 1 jul. 2016 a les 19:07, Leila Zia (<leila@wikimedia.org>) va escriure:
Hi Marc,

On Tue, Jun 28, 2016 at 6:36 AM, Marc Miquel <marcmiquel@gmail.com> wrote:
Since this would be for a research project I might ask funding for it, I would like to know if I could count on that, what is the nature of the available data, and what would be the procedure to obtain this data and if there would be any implication because of privacy concerns.

​We grant access to webrequest log data and the non-public derivatives of it not very frequently. When we do, we do it through creating formal collaborations with the researchers. What these collaborations are and how we set them up are explained at https://www.mediawiki.org/wiki/Wikimedia_Research/Formal_collaborations.

To provide more context:

Requiring formal collaborations as a necessary step for accessing the data means that we cannot scale rapidly, i.e, each researcher on our team is only able to be involved in so many of them. The practical cap is somewhere around 3 collaborations per researcher in my experience. We understand that this is a problem as we would like more researchers to work with this data. We reconsider ways for expanding our capacity to collaborate frequently. We also always consider releasing more data-sets publicly since ultimately, that's one of the best ways for us to empower others do what they want to work on and find value in.


Thank you very much!


Marc Miquel

Wiki-research-l mailing list

Analytics mailing list