Greetings, describe for me what you ideal data would look like.
On 3/28/07, Reid Priedhorsky reid@umn.edu wrote:
Dear Wikitechnicians,
My name is Reid Priedhorsky, and I'm a Ph.D. student at GroupLens Research, which is the human-computer interaction group at the University of Minnesota.
We are currently working on some research which is investigating Wikipedia contribution and vandalism. To this end, statistics on the view rate of different articles would be extremely helpful to us -- something along the lines of Leon Weber's WikiCharts tool, but with a larger limit (ideally all 1.7 million articles).
It seems to me that the easiest way to accomplish this would be to get copies of your sampled Squid logs (as described on http://lists.wikimedia.org/pipermail/wikitech-l/2007-January/029000.html and its links). We do not need the client IP or any other similarly sensitive data, though if you gave it to us we would protect it carefully as we protect the other sensitive research data we handle.
Would it be possible for us to have access to these log files?
If not, I would love to begin a discussion on what it would be possible for us to access.
Your help would be greatly appreciated. Please let me know if you have any questions.
Thanks,
Reid
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org http://lists.wikimedia.org/mailman/listinfo/wikitech-l