Dear Wikitechnicians,
My name is Reid Priedhorsky, and I'm a Ph.D. student at GroupLens Research, which is the human-computer interaction group at the University of Minnesota.
We are currently working on research investigating Wikipedia contribution and vandalism. To this end, statistics on the view rate of different articles would be extremely helpful to us -- something along the lines of Leon Weber's WikiCharts tool, but with a larger limit (ideally all 1.7 million articles).
It seems to me that the easiest way to accomplish this would be to get copies of your sampled Squid logs (as described on http://lists.wikimedia.org/pipermail/wikitech-l/2007-January/029000.html and its links). We do not need the client IP or any other similarly sensitive data, though if you gave it to us we would protect it carefully as we protect the other sensitive research data we handle.
Would it be possible for us to have access to these log files?
If not, I would love to begin a discussion on what it would be possible for us to access.
Your help would be greatly appreciated. Please let me know if you have any questions.
Thanks,
Reid
Greetings, describe for me what your ideal data would look like.
On 3/28/07, Reid Priedhorsky reid@umn.edu wrote:
Dear Wikitechnicians,
My name is Reid Priedhorsky, and I'm a Ph.D. student at GroupLens Research, which is the human-computer interaction group at the University of Minnesota.
We are currently working on research investigating Wikipedia contribution and vandalism. To this end, statistics on the view rate of different articles would be extremely helpful to us -- something along the lines of Leon Weber's WikiCharts tool, but with a larger limit (ideally all 1.7 million articles).
It seems to me that the easiest way to accomplish this would be to get copies of your sampled Squid logs (as described on http://lists.wikimedia.org/pipermail/wikitech-l/2007-January/029000.html and its links). We do not need the client IP or any other similarly sensitive data, though if you gave it to us we would protect it carefully as we protect the other sensitive research data we handle.
Would it be possible for us to have access to these log files?
If not, I would love to begin a discussion on what it would be possible for us to access.
Your help would be greatly appreciated. Please let me know if you have any questions.
Thanks,
Reid
Reid Priedhorsky wrote:
Dear Wikitechnicians,
My name is Reid Priedhorsky, and I'm a Ph.D. student at GroupLens Research, which is the human-computer interaction group at the University of Minnesota.
We are currently working on research investigating Wikipedia contribution and vandalism. To this end, statistics on the view rate of different articles would be extremely helpful to us -- something along the lines of Leon Weber's WikiCharts tool, but with a larger limit (ideally all 1.7 million articles).
Producing such statistics will be a Google Summer of Code project this summer. If you can't wait that long, then we can give you a sampled, anonymised log stream to analyse.
-- Tim Starling
Tim Starling wrote:
Reid Priedhorsky wrote:
Dear Wikitechnicians,
My name is Reid Priedhorsky, and I'm a Ph.D. student at GroupLens Research, which is the human-computer interaction group at the University of Minnesota.
We are currently working on research investigating Wikipedia contribution and vandalism. To this end, statistics on the view rate of different articles would be extremely helpful to us -- something along the lines of Leon Weber's WikiCharts tool, but with a larger limit (ideally all 1.7 million articles).
Producing such statistics will be a Google Summer of Code project this summer. If you can't wait that long, then we can give you a sampled, anonymised log stream to analyse.
Yes, summer would be too late: anonymised logs would be excellent for our purposes. Does "stream" mean that we would need to write a program to listen to the real-time log stream, or could you give us files?
Gregory Maxwell wrote:
Greetings, describe for me what your ideal data would look like.
Ideal data would be log files that just looked like:
Main Page\t1169499304.066
i.e., article titles as they appear in the XML dumps, plus the request time.
A close second choice would be simply-anonymized logs, e.g.:
sq18.wikimedia.org 1715898 1169499304.066 0 - TCP_MEM_HIT/200 13208 GET http://en.wikipedia.org/wiki/Main_Page NONE/- text/html - - -
If the logs still contain duplicates due to requests being forwarded between squids, we'd need pointers on how to resolve those.
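For concreteness, here is a rough sketch (Python) of how we would read either format on our end; the field positions for the squid-style line are only my guesses from the sample above:

    def parse_ideal(line):
        # Ideal format: article title, a tab, then the request timestamp.
        title, ts = line.rstrip("\n").split("\t")
        return title, float(ts)

    def parse_squid(line):
        # Squid-style sample: the timestamp looks like the 3rd field, the request URL the 9th.
        fields = line.split()
        return fields[8], float(fields[2])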
Please let me know what the next step is. Thanks for your help!
Reid
On 29/03/07, Reid Priedhorsky reid@umn.edu wrote:
Tim Starling wrote:
Reid Priedhorsky wrote:
Dear Wikitechnicians,
My name is Reid Priedhorsky, and I'm a Ph.D. student at GroupLens Research, which is the human-computer interaction group at the University of Minnesota.
We are currently working on research investigating Wikipedia contribution and vandalism. To this end, statistics on the view rate of different articles would be extremely helpful to us -- something along the lines of Leon Weber's WikiCharts tool, but with a larger limit (ideally all 1.7 million articles).
Producing such statistics will be a Google Summer of Code project this summer. If you can't wait that long, then we can give you a sampled, anonymised log stream to analyse.
Yes, summer would be too late: anonymised logs would be excellent for our purposes. Does "stream" mean that we would need to write a program to listen to the real-time log stream, or could you give us files?
Gregory Maxwell wrote:
Greetings, describe for me what your ideal data would look like.
Ideal data would be log files that just looked like:
Main Page\t1169499304.066
i.e., article titles as they appear in the XML dumps, plus the request time.
A close second choice would be simply-anonymized logs, e.g.:
sq18.wikimedia.org 1715898 1169499304.066 0 - TCP_MEM_HIT/200 13208 GET http://en.wikipedia.org/wiki/Main_Page NONE/- text/html - - -
If the logs still contain duplicates due to requests being forwarded between squids, we'd need pointers on how to resolve those.
Please let me know what the next step is. Thanks for your help!
Reid
Just a small aside: please keep us up-to-date on the outcome of the research over on the Wiki-research-l mailing list. It's always interesting (and potentially useful) to see how Wikipedia is used.
On 3/30/07, Oldak Quill oldakquill@gmail.com wrote:
Just a small aside: please keep us up-to-date on the outcome of the research over on the Wiki-research-l mailing list. It's always interesting (and potentially useful) to see how Wikipedia is used.
Yes, I second this. We seem to get quite a few posts of the type "We have been intensively researching some feature of Wikipedia for the last 2 years, and we just need one detail to continue our research". And that's the last we hear of them. I'd love to hear the results of it - it would benefit our project a lot to have some hard statistics.[1]
Steve
[1] Is that an oxymoron?
Steve Bennett wrote:
On 3/30/07, Oldak Quill oldakquill@gmail.com wrote:
Just a small aside: please keep us up-to-date on the outcome of the research over on the Wiki-research-l mailing list. It's always interesting (and potentially useful) to see how Wikipedia is used.
Yes, I second this. We seem to get quite a few posts of the type "We have been intensively researching some feature of Wikipedia for the last 2 years, and we just need one detail to continue our research". And that's the last we hear of them. I'd love to hear the results of it - it would benefit our project a lot to have some hard statistics.[1]
Certainly. Our goal is to publish in a standard HCI venue, and those publications are public info. I've put it on my to-do list to send a note to wiki-research-l when our results are available.
Take care,
Reid
p.s. Thanks for the pointer to that list -- I wasn't aware of it, and its content looks quite interesting.
Reid Priedhorsky wrote:
Tim Starling wrote:
Reid Priedhorsky wrote:
Dear Wikitechnicians,
My name is Reid Priedhorsky, and I'm a Ph.D. student at GroupLens Research, which is the human-computer interaction group at the University of Minnesota.
We are currently working on research investigating Wikipedia contribution and vandalism. To this end, statistics on the view rate of different articles would be extremely helpful to us -- something along the lines of Leon Weber's WikiCharts tool, but with a larger limit (ideally all 1.7 million articles).
Producing such statistics will be a Google Summer of Code project this summer. If you can't wait that long, then we can give you a sampled, anonymised log stream to analyse.
Yes, summer would be too late: anonymised logs would be excellent for our purposes. Does "stream" mean that we would need to write a program to listen to the real-time log stream, or could you give us files?
Gregory Maxwell wrote:
Greetings, describe for me what your ideal data would look like.
Ideal data would be log files that just looked like:
Main Page\t1169499304.066
i.e., article titles as they appear in the XML dumps, plus the request time.
A close second choice would be simply-anonymized logs, e.g.:
sq18.wikimedia.org 1715898 1169499304.066 0 - TCP_MEM_HIT/200 13208 GET http://en.wikipedia.org/wiki/Main_Page NONE/- text/html - - -
If the logs still contain duplicates due to requests being forwarded between squids, we'd need pointers on how to resolve those.
Please let me know what the next step is. Thanks for your help!
Reid
We received a very similar request from Vrije Universiteit, and we're now sending them a 1/10 sampled stream consisting of timestamp and URL, with duplicates removed, real-time via UDP. It would be easier for us if we could send you roughly the same thing. So for example:
1169499304.066 http://en.wikipedia.org/wiki/Main_Page
We don't have any system yet for periodically rotating, analysing and sending logs, so streams are certainly easier for us. We get somewhere on the order of 1.5 billion requests per day, and the simplified log line above has an average length of 97 bytes, so it's an unsampled data rate of about 135 GB per day. You'll probably want us to sample that down before we send it to you.
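On your end, receiving that stream should only take a few lines. As a rough sketch (Python; the bind address and port here are placeholders to be agreed on, and I'm assuming one "timestamp URL" line per datagram):

    import socket

    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    sock.bind(("0.0.0.0", 5500))  # placeholder address and port
    while True:
        data, _ = sock.recvfrom(65535)
        for line in data.decode("utf-8", "replace").splitlines():
            ts, url = line.split(" ", 1)
            print(ts, url)  # timestamp and full request URL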
Extracting the title as it appears in the XML dump is just a matter of finding the right part of the URL and then unescaping it.
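Roughly like this, as a sketch (it only handles /wiki/ article URLs and ignores things like /w/index.php requests):

    from urllib.parse import unquote

    def url_to_title(url):
        # e.g. http://en.wikipedia.org/wiki/Main_Page -> "Main Page"
        path = url.split("/wiki/", 1)[1]
        return unquote(path).replace("_", " ")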
You can contact me privately to get the technical details sorted out.
-- Tim Starling
On 4/1/07, Tim Starling tstarling@wikimedia.org wrote:
sending logs, so streams are certainly easier for us. We get somewhere on the order of 1.5 billion requests per day, and the simplified log line
Can I be the first to say "holy crap!"
Steve
On 04/04/07, Steve Bennett stevagewp@gmail.com wrote:
Can I be the first to say "holy crap!"
You're really that shocked?
Rob Church
On 4/4/07, Rob Church robchur@gmail.com wrote:
On 04/04/07, Steve Bennett stevagewp@gmail.com wrote:
Can I be the first to say "holy crap!"
You're really that shocked?
Had I sat down to think about it, perhaps, perhaps not. But I've never heard of a daily pageview figure expressed in *billions* before.
I had a webpage once. It got 400 pageviews in a year.
Steve
2007/4/4, Steve Bennett stevagewp@gmail.com:
On 4/4/07, Rob Church robchur@gmail.com wrote:
On 04/04/07, Steve Bennett stevagewp@gmail.com wrote:
Can I be the first to say "holy crap!"
You're really that shocked?
Had I sat down to think about it, perhaps, perhaps not. But I've never heard of a daily pageview figure expressed in *billions* before.
It's not a count of pageviews, but of http requests, is it?
AJF/WarX
On 04/04/07, Artur Fijałkowski wiki.warx@gmail.com wrote:
It's not a count of pageviews, but of http requests, is it?
That's right.
Rob Church
On 4/4/07, Rob Church robchur@gmail.com wrote:
On 04/04/07, Artur Fijałkowski wiki.warx@gmail.com wrote:
It's not a count of pageviews, but of http requests, is it?
That's right.
Ah, ok. So a page with a hundred images is like 105 http requests including CSS etc, but only one page view.
Steve
On Thu, 2007-05-04 at 11:09 +1000, Steve Bennett wrote:
On 4/4/07, Rob Church robchur@gmail.com wrote:
On 04/04/07, Artur Fijałkowski wiki.warx@gmail.com wrote:
It's not a count of pageviews, but of http requests, is it?
That's right.
Ah, ok. So a page with a hundred images is like 105 http requests including CSS etc, but only one page view.
It gets kind of complicated, since CSS and JS files, as well as skin images, are usually pretty well cached. We do about a 4-to-1 ratio of hits-to-pages on Wikitravel; I'd be surprised if that varied by more than 2x in either direction for Wikipedia.
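As a very rough back-of-the-envelope figure under that assumption, the 1.5 billion requests per day mentioned earlier would put Wikipedia somewhere around 375 million page views per day, give or take that factor of two.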
-Evan
Evan Prodromou evan@prodromou.name http://evan.prodromou.name/
Evan Prodromou wrote:
It gets kind of complicated, since CSS and JS files, as well as skin images, are usually pretty well cached. We do about a 4-to-1 ratio of hits-to-pages on Wikitravel; I'd be surprised if that varied by more than 2x in either direction for Wikipedia.
-Evan
Which of these statistics has relevance, and at what level of granularity? The USA Today readers are looking for something as simple as "# of daily page views", which any surfer can appreciate. The http request tally makes sense to developers who are concerned about the load on our servers, tweaks in performance, etc. Marketers want uniques per day or month, etc.
Which of these stats should be developed to give accurate information to the world about what performance is being achieved, in an apples-to-apples comparison to existing suites? What will give WMF the most credibility in reporting in the future?
(and a HUGE thank you to Tim for making this happen)
On Thu, 2007-05-04 at 10:03 -0400, Brad Patrick wrote:
Which of these statistics has relevance, and at what level of granularity? The USA Today readers are looking for something as simple as "# of daily page views", which any surfer can appreciate. The http request tally makes sense to developers who are concerned about the load on our servers, tweaks in performance, etc. Marketers want uniques per day or month, etc.
That's about it: page views per day, hits per day, and unique visitors per month are the three main stats people care about.
Which of these stats should be developed to give accurate information to the world about what performance is being achieved, in an apples-to-apples comparison to existing suites? What will give WMF the most credibility in reporting in the future?
Page views per day, I think.
-Evan
Evan Prodromou evan@prodromou.name http://evan.prodromou.name/
On 4/6/07, Evan Prodromou evan@prodromou.name wrote:
Which of these stats should be developed to give accurate information to the world about what performance is being achieved, in an apples-to-apples comparison to existing suites? What will give WMF the most credibility in reporting in the future?
Page views per day, I think.
Better be a bit more granular than that. Article views, edit-related views, RC views, front-page views, etc. As Yahoo! found out recently, "page views per day" tends to drop rather noticeably when you Ajax away some of the unnecessary ones. I would say it's not a very useful statistic.
On Wed, Apr 04, 2007 at 01:30:57AM +0100, Rob Church wrote:
On 04/04/07, Steve Bennett stevagewp@gmail.com wrote:
Can I be the first to say "holy crap!"
You're really that shocked?
A couple of years ago, when I was temporarily in the running for Local Hands (I live about 20 miles west of the datacenter), we were *just starting* to bump our heads on a 100Mb/s port.
So *I* was a bit shocked. :-)
Cheers, -- jra