Hi Sérgio,
Some universities (like ours) receive a 1/100 sample of the whole set of petitions processed by Wikimedia Squid servers.
It is provided on direct request, however. As far as I know the data is not consistently archived in a public repository anywhere (but I maybe unaware of some system storing that info).
Some work has already been published on this topic:
* A. J. Reinoso, J. M. Gonzalez-Barahona, G. Robles, and F. Ortega, "A quantitative approach to the use of the wikipedia," in 2009 IEEE Symposium on Computers and Communications. IEEE, July 2009, pp. 56-61. [Online]. Available: http://dx.doi.org/10.1109/ISCC.2009.5202401
Regards, Felipe.
--- El mar, 13/4/10, S. Nunes snunes@gmail.com escribió:
De: S. Nunes snunes@gmail.com Asunto: [Wiki-research-l] Access to HTTP access logs for Wikipedia articles? Para: "Wikipedia Research List" wiki-research-l@lists.wikimedia.org Fecha: martes, 13 de abril, 2010 13:23 Hi all,
I presume that Wikipedia keeps data about HTTP accesses to all articles. Can anybody inform me if this data is available for research purposes?
I am particularly interested in HTTP referral information for each article. I suspect that this information could be used to estimate topical relevance for each document. Access to this information poses no risk to users' privacy since no user information is made available
- sessions' id, hour/minute timestamp data and IPs could be
easily discarded.
I am new to this list, so I really don't know if this has been previously discussed. I searched the archives and found no relevant results on this issue.
Thanks in advance for your feedback,
Sérgio Nunes
Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l