Thanks for sharing the info. We are looking for the IP addresses as well, with anonymous names (like A, B, C) because we want to identify user sessions from the log.
Regards, Phani
On Wed, Sep 3, 2008 at 1:22 AM, Reid Priedhorsky reid@umn.edu wrote:
Phanikumar Bhamidipati wrote:
Hi All,
We are two research students looking for Wikipedia access data. We tried
to
use the statistics available at http://dammit.lt/wikistats/. But, we
would
like to know these data in detail: drilled down to per page access, i.e., triplets of the form <Page, IP, Date&Time>.
Could you please let us know if we can get such information? The IP
details
can be anonymous, if required. We are only looking for a detailed
Wikipedia
page access log information.
Wikimedia was kind enough to share a 1/10 streaming sample of their access logs with us and several other researchers. I do not know if they still do this. It's a LOT of data: several gigs per day even after compression. They consider IP addresses to be private data and share only <Page, Date&Time>.
Our contact for this is Tim Starling, tstarling@wikimedia.org I think.
Reid