We’re glad to announce the release of an aggregate clickstream dataset extracted from English Wikipedia.

This dataset contains counts of (referer, article) pairs aggregated from the HTTP request logs of English Wikipedia. This snapshot captures 22 million (referer, article) pairs from a total of 4 billion requests collected during the month of January 2015.

This data can be used for various purposes:
• determining the most frequent links people click on for a given article
• determining the most common links people followed to an article
• determining how much of the total traffic to an article clicked on a link in that article
• generating a Markov chain over English Wikipedia

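
As a rough illustration of the first and last uses listed above, here is a minimal Python sketch. It assumes the data has already been loaded as (referer, article, count) tuples; that layout, the function names, and the sample rows are all made up for illustration and do not describe the released file format.

```python
from collections import defaultdict


def build_transitions(rows):
    """Turn (referer, article, count) rows into per-referer transition probabilities."""
    counts = defaultdict(dict)
    for referer, article, n in rows:
        # Sum counts in case the same pair appears more than once.
        counts[referer][article] = counts[referer].get(article, 0) + n

    transitions = {}
    for referer, targets in counts.items():
        total = sum(targets.values())
        # Normalize so the outgoing probabilities of each referer sum to 1.
        transitions[referer] = {a: n / total for a, n in targets.items()}
    return transitions


def top_links(rows, referer, k=3):
    """Return the k articles most often reached from `referer`, highest count first."""
    totals = defaultdict(int)
    for ref, article, n in rows:
        if ref == referer:
            totals[article] += n
    # Sort by descending count, breaking ties alphabetically.
    return sorted(totals.items(), key=lambda kv: (-kv[1], kv[0]))[:k]


# Hypothetical sample rows, NOT taken from the real dataset.
sample_rows = [
    ("London", "England", 60),
    ("London", "River_Thames", 30),
    ("London", "United_Kingdom", 10),
    ("England", "London", 50),
]

transitions = build_transitions(sample_rows)
print(top_links(sample_rows, "London", k=2))
print(transitions["London"])
```

Note that this normalizes over observed clicks only, so it models where readers go given that they follow a link. Estimating how often readers leave without clicking would also need the total traffic to each article.
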
We created a page on Meta for feedback and discussion about this release:
https://meta.wikimedia.org/wiki/Research_talk:Wikipedia_clickstream

Ellery and Dario
_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics