We’re glad to announce the release of an aggregate clickstream dataset extracted from English Wikipedia.
This data can be used for various purposes:
• determining the links people most frequently click on in a given article
• determining the links people most commonly follow to reach an article
• determining what share of the total traffic to an article goes on to click a link in that article
• generating a Markov chain over English Wikipedia
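
As a rough illustration of the uses listed above, here is a minimal Python sketch. It assumes the data has already been loaded as (source article, target article, click count) tuples; that layout, the toy records, and the function names are hypothetical and not taken from the dataset's actual schema. The Markov chain here is simply each article's outgoing click counts normalized to sum to 1, which ignores readers who leave without clicking.

```python
from collections import defaultdict

# Hypothetical toy records: (source article, target article, click count).
# These values are made up for illustration; they are not from the dataset.
records = [
    ("London", "England", 60),
    ("London", "River_Thames", 40),
    ("Paris", "France", 30),
    ("England", "London", 50),
]

def top_outgoing(records, article, k=3):
    """Return the k most-clicked links on `article` as (target, count) pairs."""
    links = [(tgt, n) for src, tgt, n in records if src == article]
    return sorted(links, key=lambda pair: pair[1], reverse=True)[:k]

def top_incoming(records, article, k=3):
    """Return the k most common links followed to `article` as (source, count) pairs."""
    links = [(src, n) for src, tgt, n in records if tgt == article]
    return sorted(links, key=lambda pair: pair[1], reverse=True)[:k]

def transition_probabilities(records):
    """Estimate first-order Markov transition probabilities from click counts."""
    # Total observed outgoing clicks per source article.
    totals = defaultdict(int)
    for src, _, n in records:
        totals[src] += n
    # Normalize each (source, target) count by the source's total.
    probs = defaultdict(dict)
    for src, tgt, n in records:
        probs[src][tgt] = n / totals[src]
    return dict(probs)

if __name__ == "__main__":
    print(top_outgoing(records, "London"))
    print(top_incoming(records, "London"))
    print(transition_probabilities(records)["London"])
```

For the full dataset, the same grouping and normalization would more likely be done with a dataframe library or a database than with in-memory lists.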
We created a page on Meta for feedback and discussion about this release:
https://meta.wikimedia.org/wiki/Research_talk:Wikipedia_clickstream