On Wed, Jan 30, 2013 at 4:20 PM, Jörn Hees wikistats@joernhees.de wrote:
afaik it's the other way around: stats.grok.se aggregates the data from the pagecounts-raw dumps.
Oh right! I never noticed the hostname changing when I clicked on the "data available here" link on stats.grok.se :-) That actually makes me feel a lot better knowing the data collection is happening on a wikimedia server.
The results you list are from the normal wikipedias in the different languages, so i guess wikidata stats are not collected yet. I'd also be very interested in that data to combine it with linked data…
Does anyone know who/what collects the pagecounts-raw dumps on dumps.wikimedia.org?
//Ed
PS. re: Linked Data I think wikidata is going to be a big deal in that area yes. I just saw this morning that it's pretty easy to use the MediaWiki API to get at the entities https://gist.github.com/4681747 I think other data formats (RDF) are in the pipeline.