On Wed, Jan 30, 2013 at 4:20 PM, Jörn Hees <wikistats(a)joernhees.de> wrote:
> afaik it's the other way around: stats.grok.se
> aggregates the data from the pagecounts-raw dumps.
Oh right! I never noticed the hostname changing when I clicked on the
"data available here" link on stats.grok.se :-) That actually makes me
feel a lot better knowing the data collection is happening on a
Wikimedia server.
> The results you list are from the normal Wikipedias in
> the different languages, so I guess Wikidata stats are not collected yet.
> I'd also be very interested in that data to combine it with linked data…
Does anyone know who/what collects the pagecounts-raw dumps on
dumps.wikimedia.org?
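
For reference, here is a rough sketch of the kind of aggregation
described above. It assumes the hourly pagecounts-raw files contain
space-separated lines of the form "project page_title view_count
bytes_transferred". The function names are mine and purely
illustrative; this is not how stats.grok.se actually does it.

```python
from collections import Counter


def parse_line(line):
    """Parse one pagecounts line; return (project, title, views) or None."""
    parts = line.rstrip("\n").split(" ")
    if len(parts) != 4:
        return None  # skip malformed lines
    project, title, views, _size = parts
    try:
        return project, title, int(views)
    except ValueError:
        return None  # view count was not a number


def aggregate(lines, project="en"):
    """Sum view counts per page title for a single project code."""
    totals = Counter()
    for line in lines:
        parsed = parse_line(line)
        if parsed and parsed[0] == project:
            totals[parsed[1]] += parsed[2]
    return totals
```

To run it over real files you would feed in the lines of each hourly
dump, for example from gzip.open(path, "rt"), and add up the counters.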
//Ed
PS. Re: Linked Data, I think Wikidata is going to be a big deal in that
area, yes. I just saw this morning that it's pretty easy to use the
MediaWiki API to get at the entities:
https://gist.github.com/4681747
I think other data formats (RDF) are in the pipeline.
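
In case it helps, a minimal sketch of fetching entity labels (this is
not the code from the gist). It assumes the wbgetentities action on
https://www.wikidata.org/w/api.php; the helper names and the
User-Agent string are made up for illustration.

```python
import json
import urllib.parse
import urllib.request

API_URL = "https://www.wikidata.org/w/api.php"  # assumed endpoint


def build_entity_url(entity_ids, languages=("en",)):
    """Build a wbgetentities request URL for the given entity ids."""
    params = {
        "action": "wbgetentities",
        "ids": "|".join(entity_ids),
        "languages": "|".join(languages),
        "props": "labels",
        "format": "json",
    }
    return API_URL + "?" + urllib.parse.urlencode(params)


def extract_labels(response, language="en"):
    """Map each entity id to its label in `language` (None if missing)."""
    labels = {}
    for entity_id, entity in response.get("entities", {}).items():
        label = entity.get("labels", {}).get(language, {}).get("value")
        labels[entity_id] = label
    return labels


def fetch_labels(entity_ids, language="en"):
    """Call the API over the network and return {entity_id: label}."""
    request = urllib.request.Request(
        build_entity_url(entity_ids, (language,)),
        headers={"User-Agent": "wikidata-example/0.1 (illustrative)"},
    )
    with urllib.request.urlopen(request) as resp:
        return extract_labels(json.load(resp), language)
```

Calling fetch_labels(["Q42"]) would hit the live API; the other two
functions work offline, so they are easy to try against a saved
response.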