Hay (Husky), 25/03/2015 11:03:
Answering my own question: until somebody puts up a stats.grok.se-like
interface for the mediacounts, I've hacked together a Python script
that can be used to 'query' the TSV files for a single file or a list
of files:
https://github.com/hay/wiki-tools/blob/master/etc/mediacounts-stats.py
And I sent a small, silly patch that lets you give a category name like
https://commons.wikimedia.org/wiki/Category:Media_from_BEIC as input.
Example output attached for the lazy.
Some data I found particularly interesting:
1) the sum of columns 11–14 (big thumbs),
2) the ratio between (1) and column 3 (total transfers),
3) column 24 (no Wikimedia referrer).
Total transfers in this small sample seem even higher than pageviews.
(1) counts thumbs above 400 pixels, which are usually not embedded by
default, so (2) should tell roughly what share of transfers came from
users who clicked through or otherwise requested a larger version.
(3) may indicate which files "went viral".
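
For anyone who wants to recompute the three figures above, here is a
rough Python sketch. It is not taken from mediacounts-stats.py; the
function names are made up for illustration. It assumes the column
numbers are 1-based as in my list, that each TSV row has at least 24
tab-separated fields with the file name in column 1, and that any
non-numeric field can be counted as 0.

```python
import csv


def summarize_row(row):
    """Compute figures (1)-(3) for one mediacounts row (a list of strings)."""

    def col(n):
        # Columns are 1-based, as in the message; non-numeric values count as 0
        # (an assumption, not documented behaviour of the dataset).
        value = row[n - 1]
        return int(value) if value.isdigit() else 0

    big_thumbs = sum(col(n) for n in range(11, 15))  # (1) columns 11-14
    total_transfers = col(3)
    # (2) share of transfers that were big thumbs; guard against division by zero
    ratio = big_thumbs / total_transfers if total_transfers else 0.0
    return {
        "file": row[0],
        "big_thumbs": big_thumbs,
        "total_transfers": total_transfers,
        "big_thumb_ratio": ratio,
        "no_wikimedia_referrer": col(24),  # (3) column 24
    }


def summarize_tsv(lines):
    """Summarize every row that has at least 24 fields; shorter rows are skipped."""
    reader = csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    return [summarize_row(row) for row in reader if len(row) >= 24]
```

Pass summarize_tsv() an open file object (or any iterable of lines) for
one of the TSV files; sorting the result by "no_wikimedia_referrer" is
a quick way to look for the files in (3).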
Nemo