On 12/10/14, Andrew Lih andrew@andrewlih.com wrote:
Brian, there were some interesting results in the data you filtered from the database. The good news is that it syncs quite well with the data we had from January 2013, in terms of ogg, ogv and webm. A few notes:
- These are the most popular Commons videos in en.wp. Pretty much the same
as January 2013 except for #2, where someone really wanted to embed that Reagan Speech in a lot of places.
Commercial-LBJ1964ElectionAdDaisyGirl.ogv 13 Reagan Speech Beirut Bombing.ogv 12 Machinima sample reindeer full size.ogg 9 1946-10-08 21 Nazi Chiefs Guilty.ogv 9 SeaSnails.ogg 8 Shakinghands high.OGG 7 The Impact Of Wikipedia.webm 6 CollateralMurder.ogv 6 1946-07-15 Philippines Independence Proclaimed.ogv 6
- These are the most popular long GIFs on Commons, used in en.wp:
EC-EU-enlargement animation.gif 53 Linguistic map Southwestern Europe.gif 18 Canada provinces evolution 2.gif 12 Pangea animation 03.gif 11 Mohammad adil-Rashidun empire-slide.gif 10
- We may have to tweak the GIF filter. For some reason, it picked up some
odd results like classifying these LOCAL en.wp Mexico-related stub GIF icons as video. The metadata page does not suggest they should be seen as long animations. The files are, from the table listing:
Mx-actor.gif 275 Mx-singer.gif 49 Mx-actor.gif, Mx-singer.gif 43
https://en.wikipedia.org/wiki/File:Mx-actor.gif
-Andrew
According to the metadata, Mx-actor.gif is an animated gif consisting of 1 frame that's shown for 10 seconds... Which is odd. I've excluding all animated GIFs that are only a single frame long.
This report should automatically update once a week on tuesdays at roughly 7am UTC.
One thing I should note about that report is that the columns will get cut off if they exceed 4096 characters.
I also created a second report for videos on commons that are used on any wiki in any namespace. Its at https://tools.wmflabs.org/bawolff/usedVideosCommons.htm (The query for this report is actually a lot more efficient than the query of the other one. This suggests that if performance ever became an issue, the other query could probably be optimized, but I don't see it being an issue.) That report is updated every Wednesday at about 7am,
Cheers, --bawolff
p.s. The regan videos being everywhere is amusing.