On 12/10/14, Andrew Lih <andrew(a)andrewlih.com> wrote:
Brian, there were some interesting results in the data you filtered from
the database. The good news is that it syncs quite well with the data we
had from January 2013, in terms of ogg, ogv and webm. A few notes:
1. These are the most popular Commons videos in en.wp. Pretty much the same
as January 2013 except for #2, where someone really wanted to embed that
Reagan Speech in a lot of places.
Commercial-LBJ1964ElectionAdDaisyGirl.ogv 13
Reagan Speech Beirut Bombing.ogv 12
Machinima sample reindeer full size.ogg 9
1946-10-08 21 Nazi Chiefs Guilty.ogv 9
SeaSnails.ogg 8
Shakinghands high.OGG 7
The Impact Of Wikipedia.webm 6
CollateralMurder.ogv 6
1946-07-15 Philippines Independence Proclaimed.ogv 6
2. These are the most popular long GIFs on Commons, used in en.wp:
EC-EU-enlargement animation.gif 53
Linguistic map Southwestern Europe.gif 18
Canada provinces evolution 2.gif 12
Pangea animation 03.gif 11
Mohammad adil-Rashidun empire-slide.gif 10
3. We may have to tweak the GIF filter. For some reason, it picked up some
odd results like classifying these LOCAL en.wp Mexico-related stub GIF
icons as video. The metadata page does not suggest they should be seen as
long animations. The files are, from the table listing:
Mx-actor.gif 275
Mx-singer.gif 49
Mx-actor.gif, Mx-singer.gif 43
https://en.wikipedia.org/wiki/File:Mx-actor.gif
-Andrew
According to the metadata, Mx-actor.gif is an animated GIF consisting
of 1 frame that's shown for 10 seconds... Which is odd. I've now excluded
all animated GIFs that are only a single frame long.
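
A minimal sketch of that kind of exclusion, in case it helps to see the
logic spelled out. This is not the actual report query: the function
name, the parameter names and the 5-second threshold are all made up for
illustration, and it assumes the frame count and total duration have
already been read from the file metadata.

```python
def is_long_animation(frame_count, duration_seconds, min_duration=5.0):
    """Decide whether a GIF should be counted as a video-like animation.

    min_duration is a hypothetical threshold, not the report's real value.
    """
    if frame_count <= 1:
        # A single frame is a still image, however long it is "shown".
        return False
    return duration_seconds >= min_duration


# Mx-actor.gif as described above: 1 frame shown for 10 seconds.
print(is_long_animation(frame_count=1, duration_seconds=10.0))
```

With the single-frame check in place, a file like Mx-actor.gif is
rejected before its duration is even looked at.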
This report should automatically update once a week on Tuesdays at
roughly 7am UTC.
One thing I should note about that report is that the columns will get
cut off if they exceed 4096 characters.
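
If anyone post-processes the report, a rough way to flag values that
were probably cut off is to check whether they sit right at the limit.
This is only a sketch: the helper name is hypothetical, and it assumes
the limit is counted in characters, as stated above, rather than bytes.

```python
COLUMN_LIMIT = 4096  # character limit mentioned above


def may_be_truncated(value, limit=COLUMN_LIMIT):
    """Return True if a column value is long enough to have been cut off."""
    # A value at (or past) the limit may have lost its tail; a shorter
    # value cannot have been truncated by this limit.
    return len(value) >= limit


print(may_be_truncated("Mx-actor.gif, Mx-singer.gif"))
print(may_be_truncated("x" * 4096))
```

A value that happens to be exactly 4096 characters long would be flagged
too, so treat a positive result as "check this row by hand" rather than
proof of truncation.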
I also created a second report for videos on Commons that are used on
any wiki in any namespace. It's at
https://tools.wmflabs.org/bawolff/usedVideosCommons.htm (The query for
this report is actually a lot more efficient than the query of the
other one. This suggests that if performance ever became an issue, the
other query could probably be optimized, but I don't see it being an
issue.) That report is updated every Wednesday at about 7am.
Cheers,
--bawolff
P.S. The Reagan videos being everywhere is amusing.