Brian, thanks. Yes, that is what I'm looking for.
In fact, a monthly report on a regular basis would be really interesting to see.
I've worked in the past with Ward Cunningham on his fast parser to get some initial data for last year, but it'd be great to get an update.
FYI, here were some of our findings then:
Using a January 2013 dump of the English Wikipedia database, we were able to identify 4,061 instances of video files embedded in Wikipedia articles.
Count  Video file
169    Articleevolution.ogg
21     Verifiability and Neutral point of view (Common Craft)-600px-en.ogv
13     Commercial-LBJ1964ElectionAdDaisyGirl.ogv
9      SeaSnails.ogg
8      Machinima sample reindeer full size.ogg
8      1946-10-08 21 Nazi Chiefs Guilty.ogv
6      Wikipedia video tutorial-1-Editing-en.ogv
6      The Impact Of Wikipedia.webm
6      LightningCNP.ogg
6      Camouflage (1944).ogv
After removing duplicate uses of the same video in multiple articles, the number of unique videos used in articles was 3,100. Overall, video use in English Wikipedia articles is fairly low: roughly 0.1% when set against the 4.2 million articles that existed in January 2013.
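For anyone who wants to check the arithmetic behind the 0.1% figure, here is a minimal sketch using only the numbers quoted above. It assumes the article count is a round 4.2 million, and note that neither ratio is exactly "share of articles containing a video", since one article can embed more than one file.

```python
# Figures quoted in the message above.
embedded_instances = 4_061   # video embeds found in the January 2013 dump
unique_videos = 3_100        # after removing duplicate uses across articles
articles = 4_200_000         # approximate English Wikipedia article count (assumed round)

# Both values are expressed as a percentage of the article count.
instance_rate = embedded_instances / articles * 100
unique_rate = unique_videos / articles * 100

print(f"embeds as % of articles:        {instance_rate:.3f}")
print(f"unique videos as % of articles: {unique_rate:.3f}")
```

Both values round to about 0.1%, which matches the rate given above.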
-Andrew
-Andrew Lih
Associate professor of journalism, American University
Email: andrew@andrewlih.com
WEB: http://www.andrewlih.com
BOOK: The Wikipedia Revolution: http://www.wikipediarevolution.com
PROJECT: Wiki Makes Video http://en.wikipedia.org/wiki/Wikipedia:WikiProject_Wiki_Makes_Video
On Fri, Dec 5, 2014 at 1:19 PM, Brian Wolff bawolff@gmail.com wrote:
On Dec 5, 2014 12:09 PM, "Jan Ainali" jan.ainali@wikimedia.se wrote:
When structured data on Commons is live, this will be quite easy
(allowing time for media to be tagged also).
If you are looking for a solution that works now I do not have any
better ideas.
Kind regards, Jan Ainali
Executive Director, Wikimedia Sverige 0729 - 67 29 48
Imagine a world where every person has free access to
humanity's collected knowledge. That is what we do.
Become a member.
2014-12-05 16:24 GMT+01:00 Andrew Lih andrew@andrewlih.com:
I'm wondering what people have found to be the best practices for
identifying video in Wikipedia articles.
A number of issues:
- One problem is that OGG is a container format, so simply parsing
  article wikimarkup may not be sufficient to identify video content.
- You can go by category, but this is not always fully accurate.
- Are animated GIFs considered video? Some are, and some aren't.
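To make the container problem concrete, here is a rough sketch of what markup-only detection looks like. The extension buckets and the regular expression are assumptions for illustration, not an established taxonomy: .ogv and .webm are treated as video, while .ogg and .gif are set aside as undecidable from the file name alone. It also misses files embedded through templates, galleries, or infobox parameters.

```python
import re

# Assumed extension buckets. An Ogg container can hold audio only or video,
# and a GIF may or may not be animated, so both need a metadata check.
VIDEO_EXT = {"ogv", "webm"}
AMBIGUOUS_EXT = {"ogg", "gif"}

# Matches [[File:Name.ext|...]] and [[Image:Name.ext|...]] embeds.
FILE_LINK = re.compile(r"\[\[\s*(?:File|Image)\s*:\s*([^|\]]+)", re.IGNORECASE)


def classify_embeds(wikitext):
    """Return (definite_video, needs_metadata_check) lists of file names."""
    definite, ambiguous = [], []
    for match in FILE_LINK.finditer(wikitext):
        name = match.group(1).strip()
        ext = name.rsplit(".", 1)[-1].lower() if "." in name else ""
        if ext in VIDEO_EXT:
            definite.append(name)
        elif ext in AMBIGUOUS_EXT:
            ambiguous.append(name)
    return definite, ambiguous


sample = "[[File:Camouflage (1944).ogv|thumb]] [[File:SeaSnails.ogg]] [[Image:Map.png]]"
print(classify_embeds(sample))
# prints (['Camouflage (1944).ogv'], ['SeaSnails.ogg'])
```

The second list is the part that markup parsing cannot settle by itself, which is why file metadata or categories come into it.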
Interested in hearing what people think, or whether we have a taxonomy
of video types that are well defined.
-Andrew
Are you looking for a list of articles with videos? We can probably do that now with a db query. There may be a small number of false negatives on the ogg front, but probably 98% of them can be identified from the db. GIFs present a complicating factor, but are probably still doable.
--bawolff
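For reference, the kind of db query mentioned above might look roughly like the sketch below. It runs against a toy in-memory SQLite database, not the real wiki replicas. The table and column names (image.img_name, image.img_media_type, imagelinks.il_from, imagelinks.il_to) are modelled on my understanding of the MediaWiki schema at the time and should be treated as assumptions; the real tables have many more columns, most files live on Commons rather than in the local image table, and a real query would also filter to the article namespace. The row for Some_song.ogg is a made-up example of an audio-only Ogg file.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Toy stand-ins for the MediaWiki tables (assumed column names).
CREATE TABLE image (img_name TEXT PRIMARY KEY, img_media_type TEXT);
CREATE TABLE imagelinks (il_from INTEGER, il_to TEXT);

INSERT INTO image VALUES
  ('Camouflage_(1944).ogv', 'VIDEO'),
  ('SeaSnails.ogg', 'VIDEO'),
  ('Some_song.ogg', 'AUDIO'),   -- hypothetical audio-only Ogg file
  ('Map.png', 'BITMAP');

INSERT INTO imagelinks VALUES
  (1, 'Camouflage_(1944).ogv'), (1, 'Map.png'),
  (2, 'SeaSnails.ogg'), (3, 'Some_song.ogg'), (4, 'SeaSnails.ogg');
""")

# Count the distinct pages using each file whose stored media type is VIDEO.
# The media type, not the extension, separates video .ogg from audio .ogg.
rows = conn.execute("""
SELECT il.il_to, COUNT(DISTINCT il.il_from) AS pages
FROM imagelinks AS il
JOIN image AS img ON img.img_name = il.il_to
WHERE img.img_media_type = 'VIDEO'
GROUP BY il.il_to
ORDER BY pages DESC, il.il_to
""").fetchall()

print(rows)
# prints [('SeaSnails.ogg', 2), ('Camouflage_(1944).ogv', 1)]
```

Animated GIFs would not be caught by a media-type filter like this one, so they would need separate handling.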
Wikivideo-l mailing list
Wikivideo-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikivideo-l