Thanks for these helpful stats!
We’re painfully aware that video remains under-utilised on our sites.
Until we can devote more resources to making it easier to transcode, upload and share video,
it may be helpful to start discussions with both video contributors and article editors,
to better understand what would be needed for video to become more frequently integrated
in articles. This could inform future plans to allocate more resources to this important
Are any conversations taking place along those lines? If not, would anyone like to
spearhead a public discussion in coming months?
On Dec 5, 2014, at 10:50 AM, Andrew Lih wrote:
Brian, thanks. Yes, that is what I'm looking for.
In fact, a regular monthly report would be really interesting to see.
I've worked in the past with Ward Cunningham on his fast parser to get some initial
data for last year, but it'd be great to get an update.
FYI, here were some of our findings then:
Using a January 2013 dump of the English Wikipedia database, we were able to identify 4,061
instances of video files embedded in Wikipedia articles.
Count: Video file
21 Verifiability and Neutral point of view (Common Craft)-600px-en.ogv
8 Machinima sample reindeer full size.ogg
8 1946-10-08 21 Nazi Chiefs Guilty.ogv
6 Wikipedia video tutorial-1-Editing-en.ogv
6 The Impact Of Wikipedia.webm
6 Camouflage (1944).ogv
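
For anyone wanting to reproduce a tally like the one above, here is a minimal sketch. It assumes you already have a list of (article, file name) pairs extracted from a dump; the function name `tally_embeds` and the sample article titles are hypothetical, and this is not the parser that was actually used.

```python
from collections import Counter

def tally_embeds(embeds):
    """Count how often each video file is embedded.

    embeds: iterable of (article_title, file_name) pairs (assumed input shape).
    Returns (per-file counts, total embed instances, number of unique files).
    """
    counts = Counter(file_name for _article, file_name in embeds)
    total_instances = sum(counts.values())
    unique_files = len(counts)
    return counts, total_instances, unique_files

# Hypothetical sample data, for illustration only.
sample = [
    ("Article A", "Camouflage (1944).ogv"),
    ("Article B", "Camouflage (1944).ogv"),
    ("Article C", "The Impact Of Wikipedia.webm"),
]
counts, total, unique = tally_embeds(sample)
for name, n in counts.most_common():
    print(n, name)
```

The gap between total instances and unique files is simply the number of repeat uses of the same video across articles.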
The number of actual unique videos used in articles was 3,100, removing duplicate uses of
videos in multiple articles. Overall, the number of videos used in English Wikipedia
articles is fairly low, at a rate of 0.1%, when compared to the 4.2 million articles in
Associate professor of journalism, American University
Email: andrew(a)andrewlih.com
BOOK: The Wikipedia Revolution: http://www.wikipediarevolution.com
PROJECT: Wiki Makes Video
On Fri, Dec 5, 2014 at 1:19 PM, Brian Wolff <bawolff(a)gmail.com> wrote:
On Dec 5, 2014 12:09 PM, "Jan Ainali" <jan.ainali(a)wikimedia.se> wrote:
When structured data on Commons is live, this will be quite easy (allowing time for media
to be tagged also).
If you are looking for a solution that works now I do not have any better ideas.
Kind regards,
Executive Director, Wikimedia Sverige
0729 - 67 29 48
Imagine a world in which every person has free access to the sum of humanity's
knowledge. That is what we are doing.
2014-12-05 16:24 GMT+01:00 Andrew Lih <andrew(a)andrewlih.com>:
> I'm wondering what people have found to be the best practices for identifying
> video in Wikipedia articles.
> A number of issues:
> - One of the problems is that OGG is a container (it can hold audio or video), so
>   simply parsing article wiki markup may not be sufficient to identify video content.
> - You can go by category, but this is not always fully accurate.
> - Are animated GIFs considered video? Some are, and some aren't.
> Interested in hearing what people think, or whether we have a well-defined taxonomy of
> video types.
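
To illustrate the container problem raised above, here is a rough sketch of extension-based detection in article wikitext. The extension sets, the regular expression and the name `classify_media` are assumptions made for illustration. It deliberately puts `.ogg` files in an "ambiguous" bucket, since the extension alone cannot tell audio from video, and it does not attempt to handle GIFs.

```python
import re

# Assumed extension lists, for illustration only.
VIDEO_EXTS = {"ogv", "webm"}
AMBIGUOUS_EXTS = {"ogg"}  # container: may hold audio or video

# Matches the file name in links such as [[File:Name.ogv|thumb]].
FILE_LINK = re.compile(r"\[\[\s*(?:File|Image)\s*:\s*([^|\]]+)", re.IGNORECASE)

def classify_media(wikitext):
    """Sort embedded file names into 'video' and 'ambiguous' by extension."""
    result = {"video": [], "ambiguous": []}
    for match in FILE_LINK.finditer(wikitext):
        name = match.group(1).strip()
        ext = name.rsplit(".", 1)[-1].lower() if "." in name else ""
        if ext in VIDEO_EXTS:
            result["video"].append(name)
        elif ext in AMBIGUOUS_EXTS:
            # Needs a check of the file's actual media type to resolve.
            result["ambiguous"].append(name)
    return result
```

Files in the ambiguous bucket would still need a lookup of the file's real media type, which is why parsing markup alone is not enough.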
Are you looking for a list of articles with videos? We can probably do that now with a db
query (there may be a small number of false negatives on the OGG front, but probably 98%
of them can be identified from the db). GIFs present a complicating factor but probably still
Wikivideo-l mailing list
Product Manager, Multimedia