Brian, thanks. Yes, that is what I'm looking for.
In fact, a regular monthly report would be really interesting to see.
I've worked in the past with Ward Cunningham on his fast parser to get some
initial data for last year, but it'd be great to get an update.
FYI, here were some of our findings then:
In a January 2013 dump of the English Wikipedia database, we were able to
identify 4,061 instances of video files embedded in Wikipedia articles.
Count: Video file
169 Articleevolution.ogg
21 Verifiability and Neutral point of view (Common Craft)-600px-en.ogv
13 Commercial-LBJ1964ElectionAdDaisyGirl.ogv
9 SeaSnails.ogg
8 Machinima sample reindeer full size.ogg
8 1946-10-08 21 Nazi Chiefs Guilty.ogv
6 Wikipedia video tutorial-1-Editing-en.ogv
6 The Impact Of Wikipedia.webm
6 LightningCNP.ogg
6 Camouflage (1944).ogv
After removing duplicate uses of the same video in multiple articles, the
number of unique videos used in articles was 3,100. Overall, video use in
English Wikipedia articles is fairly low, at a rate of about 0.1% when
compared to the 4.2 million articles in January 2013.
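
For anyone who wants to check the arithmetic, here is a quick sketch using only the figures quoted above. The variable names are mine, and note that the per-article rate based on embeds is only an upper bound on the share of articles containing video, since one article can embed several files.

```python
# Figures quoted above (January 2013 English Wikipedia dump).
embedded_instances = 4_061   # video embeds counted across articles
unique_videos = 3_100        # distinct files after removing repeat uses
total_articles = 4_200_000   # approximate article count, January 2013


def percent(part, whole):
    """Return part/whole expressed as a percentage."""
    return 100.0 * part / whole


instance_rate = percent(embedded_instances, total_articles)
unique_rate = percent(unique_videos, total_articles)
repeat_uses = embedded_instances - unique_videos

print(f"embeds per 100 articles:        {instance_rate:.3f}")
print(f"unique videos per 100 articles: {unique_rate:.3f}")
print(f"repeat uses of a video:         {repeat_uses}")
```

Both rates round to 0.1%, so the conclusion is the same whichever count is used.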
-Andrew
-Andrew Lih
Associate professor of journalism, American University
Email: andrew(a)andrewlih.com
On Fri, Dec 5, 2014 at 1:19 PM, Brian Wolff <bawolff(a)gmail.com> wrote:
On Dec 5, 2014 12:09 PM, "Jan Ainali" <jan.ainali(a)wikimedia.se> wrote:
When structured data on Commons is live, this will be quite easy
(allowing time for media to be tagged also).
If you are looking for a solution that works now I do not have any
better ideas.
Kind regards,
Jan Ainali
Executive Director, Wikimedia Sverige
0729 - 67 29 48
Imagine a world in which every person has free access to the sum of
humanity's knowledge. That is what we do.
Become a member.
2014-12-05 16:24 GMT+01:00 Andrew Lih <andrew(a)andrewlih.com>:
>
> I'm wondering what people have found to be the best practices for
> identifying video in Wikipedia articles.
>
> A number of issues:
>
> - One of the problems is that OGG is a container, so simply parsing
> article Wikimarkup may not be sufficient to identify video content.
>
> - You can go by category, but this is not always fully accurate
>
> - Are GIFs that are animated considered video? Some are, and some
> aren't.
>
> Interested in hearing what people think, or whether we have a taxonomy
> of video types that are well defined.
>
>
> -Andrew
>
Are you looking for a list of articles with videos? We can probably do
that now with a db query. There may be a small number of false negatives on
the ogg front, but probably 98% of them can be identified from the db. GIFs
present a complicating factor, but that is probably still doable.
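
To make the idea concrete, here is a rough sketch of what such a query could look like. Everything in it is an assumption for illustration: the three tables are simplified stand-ins loosely modeled on MediaWiki's page, imagelinks and image tables, loaded into an in-memory SQLite database with made-up rows. The real schema has more columns and may differ between versions, and most files used on Wikipedia actually live on Commons, so a production query would need to look there too.

```python
import sqlite3

# Hypothetical, simplified tables loosely modeled on MediaWiki's schema.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE page (page_id INTEGER PRIMARY KEY, page_title TEXT);
    CREATE TABLE imagelinks (il_from INTEGER, il_to TEXT);
    CREATE TABLE image (img_name TEXT PRIMARY KEY, img_media_type TEXT);
""")

# Made-up example rows: two .ogg files, one video and one audio-only.
conn.executemany("INSERT INTO page VALUES (?, ?)", [
    (1, "Article_A"), (2, "Article_B"), (3, "Article_C"),
])
conn.executemany("INSERT INTO imagelinks VALUES (?, ?)", [
    (1, "Example_clip.ogg"),
    (2, "Example_clip.ogg"),
    (2, "Example_song.ogg"),
    (3, "Example_anim.gif"),
])
conn.executemany("INSERT INTO image VALUES (?, ?)", [
    ("Example_clip.ogg", "VIDEO"),
    ("Example_song.ogg", "AUDIO"),
    ("Example_anim.gif", "BITMAP"),
])

# Join file uses to the stored media type instead of trusting the extension.
rows = conn.execute("""
    SELECT p.page_title, il.il_to
    FROM imagelinks AS il
    JOIN image AS i ON i.img_name = il.il_to
    JOIN page AS p ON p.page_id = il.il_from
    WHERE i.img_media_type = 'VIDEO'
    ORDER BY p.page_title, il.il_to
""").fetchall()

unique_videos = {file_name for _, file_name in rows}
print(len(rows), "video uses,", len(unique_videos), "unique video(s)")
```

The point of the join is that the stored media type separates the video .ogg from the audio-only .ogg, which the file extension alone cannot do. The animated GIF is not picked up here because it is recorded as a bitmap, which is the complicating factor mentioned above.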
--bawolff
_______________________________________________
Wikivideo-l mailing list
Wikivideo-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikivideo-l