On Dec 5, 2014, at 10:50 AM, Andrew Lih <andrew@andrewlih.com> wrote:

Brian, thanks, yes, that would be what I'd be looking for.

In fact, a monthly report on a regular basis would be really interesting to see.

I've worked in the past with Ward Cunningham on his fast parser to get some initial data for last year, but it'd be great to get an update.

FYI, here were some of our findings then:

From a January 2013 dump of the English Wikipedia database, we were able to identify 4,061 instances of video files embedded in Wikipedia articles.

Count: Video file
169 Articleevolution.ogg
21 Verifiability and Neutral point of view (Common Craft)-600px-en.ogv
13 Commercial-LBJ1964ElectionAdDaisyGirl.ogv
9 SeaSnails.ogg
8 Machinima sample reindeer full size.ogg
8 1946-10-08 21 Nazi Chiefs Guilty.ogv
6 Wikipedia video tutorial-1-Editing-en.ogv
6 The Impact Of Wikipedia.webm
6 LightningCNP.ogg
6 Camouflage (1944).ogv

The number of actual unique videos used in articles was 3,100, removing duplicate uses of videos in multiple articles. Overall, the number of videos used in English Wikipedia articles is fairly low, at a rate of 0.1%, when compared to the 4.2 million articles in January 2013.
-Andrew

-Andrew Lih
Associate professor of journalism, American University
Email: andrew@andrewlih.com
WEB: http://www.andrewlih.com
BOOK: The Wikipedia Revolution: http://www.wikipediarevolution.com
PROJECT: Wiki Makes Video http://en.wikipedia.org/wiki/Wikipedia:WikiProject_Wiki_Makes_Video

On Fri, Dec 5, 2014 at 1:19 PM, Brian Wolff <bawolff@gmail.com> wrote:
On Dec 5, 2014 12:09 PM, "Jan Ainali" <jan.ainali@wikimedia.se> wrote:
>
> When structured data on Commons is live, this will be quite easy (allowing time for media to be tagged also).
>
> If you are looking for a solution that works now I do not have any better ideas.
>
> Kind regards,
> Jan Ainali
>
> Executive Director, Wikimedia Sverige
> 0729 - 67 29 48
>
>
> Imagine a world in which every human being has free access to the sum of humanity's knowledge. That is what we do.
> Become a member.
>
>
> 2014-12-05 16:24 GMT+01:00 Andrew Lih <andrew@andrewlih.com>:
>>
>> I'm wondering what people have found to be the best practices for identifying video in Wikipedia articles.
>>
>> A number of issues:
>>
>> - One of the problems is that OGG is a container format, so simply parsing article wiki markup may not be sufficient to identify video content.
>>
>> - You can go by category, but this is not always fully accurate.
>>
>> - Are GIFs that are animated considered video? Some are, and some aren't.
>>
>> Interested in hearing what people think, or whether we have a taxonomy of video types that are well defined.
>>
>>
>> -Andrew
>>

Are you looking for a list of articles with videos? We can probably do that now with a db query (there may be a small number of false negatives on the ogg front, but probably 98% of them can be identified from the db. Gifs present a complicating factor but are probably still doable).
--bawolff
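
The caveats raised in this thread (OGG being a container, animated GIFs being a gray area) can be illustrated with a minimal Python sketch that counts file embeds in wikitext and labels them by extension. This is not the db query or the parser mentioned above. The regular expression, the label names, and the functions `classify` and `count_embeds` are all hypothetical, and the sketch only sees plain `[[File:...]]` or `[[Image:...]]` links, so embeds made through templates or galleries would be missed.

```python
import re
from collections import Counter

# Assumption: embeds appear as [[File:Name.ext|...]] or [[Image:Name.ext|...]].
EMBED_RE = re.compile(r"\[\[\s*(?:File|Image)\s*:\s*([^|\]]+)", re.IGNORECASE)

# Hypothetical labels. The extension alone cannot settle .ogg (audio or
# video inside the container) or .gif (still or animated).
EXTENSION_LABELS = {
    "ogv": "video",
    "webm": "video",
    "ogg": "ambiguous-container",
    "gif": "ambiguous-animation",
}


def classify(filename):
    """Return a rough label for a file name based only on its extension."""
    if "." not in filename:
        return "other"
    ext = filename.rsplit(".", 1)[-1].lower()
    return EXTENSION_LABELS.get(ext, "other")


def count_embeds(wikitext):
    """Count embeds per (file name, label) pair found in one article's wikitext."""
    counts = Counter()
    for match in EMBED_RE.finditer(wikitext):
        name = match.group(1).strip()
        counts[(name, classify(name))] += 1
    return counts


if __name__ == "__main__":
    sample = (
        "[[File:SeaSnails.ogg|thumb]] some text "
        "[[File:Camouflage (1944).ogv|thumb|A film]] "
        "[[Image:Photo.jpg]]"
    )
    for (name, label), n in count_embeds(sample).items():
        print(n, label, name)
```

Files labeled as ambiguous would still need a second check against file metadata, which is where a database lookup would come in.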
_______________________________________________
Wikivideo-l mailing list
Wikivideo-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikivideo-l