Dear ones,
Where might I get or mirror a dump of Commons media files?
> It seems worth mentioning on the front page of
https://dumps.wikimedia.org/
> It looks like the compressed XML of the ~50M description pages is ~25GB.
> It looks like wiki-team set up a dump script that posted monthly dumps to
the Internet Archive; in 2013 it stopped including the month+year in the
title; in 2016 it stopped altogether.
https://archive.org/details/wikimediacommons
Replying without the public GLAM list, as this is a 'behind the scenes'
observation...
Though it is great that more data about downloads is available, examining
the Wikimedia Commons downloads of video, audio and images is actually
disheartening for me personally, as someone who has worked for years
uploading media from GLAM archives. The most popular videos and audio files
are almost all about graphic sex, which I guess tells us what we already
know about the types of media the public most wants to consume. Sadly, GLAM
photographic archives and lovely upload projects like the XenoCanto
birdsongs do not make it to the first couple of pages of results. As for
images, the most used are related to nationalism: national flags are the
most heavily used images, presumably because of their mass usage in
templates and auto-created infoboxes on articles.
So I guess the learning point is that we would need to re-frame what this
data means for GLAM content creators and donors, rather than supposing that
things like "impact" or "importance to open knowledge" could ever be judged
by being high or low on this metric.
Thanks,
Fae
On Fri, 3 Jan 2020 at 03:07, Mutegeki Cliff <mutegekicliff(a)gmail.com> wrote:
> This is very good news.
>
> On Fri, Jan 3, 2020, 05:31 Kerry Raymond <kerry.raymond(a)gmail.com> wrote:
>
>> Good news indeed! Does this include metrics for articles within a
>> category (thinking here specifically of the categories related to GLAM
>> content partners) or do we stick with BaGLAMa 2 for that?
>>
>>
>>
>> Kerry
>>
>>
>>
>> *From:* GLAM [mailto:glam-bounces@lists.wikimedia.org] *On Behalf Of *Pine
>> W
>> *Sent:* Friday, 3 January 2020 7:34 AM
>> *To:* Wikimedia Commons Discussion List <commons-l(a)lists.wikimedia.org>;
>> Wikimedia Foundation Multimedia Team <Multimedia(a)lists.wikimedia.org>;
>> Wikimedia & GLAM collaboration [Public] <glam(a)lists.wikimedia.org>
>> *Subject:* [GLAM] Fwd: [Analytics] Introducing statistics for media files
>>
>>
>>
>> Forwarding good news.
>>
>>
>>
>> Pine
>> ( https://meta.wikimedia.org/wiki/User:Pine )
>>
>>
>>
>> ---------- Forwarded message ---------
>> From: *Francisco Dans* <fdans(a)wikimedia.org>
>> Date: Mon, Dec 23, 2019 at 5:52 PM
>> Subject: [Analytics] Introducing statistics for media files
>> To: A mailing list for the Analytics Team at WMF and everybody who has an
>> interest in Wikipedia and analytics. <analytics(a)lists.wikimedia.org>
>>
>>
>>
>> Hi everybody,
>>
>>
>>
>> Just in time for the holidays, we're announcing the addition of Media
>> Requests to our metrics catalog. Over the last few months we've been
>> working on a dataset offering request numbers for every single image,
>> audio, video and document in the Wiki universe, since 2015.
>>
>>
>>
>> This means we have 3 new metrics available in the Analytics Query Service:
>>
>> - Media requests per referrer: e.g. how many images, audio, videos...
>> have been accessed from English Wikipedia in the last month? *73
>> billion for November
>> <http://stats.wikimedia.org/v2/#/en.wikipedia.org/content/total-mediarequest…>.*
>> - Media requests per file: e.g. how many hits did this cool painting
>> <https://en.wikipedia.org/wiki/Christmas_tree#/media/File:Yggdrasil.jpg>
>> get in November? The answer is 483,791 hits
>> <https://wikimedia.org/api/rest_v1/metrics/mediarequests/per-file/all-refere…>
>> .
>> - Top files by media requests: e.g. what was the most popular video
>> yesterday, December 22nd? Fred Rogers testifying before the Senate
>> Subcommittee on Communications
>> <http://stats.wikimedia.org/v2/#/en.wikipedia.org/content/top-mediarequests/…>.
>> Fun! You can check out the top 1000 media files for any month or day, for
>> any media type.
>>
>> Media requests is, in terms of absolute numbers, a huge dataset, so the
>> per file and top metrics are still being loaded with data all the way to
>> 2015. We expect this loading to finish in mid January.
>>
>>
>>
>> You can read more about this in Wikitech
>> <https://wikitech.wikimedia.org/wiki/Analytics/AQS/Mediarequests>. As
>> usual if you have any questions about the dataset or the new metrics please
>> send them our way here on the list or via Phabricator.
>>
>>
>>
>> Happy holidays!
>>
>> Francisco + the A team
>>
>> --
>>
>> *Francisco Dans **(él, he, **彼**)*
>>
>> Software Engineer, Analytics Team
>>
>> Wikimedia Foundation
>>
>> _______________________________________________
>> Analytics mailing list
>> Analytics(a)lists.wikimedia.org
>> https://lists.wikimedia.org/mailman/listinfo/analytics
>> _______________________________________________
>> GLAM mailing list
>> GLAM(a)lists.wikimedia.org
>> https://lists.wikimedia.org/mailman/listinfo/glam
>>
--
faewik(a)gmail.com https://commons.wikimedia.org/wiki/User:Fae
Personal and confidential, please do not circulate or re-quote.
+1 to Kerry's request. It's important to be able to easily access such
information, both as a KPI for those contributing and as a
demonstrable indicator for those being asked to contribute.
--
GN.
Noongarpedia: https://incubator.wikimedia.org/wiki/Wp/nys/Main_Page
Photo Gallery: http://gnangarra.redbubble.com
Out now: A.Gaynor, P. Newman and P. Jennings (eds.), *Never Again:
Reflections on Environmental Responsibility after Roe 8*, UWAP, 2017. Order
here
<https://uwap.uwa.edu.au/products/never-again-reflections-on-environmental-r…>
.
Forwarding good news.
Pine
( https://meta.wikimedia.org/wiki/User:Pine )
---------- Forwarded message ---------
From: Francisco Dans <fdans(a)wikimedia.org>
Date: Mon, Dec 23, 2019 at 5:52 PM
Subject: [Analytics] Introducing statistics for media files
To: A mailing list for the Analytics Team at WMF and everybody who has an
interest in Wikipedia and analytics. <analytics(a)lists.wikimedia.org>
Hi everybody,
Just in time for the holidays, we're announcing the addition of Media
Requests to our metrics catalog. Over the last few months we've been
working on a dataset offering request numbers for every single image,
audio, video and document in the Wiki universe, since 2015.
This means we have 3 new metrics available in the Analytics Query Service:
- Media requests per referrer: e.g. how many images, audio, videos...
have been accessed from English Wikipedia in the last month? *73 billion
for November
<http://stats.wikimedia.org/v2/#/en.wikipedia.org/content/total-mediarequest…>.*
- Media requests per file: e.g. how many hits did this cool painting
<https://en.wikipedia.org/wiki/Christmas_tree#/media/File:Yggdrasil.jpg>
get in November? The answer is 483,791 hits
<https://wikimedia.org/api/rest_v1/metrics/mediarequests/per-file/all-refere…>
.
- Top files by media requests: e.g. what was the most popular video
yesterday, December 22nd? Fred Rogers testifying before the Senate
Subcommittee on Communications
<http://stats.wikimedia.org/v2/#/en.wikipedia.org/content/top-mediarequests/…>.
Fun! You can check out the top 1000 media files for any month or day, for
any media type.
Media requests is, in terms of absolute numbers, a huge dataset, so the per
file and top metrics are still being loaded with data all the way to 2015.
We expect this loading to finish in mid January.
You can read more about this in Wikitech
<https://wikitech.wikimedia.org/wiki/Analytics/AQS/Mediarequests>. As usual
if you have any questions about the dataset or the new metrics please send
them our way here on the list or via Phabricator.
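For anyone who wants to try the per-file and top metrics from a script, here
is a minimal sketch of how the request URLs might be built. The path layout
(referer, agent, URL-encoded file path, granularity, start and end dates) and
the default values are assumptions based on my reading of the Wikitech page,
not something stated in this announcement, and the file path in the usage
note is a hypothetical placeholder. Please check Wikitech before relying on it.

```python
import json
from urllib.parse import quote
from urllib.request import Request, urlopen

# Base path taken from the per-file link in the announcement.
BASE = "https://wikimedia.org/api/rest_v1/metrics/mediarequests"


def per_file_url(file_path, start, end, referer="all-referers",
                 agent="all-agents", granularity="monthly"):
    """Build a per-file URL (segment order is an assumption).

    file_path is the storage path of the file, e.g. a hypothetical
    "/wikipedia/commons/a/ab/Example.jpg"; it is percent-encoded so that
    its slashes do not split the URL. start and end are assumed to be
    YYYYMMDD strings.
    """
    encoded = quote(file_path, safe="")
    return f"{BASE}/per-file/{referer}/{agent}/{encoded}/{granularity}/{start}/{end}"


def top_url(media_type, year, month, day, referer="all-referers"):
    """Build a top-files URL for one day (segment order is an assumption)."""
    return f"{BASE}/top/{referer}/{media_type}/{year:04d}/{month:02d}/{day:02d}"


def fetch_json(url, user_agent="mediarequests-sketch/0.1 (example)"):
    """Fetch a URL and parse the JSON body; the response shape is not assumed."""
    request = Request(url, headers={"User-Agent": user_agent})
    with urlopen(request) as response:
        return json.load(response)
```

Calling per_file_url("/wikipedia/commons/a/ab/Example.jpg", "20191101",
"20191130") gives a URL for November 2019 that can be passed to fetch_json,
and top_url("video", 2019, 12, 22) would correspond to the "most popular
video yesterday" example above.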
Happy holidays!
Francisco + the A team
--
*Francisco Dans (él, he, 彼)*
Software Engineer, Analytics Team
Wikimedia Foundation