Dear ones,
Where might I get or mirror a dump of Commons media files?
> It seems worth mentioning on the front page of
https://dumps.wikimedia.org/
> It looks like the compressed XML of the ~50M description pages is ~25GB.
> It looks like wiki-team set up a dump script that posted monthly dumps to
the Internet Archive; in 2013 it stopped including the month+year in the
title; in 2016 it stopped altogether.
https://archive.org/details/wikimediacommons
Hello all! I am Sannita, and I am the contact person for the Structured Data Across Wikimedia (SDAW) project.[1] The project is a follow-up to the work done on Commons as part of the previous Structured Data on Commons (SDC) grant.
I'm bothering you all because we are looking for feedback that can help us design and build some new *image recommendation features*, to provide more and better suggestions for image matches to unillustrated articles.[2] We are looking for relatively experienced Wikimedia users, in particular users who have experience uploading media to Commons, including in connection with writing and expanding articles on Wikipedia.
If you are interested in following this project, you can also subscribe to our newsletter.[3] We recently published our first issue, with a series of questions about the image recommendation features we're focusing on at the moment.[4]
Thank you in advance, and I hope to hear from you soon! If you have any questions, you can also contact me in private via email, Telegram (@Sannita) or on my talk page.[5]
L.
[1] https://www.mediawiki.org/wiki/Structured_Data_Across_Wikimedia
[2] https://www.mediawiki.org/wiki/Structured_Data_Across_Wikimedia/Image_Recom…
[3] https://www.mediawiki.org/wiki/Structured_Data_Across_Wikimedia/Newsletter
[4] https://www.mediawiki.org/wiki/Structured_Data_Across_Wikimedia/Newsletter/1
[5] https://www.mediawiki.org/wiki/User_talk:Sannita_(WMF)
We currently exclude a wealth of common file formats -- and as a result,
ecosystems of knowledge. How can we elevate the general priority of
broadening supported formats?
In the past year I've worked on projects generating public-domain *csv*s,
and public-domain *epub*s, and each time found that there is no place in
the wikiverse to store them (and no other community archive that is
remotely as reliable and useful). Each time I discover this, my mind
actually goes blank for a moment in crogglement, and I seem to forget that
it happened. But today I retained coherence long enough to write this note.
Commons's list of Unsupported file types
<https://commons.wikimedia.org/wiki/Commons:File_types#Unsupported_file_types>
says
"help needed to support these" -- what help is needed, and how can we get
there? Are all 40 of the formats listed still really unsupported?
SJ
Hi everyone,
I'm very happy to announce:
OpenRefine [1] has two Junior Developer job openings (paid contractor
positions; part-time, fully remote) to build Structured Data on
Wikimedia Commons [2] functionality.
Needless to say, we would love to receive applications from Wikimedians :-)
* Junior Developer - Wikimedia Development [3] (6 months, from September
2021 till February 2022)
* Junior Developer - OpenRefine Development [4] (8 months, from November
2021 till June 2022)
All the best!
Sandra (User:Spinster / User:SFauconnier)
[1] https://openrefine.org
[2] https://w.wiki/UR
[3] https://openrefine.org/blog/2021/07/07/Wikimedia-Commons-reconciliation-bat…
[4] https://openrefine.org/blog/2021/07/07/OpenRefine-SDC-developer.html