Hi everyone,
The Search Platform Team
<https://www.mediawiki.org/wiki/Wikimedia_Search_Platform> usually holds
office hours the first Wednesday of each month. Come talk to us about
anything related to Wikimedia search, Wikidata Query Service, Wikimedia
Commons Query Service, etc.!
Feel free to add your items to the Etherpad Agenda for the next meeting.
Details for our next meeting:
Date: Wednesday, December 2nd, 2020
Time: 16:00-17:00 GMT / 08:00-09:00 PST / 11:00-12:00 EST / 17:00-18:00 CET
Etherpad: https://etherpad.wikimedia.org/p/Search_Platform_Office_Hours
Google Meet link: https://meet.google.com/vyc-jvgq-dww
Join by phone in the US: +1 786-701-6904 PIN: 262 122 849#
Hope to talk to you in a week!
—Trey
Trey Jones
Sr. Computational Linguist, Search Platform
Wikimedia Foundation
UTC-5 / EST
Hi all,
Join the Research Team at the Wikimedia Foundation [1] for their monthly
Office hours on 2020-12-01 at 17:00-18:00 PM UTC (9am PT/6pm CET).
To participate, join the video-call via this Wikimedia-meet link [2]. There
is no set agenda - feel free to add your item to the list of topics in the
etherpad [3] (You can do this after you join the meeting, too.), otherwise
you are welcome to also just hang out. More detailed information (e.g.
about how to attend) can be found here [4].
Through these office hours, we aim to make ourselves more available to
answer some of the research related questions that you as Wikimedia
volunteer editors, organizers, affiliates, staff, and researchers face in
your projects and initiatives. Some example cases we hope to be able to
support you in:
-
You have a specific research related question that you suspect you
should be able to answer with the publicly available data and you don’t
know how to find an answer for it, or you just need some more help with it.
For example, how can I compute the ratio of anonymous to registered editors
in my wiki?
-
You run into repetitive or very manual work as part of your Wikimedia
contributions and you wish to find out if there are ways to use machines to
improve your workflows. These types of conversations can sometimes be
harder to find an answer for during an office hour, however, discussing
them can help us understand your challenges better and we may find ways to
work with each other to support you in addressing it in the future.
-
You want to learn what the Research team at the Wikimedia Foundation
does and how we can potentially support you. Specifically for affiliates:
if you are interested in building relationships with the academic
institutions in your country, we would love to talk with you and learn
more. We have a series of programs that aim to expand the network of
Wikimedia researchers globally and we would love to collaborate with those
of you interested more closely in this space.
-
You want to talk with us about one of our existing programs [5].
Hope to see many of you,
Martin (WMF Research Team)
[1] https://research.wikimedia.org/team.html
[2] https://meet.wmcloud.org/ResearchOfficeHours
[3] https://etherpad.wikimedia.org/p/Research-Analytics-Office-hours
[4] https://www.mediawiki.org/wiki/Wikimedia_Research/Office_hours
[5] https://research.wikimedia.org/projects.html
--
Martin Gerlach
Research Scientist
Wikimedia Foundation
Hello all,
As you may know, you can include changes coming from Wikidata in your
Watchlist and Recent Changes on other Wikimedia projects. Until now, this
feature didn’t always include changes made on Wikidata descriptions. This
is due to how Wikidata tracks what data is used in a given article.
Starting on December 3rd, the Watchlist and Recent Changes will include
changes on the descriptions of Wikidata Items that are used in the pages
that you watch on the client wiki. This will only include descriptions in
the language of your wiki to make sure that you’re only seeing changes that
are relevant to your wiki.
This improvement was requested by many users from different projects. We
hope that it can help contributors of Wikipedia and the Wikimedia projects
to monitor the changes on Wikidata descriptions and participate in the
effort of improving the data quality.
If you encounter any issue or want to provide feedback, feel free to use this
Phabricator ticket <https://phabricator.wikimedia.org/T191831>. Thanks!
--
Léa Lacroix
Community Engagement Coordinator
Wikimedia Deutschland e.V.
Tempelhofer Ufer 23-24
10963 Berlin
www.wikimedia.de
Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter
der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das Finanzamt für
Körperschaften I Berlin, Steuernummer 27/029/42207.
Hello all,
As previously announced, the WikidataCon 2021
<https://www.wikidata.org/wiki/Wikidata:WikidataCon_2021> will take place
in October next year, in a distributed hybrid format: most of the contents
will be spread online, while user groups may be able to organize local
events, depending on the situation in their region.
We’re very happy to announce that after discussing with several
organizations who were particularly involved with Wikidata events in the
past, we partnered up with Wiki Movimento Brasil to support the
organization of the conference.
You may already have heard of the awesome series of events Wikidata Labs
<https://www.wikidata.org/wiki/Wikidata:Wikidata_Labs> organized by Wiki
Movimento Brasil
<https://meta.wikimedia.org/wiki/Wiki_Movement_Brazil_User_Group>. They are
also involved in several other Wikidata-related projects, building
partnerships with GLAM organizations and supporting the community in
Brazil. By working together, we will ensure that the program includes a
great diversity of speakers and we will prepare a physical event to gather
the Wikidata community in Brazil.
The selection process for the partner location included a pre-selection of
several organizations in countries outside of Europe and North America, and
took in account several criteria, focussing on the past experiences in
organizing events, the reliability and sustainability of the group, as well
as various practical considerations. Part of the WikidataCon budget
provided by Wikimedia Germany will be used to support events in Brazil and
including participants from this country. If you’re interested in knowing
more about the process, feel free to contact me, I’ll gladly give more
details.
Of course, the participation of other chapters, user groups and local
Wikidata communities is still very welcome: in the next few months, I’ll
share more information about how local groups can be involved in running
side events or participate in the program design.
Frequent updates about the conference and further announcements will take
place on wiki <https://www.wikidata.org/wiki/Wikidata:WikidataCon_2021>.
The talk page is the best place to ask questions and discuss about the
organization of the conference.
You can also contact me at any time if you have questions or suggestions
about the WikidataCon 2021.
We’re very excited to work hand in hand with Wiki Movimento Brasil and to
experiment with the new hybrid format of the conference together!
Thanks for your attention,
--
Léa Lacroix
Community Engagement Coordinator
Wikimedia Deutschland e.V.
Tempelhofer Ufer 23-24
10963 Berlin
www.wikimedia.de
Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter
der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das Finanzamt für
Körperschaften I Berlin, Steuernummer 27/029/42207.
Hi Everyone!
The second edition of the Coolest Tool Award
<https://meta.wikimedia.org/wiki/Coolest_Tool_Award> will happen online on
Friday 11 December 2020 at 17:00 UTC[^1].
The awarded tools will be showcased in a virtual event, with broadcasted
video and chat channels for socializing. We will send more details and
links soon.
Save the date, and join us celebrating the great work volunteer developers
do for the Wikimedia communities.
We hope to see you there!
Joaquin, for the Coolest Tool Academy 2020
[^1]: 17:00 UTC is 9:00 PST, 18:00 CEST, 22:30 IST. More timezones in
timeanddate.com
<https://www.timeanddate.com/worldclock/fixedtime.html?iso=20201211T17>
[image: Coolest_Tool_Award_2020_event_date.png]
--
Joaquin Oltra Hernandez
Developer Advocate - Wikimedia Foundation
On Wed, Nov 25, 2020 at 1:22 PM Daniel Garijo <dgarijo(a)isi.edu> wrote:
>
> Hello,
>
> I am writing this message because I am analyzing the Wikidata JSON dumps
> available in the Internet Archive and I have found there are no dumps
> available after Feb 8th, 2019 (see
> https://archive.org/details/wikimediadownloads?and%5B%5D=%22Wikidata%20enti…).
> I know the latest dumps are available at
> https://dumps.wikimedia.org/wikidatawiki/entities/, but unfortunately
> they only cover the last few months.
Which dump files are exactly looking for? Dumps like
https://dumps.wikimedia.org/wikidatawiki/entities/20201116/wikidata-2020111…
which can also be found on https://dumps.wikimedia.org/other/wikidata/
as 20201116.json.gz ?
> [...]
> Does anyone on this list know where some of these missing Wikidata dumps
> may be found? If anyone has pointers to a server where they can be
> downloaded, I would highly appreciate it.
If you are looking for these dumps, I have about 8 TB stored on
external disks. Transferring these over the network might be
difficult, however. Please contact me off-list, if this you need any
of these dumps, maybe we can arrange something.
I'm curious, what are you trying to do with all of these files?
Processing all of them must take months. My processor usually picks
up the dump on Wednesday and takes 80 hours to comb through it. But
my processor is written in Perl, something in C or Rust might be a lot
faster...
regards, Gerhard Gonter
Hi Daniel,
I am the one managing the archival process and indeed, it was around
end-2018 when the archival process just died (you can see the status here:
https://dumps.wmflabs.org/status.php).
The current status is that the software behind the archival process is
being reworked and will come with features that I will be announcing once
it is ready. The Wikidata JSON dumps will resume archival starting next
week, so unfortunately all information between end-2018 till around October
2020 will be lost (unless someone has a copy somewhere). As for the dumps
in 2017, there were other issues that caused the archival process to stall
as well (you can see the list of available and archived dumps here:
https://dumps.wmflabs.org/wikidata.txt).
I sincerely apologize for the lost information. The new version that I'm
currently working on right now will definitely be much better and more
robust to handle failures.
Warmest regards,
Hydriz
On Wed, 25 Nov 2020 at 20:22, Daniel Garijo <dgarijo(a)isi.edu> wrote:
> Hello,
>
> I am writing this message because I am analyzing the Wikidata JSON dumps
> available in the Internet Archive and I have found there are no dumps
> available after Feb 8th, 2019 (see
>
> https://archive.org/details/wikimediadownloads?and%5B%5D=%22Wikidata%20enti…).
>
> I know the latest dumps are available at
> https://dumps.wikimedia.org/wikidatawiki/entities/, but unfortunately
> they only cover the last few months.
>
> I also noticed some gaps in the years where there are JSON dumps
> available. For example, there are no JSON dumps available between end of
> Feb, 2017 and Aug 21st, 2017; or between August 21st, 2017 and Nov 16,
> 2017.
>
> Another strange finding is that while there are some entries for the
> dumps in the Internet Archive between March 19th, 2018 and Nov 26th,
> 2018 (e.g., https://archive.org/details/wikibase-wikidatawiki-20181104),
> none of them contain a JSON dump. That's another gap of more than 8 months.
>
> Does anyone on this list know where some of these missing Wikidata dumps
> may be found? If anyone has pointers to a server where they can be
> downloaded, I would highly appreciate it.
>
> Thanks in advance,
> Daniel
>
>
> _______________________________________________
> Xmldatadumps-l mailing list
> Xmldatadumps-l(a)lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l
>
--
Hydriz Scholz
The University of Nevada, Las Vegas seeks candidates for the following
paid *temporary
hourly* donor funded position: *Digital Collections Wikimedian-in-Residence*
Description: Work with University Libraries’ Special Collections and
Archives staff in the Digital Collections department on a Library Advisory
Board grant-funded Wikidata pilot. Learn in a hands-on environment
about Wikidata creation and editing, and Wikidata tools while contributing
to a large-scale effort to expose underrepresented materials from UNLV
Special Collections & Archives on the semantic web.
This is a prime opportunity for a candidate interested in exploring the
practical applications of linked data principles using Wikidata to gain
valuable hands-on experience. For additional information on how to apply,
see full job description here: https://t.co/veKQEV5uLH?amp=1
[image: UNLV Logo] <http://unlv.edu/>
Cory Lampert
Professor and Head, Digital Collections
University Libraries
University of Nevada, Las Vegas
cory.lampert(a)unlv.edu
Office: 702-895-2209 <17028952209>
https://orcid.org/0000-0002-9467-5214