Hi everyone,
Summary: Wiki Workshop 2022 [0] will take place virtually as part of
The Web Conference 2022 [1]. The call for papers is now open:
https://wikiworkshop.org/2022/#call . The deadline for papers to appear
in the conference proceedings is Feb 3; for all other submissions it is
March 10. The workshop will take place on April 25, 2022.
--
We are delighted to announce that Wiki Workshop 2022 [0] will be held
virtually on April 25, 2022, as part of The Web Conference 2022 [1].
In past years, Wiki Workshop has traveled to Oxford, Montreal,
Cologne, Perth, Lyon, and San Francisco, and (virtually) to Taipei and
Ljubljana.
Last year, we had more than 150 participants in the workshop, along
with 22 accepted paper presentations, a keynote, a panel, music, and more.
The workshop is now a vibrant event for Wikimedia researchers and
those interested in this space to get together on an annual basis.
We encourage contributions by all researchers who study the Wikimedia
projects. We specifically encourage 1-2 page submissions of
preliminary research. You will have the option to publish your work as
part of the proceedings of The Web Conference 2022.
You can read more about the call for papers and the workshop at
http://wikiworkshop.org/2022/#call. Please note that the deadline for
submissions to be considered for the proceedings is February 3. All
other submissions should be received by March 10.
If you have questions about the workshop, please let us know on this
list or at wikiworkshop(a)googlegroups.com.
Looking forward to seeing many of you at this year's edition.
Best,
Srijan Kumar, Georgia Tech
Emily Lescak, Wikimedia Foundation
Miriam Redi, Wikimedia Foundation
Bob West, EPFL
Leila Zia, Wikimedia Foundation
[0] https://wikiworkshop.org/2022/
[1] https://www2022.thewebconf.org/
The Search Platform Team
<https://www.mediawiki.org/wiki/Wikimedia_Search_Platform> usually holds an
open meeting on the first Wednesday of each month. Come talk to us about
anything related to Wikimedia search, Wikidata Query Service (WDQS),
Wikimedia Commons Query Service (WCQS), etc.!
Feel free to add your items to the Etherpad Agenda for the next meeting.
Details for our next meeting:
Date: Wednesday, March 2nd, 2022
Time: 16:00-17:00 GMT / 08:00-09:00 PST / 11:00-12:00 EST / 17:00-18:00 CET
& WAT
Etherpad: https://etherpad.wikimedia.org/p/Search_Platform_Office_Hours
Google Meet link: https://meet.google.com/vgj-bbeb-uyi
Join by phone: https://tel.meet/vgj-bbeb-uyi?pin=8118110806927
Hope to talk to you next week!
—Trey
Trey Jones
Staff Computational Linguist, Search Platform
Wikimedia Foundation
UTC–5 / EST
Hi all,
Join the Research Team at the Wikimedia Foundation [1] for their monthly
Office hours this Tuesday, 2022-03-01, at 12:00-13:00 UTC (4:00 PT / 7:00
ET / 13:00 CET). Find your local time here
<https://zonestamp.toolforge.org/1646136000>.
To participate, join the video-call via this link [2]. There is no set
agenda - feel free to add your item to the list of topics in the etherpad
[3]. You are welcome to add questions / items to the etherpad in advance,
or when you arrive at the session. Even if you are unable to attend the
session, you can leave a question that we can address asynchronously. If
you do not have a specific agenda item, you are welcome to hang out and
enjoy the conversation. More detailed information (e.g. about how to
attend) can be found here [4].
Through these office hours, we aim to make ourselves more available to
answer research-related questions that you as Wikimedia volunteer editors,
organizers, affiliates, staff, and researchers face in your projects and
initiatives. Here are some example cases we hope to be able to support you
with:
- You have a specific research-related question that you suspect you
  should be able to answer with the publicly available data, but you don’t
  know how to find an answer, or you just need some more help with it.
  For example: how can I compute the ratio of anonymous to registered
  editors in my wiki?
- You run into repetitive or very manual work as part of your Wikimedia
  contributions and you wish to find out if there are ways to use machines
  to improve your workflows. These types of questions can sometimes be
  harder to answer during an office hour. However, discussing them can
  help us understand your challenges better, and we may find ways to work
  with each other to support you in addressing them in the future.
- You want to learn what the Research team at the Wikimedia Foundation
  does and how we can potentially support you. Specifically for affiliates:
  if you are interested in building relationships with the academic
  institutions in your country, we would love to talk with you and learn
  more. We have a series of programs that aim to expand the network of
  Wikimedia researchers globally, and we would love to collaborate more
  closely with those of you interested in this space.
- You want to talk with us about one of our existing programs [5].
Hope to see many of you,
Emily on behalf of the WMF Research Team
[1] https://research.wikimedia.org
[2] https://meet.jit.si/WMF-Research-Office-Hours
[3] https://etherpad.wikimedia.org/p/Research-Analytics-Office-hours
[4] https://www.mediawiki.org/wiki/Wikimedia_Research/Office_hours
[5] https://research.wikimedia.org/projects.html
--
Emily Lescak (she / her)
Senior Research Community Officer
The Wikimedia Foundation
Hello everyone,
TL;DR: Wikimedia will soon be applying as a mentoring organization to Google
Summer of Code 2022 <https://summerofcode.withgoogle.com> [1] and Outreachy
Round 24 <https://www.outreachy.org/> [2]. We are currently working on a
list of interesting project ideas to include in the application. If you
have some ideas for coding or non-coding (design, documentation,
translation, outreach, research) projects, share them here:
<https://phabricator.wikimedia.org/T299453> [3].
*Timeline*
As a mentor, you will engage potential candidates in the application period
for both programs between March and April. You will help candidates make
small contributions to your project and answer any project-related queries
during this time. You will work more closely with the accepted candidates
during the coding period between May and August.
*New changes are coming to GSoC*
GSoC has exciting changes this year, including:
* Eligibility criteria redefined: the program is now open to all open-source
newcomers 18 years and older. It will no longer be solely focused on
university students or recent graduates.
* Multiple project sizes supported, ranging from ~175 to ~350 hours.
* Increased flexibility in project timing: the project deadline can be
extended up to 22 weeks.
*Tips for proposing projects*
* Follow this task description template when you propose a project in
Phabricator:
<https://phabricator.wikimedia.org/tag/outreach-programs-projects> [4]. Add
the #Google-Summer-of-Code (2022) or #Outreachy (Round 24) tag.
* A project should take an experienced developer ~15 days and a newcomer
~3 months to complete.
* Each project should have at least two mentors, and one of them should
have a technical background.
* Ideally, the project has no tight deadlines, a moderate learning curve,
and fewer dependencies on Wikimedia's core infrastructure. Projects
addressing the needs of a language community are most welcome!
* If you don't have an idea in mind and would like to pick one from an
existing list, check out these projects:
<https://phabricator.wikimedia.org/tag/outreach-programs-projects/> [4]
* To learn more about the roles and responsibilities of mentors, visit our
resources on MediaWiki.org:
<https://www.mediawiki.org/wiki/Outreachy/Mentors> [5],
<https://www.mediawiki.org/wiki/Google_Summer_of_Code/Mentors> [6].
Cheers,
Srishti
[1] https://summerofcode.withgoogle.com
[2] https://www.outreachy.org/
[3] https://phabricator.wikimedia.org/T299453
[4] https://phabricator.wikimedia.org/tag/outreach-programs-projects/
[5] https://www.mediawiki.org/wiki/Outreachy/Mentors
[6] https://www.mediawiki.org/wiki/Google_Summer_of_Code/Mentors
*Srishti Sethi*
Senior Developer Advocate
Wikimedia Foundation <https://wikimediafoundation.org/>
Hello,
As you may know, the Wikidata development team has been working on a tool
that lets editors review mismatching data between Wikidata and external
databases. The tool is now ready to be used, and you can access it here
<https://mismatch-finder.toolforge.org/> and read more details on
Wikidata:Mismatch Finder
<https://www.wikidata.org/wiki/Wikidata:Mismatch_Finder>. We hope
that this tool can be useful to people who are working on data quality and
matching external databases with Wikidata, and we are looking forward to
your feedback if you give it a try!
What is the purpose of Mismatch Finder?
The tool helps highlight differences in the data between Wikidata and other
databases, in order to improve data quality in Wikidata and make the whole
linked open data web more robust. The tool itself doesn’t check these
databases automatically: it is necessary for someone to compare an external
database to Wikidata first and then upload a list of possible mismatches
into the Mismatch Finder, so they can be analyzed and processed by Wikidata
editors.
By providing such a tool, we hope to help Wikidata editors spot and fix
mistakes in Wikidata, and to support organizations reusing Wikidata’s
data, who now have a convenient way to contribute back by reporting lists
of possible mismatches.
How to use the tool to check mismatches?
On the Mismatch Finder tool page <https://mismatch-finder.toolforge.org/>,
you can check Items by entering a list of Q-IDs (for example taken from a
SPARQL query). After clicking on “Check Items”, the tool will check if
there are mismatches for these Items in the mismatch store, and display any
issue that was found with a specific part of the data.
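As a rough illustration of preparing input for the “Check Items” box, the
sketch below pulls unique Q-IDs out of SPARQL-style result values (full
entity URIs or bare IDs) and prints them one per line. The function name
extract_qids, the sample values, and the one-ID-per-line layout are
assumptions made for illustration, not part of the Mismatch Finder itself;
please check the tool page for the input format it actually expects.

```python
import re

# Matches a Wikidata Item ID such as Q42, either bare or at the end of
# an entity URI like http://www.wikidata.org/entity/Q42.
QID_PATTERN = re.compile(r"\bQ[1-9]\d*\b")


def extract_qids(raw_values):
    """Return unique Q-IDs, in first-seen order, from an iterable of strings."""
    seen = set()
    qids = []
    for value in raw_values:
        for qid in QID_PATTERN.findall(value):
            if qid not in seen:
                seen.add(qid)
                qids.append(qid)
    return qids


# Hypothetical values, as they might be copied from a SPARQL result column.
sample = [
    "http://www.wikidata.org/entity/Q42",
    "Q64",
    "http://www.wikidata.org/entity/Q42",  # duplicate, dropped
    "not an item",                          # no Q-ID, ignored
]

# One Q-ID per line (assumed layout for pasting into the tool).
print("\n".join(extract_qids(sample)))
```

Deduplicating while keeping the original order makes it easier to compare
the tool's results against the query output they came from.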
From this page, after logging in with your Wikidata account via OAuth,
you will be able to choose a status for the mismatch, indicating which part
of the data is wrong, and access the Item on wikidata.org to edit the
data if needed. Mismatch Finder does not perform any automatic editing on
Wikidata.
Once the status is changed from “waiting for review” to another value, the
mismatch will not appear in the list anymore.
You can also use the Mismatch Finder
<https://www.wikidata.org/wiki/Wikidata:Mismatch_Finder/Gadget> user script,
which displays an alert at the top of Item pages on wikidata.org along with
a link to the Mismatch Finder tool, where you can learn more about the
potential mismatches. See Help:User scripts
<https://www.wikidata.org/wiki/Help:User_scripts> for how to enable the
user script for your account.
Where does the information come from?
Information about the potential mismatches is stored in the Mismatch Store,
a database separate from Wikidata where organizations, researchers and
editors can upload lists of mismatches.
The Mismatch Store is hosted on Toolforge and its content can be accessed
via an API. You can find more information about the database, how to get
data from the API, and how to prepare and upload a mismatches file in this
user guide
<https://github.com/wmde/wikidata-mismatch-finder/blob/main/docs/UserGuide.md>.
We hope that the Mismatch Finder tool will help to build up feedback loops
with data re-users to get them actively involved in improving the data on
Wikidata. Feel free to try out the tool and let us know what you think on the
talk page <https://www.wikidata.org/wiki/Wikidata_talk:Mismatch_Finder>.
You can also join us for an intro session and discussion at the upcoming Data
Reuse Days
<https://www.wikidata.org/wiki/Wikidata:Events/Data_Reuse_Days_2022>.
For a quick intro and demo of how the Mismatch Finder works, please see this
short video
<https://commons.wikimedia.org/wiki/File:Mismatch_Finder_intro.webm>.
We would especially like to thank Mike Peel and Marco Fossati for
providing the first mismatches and real-world testing data for the Mismatch
Finder to get us started. More will follow in the coming days and weeks.
Cheers,
--
Mohammed Sadat
*Community Communications Manager for Wikidata/Wikibase*
Wikimedia Deutschland e. V. | Tempelhofer Ufer 23-24 | 10963 Berlin
Phone: +49 (0)30 219 158 26-0
https://wikimedia.de
Keep up to date! Current news and exciting stories about Wikimedia,
Wikipedia and Free Knowledge in our newsletter (in German): Subscribe now
<https://www.wikimedia.de/newsletter/>.
Imagine a world in which every single human being can freely share in the
sum of all knowledge. Help us to achieve our vision!
https://spenden.wikimedia.de
Wikimedia Deutschland – Gesellschaft zur Förderung Freien Wissens e. V.
Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter
der Nummer 23855 B. Als gemeinnützig anerkannt durch das Finanzamt für
Körperschaften I Berlin, Steuernummer 27/029/42207.
Hello all,
I'm very excited to share with you the upcoming event that we are
organizing: *Data Reuse Days
<https://www.wikidata.org/wiki/Wikidata:Events/Data_Reuse_Days_2022>*, a
series of online sessions that will take place from March 14th to 24th.
During this event, we would like to highlight some great *projects and
applications that are powered by Wikidata's data*, and discuss with their
creators the workflows they use, the challenges they may have encountered
along the way, how they interact with the Wikidata editors, and how we
could find ways to give back to the project and improve the data quality
together.
We also want to present various *tools that allow people to retrieve,
query, analyze and display data from Wikidata*, and we want to use this
event as an opportunity to get more people onboard with using Wikidata's
data, inside or outside the Wikimedia projects.
Building on the previous events we tried with this format (30 Lexic-o-days
and Data Quality Days), this event is taking place online, with a very
flexible program, where speakers and facilitators can schedule a session at
any time during the ten days of the event. The sessions can be
presentations, workshops, lightning talks, discussions, live-coding,
editathons, and most of them will take place on the open videocall platform
Jitsi.
The schedule is still under construction, and will evolve until a few days
before the start of the event: you can already check the scheduled sessions
on this page
<https://diff.wikimedia.org/event/wikidata-data-reuse-days-2022/>, and we
will add more on the way. If you are interested in presenting something
during the Data Reuse Days, you can make a proposal directly on the talk
page
<https://www.wikidata.org/wiki/Wikidata_talk:Events/Data_Reuse_Days_2022>,
or reach out to me directly, so we can discuss the details together. We are
especially looking for proposals presenting tools and workflows to build an
application using Wikidata's data, examples of use of Wikidata's data on
other Wikimedia projects, as well as discussions about data reuse or data
quality.
If you have any questions or suggestions for the event, or if you would
like to help, feel free to reach out to me. I will give you updates closer
to the event, once the schedule is populated.
Cheers,
--
Léa Lacroix
Community Engagement Coordinator
Wikimedia Deutschland e.V.
Tempelhofer Ufer 23-24
10963 Berlin
www.wikimedia.de