Hello,
As many of you may already know, we have been working on introducing a
new Wikidata data type <https://www.wikidata.org/wiki/Help:Data_type> that
will make it easier to find EntitySchemas
<https://www.wikidata.org/wiki/Wikidata:Schemas> and use them to connect to
other Wikibase Entities. This will allow editors to refer to existing
EntitySchemas in statements to indicate what class of Items, Lexemes etc.
are governed by an EntitySchema. This new EntitySchema data type is now live
on Test Wikidata <https://test.wikidata.org/wiki/Wikidata:Main_Page> for
testing and your feedback.
Background
EntitySchemas were first introduced in 2019 as a way to model the structure
of Wikidata Items and validate data against those specifications. There are
still a number of shortcomings with EntitySchemas, which means they are
neither as useful nor as widely used as they should be. We are now
addressing a number of those issues, starting with this new data type.
In 2019, we built the first version of the EntitySchema data type, but it
was eventually rolled back based on your feedback. We have made a lot of
progress since then and have taken your feedback into account in developing
this new iteration.
The main goal of this development is to help editors model data more
consistently by making EntitySchemas more visible and integrated into
day-to-day editing work. The new EntitySchema data type offers the
following features:
- A new data type that allows making statements that take an EntitySchema
  ID as a value.
- A canonical URI scheme for EntitySchemas that matches the prefixes of
  other Semantic Entities (Items, Lexemes, and Properties), so that
  EntitySchemas can be identified as concepts and accessed when they are
  referred to in statements in various formats such as RDF.
- "What Links Here" now lets you see which Items, Lexemes, and Properties
  link to an EntitySchema in a statement.
- A “Concept URI” link in the EntitySchema’s sidebar, mirroring the format
  used for Items.
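As a rough illustration of the canonical URI scheme mentioned above, the sketch below builds a concept URI from an entity ID. The base URI `http://www.wikidata.org/entity/`, the `E` prefix for EntitySchema IDs, and the helper name `concept_uri` are assumptions made for this example and are not stated in this announcement; Test Wikidata in particular may use a different base.

```python
import re

# Assumption: EntitySchemas share the concept-URI base used by Items,
# Properties and Lexemes on the production Wikidata site.
ENTITY_BASE = "http://www.wikidata.org/entity/"

# Assumed ID shapes: Q = Item, P = Property, L = Lexeme, E = EntitySchema.
_ID_PATTERN = re.compile(r"^[QPLE][1-9][0-9]*$")


def concept_uri(entity_id: str, base: str = ENTITY_BASE) -> str:
    """Return the (assumed) concept URI for an entity ID such as 'E10'."""
    if not _ID_PATTERN.match(entity_id):
        raise ValueError(f"not a recognised entity ID: {entity_id!r}")
    return base + entity_id


if __name__ == "__main__":
    # Hypothetical EntitySchema ID, used only to show the URI shape.
    print(concept_uri("E10"))
```

Passing a different `base` would let the same helper be pointed at another Wikibase instance, if that instance follows the same pattern.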
What will come next for EntitySchemas:
- Displaying EntitySchemas linked in statements by their labels instead of
  their IDs, making them more readable and easier to understand.
- Support for language fallback to make EntitySchemas legible across
  languages.
- An updated termbox (the table with labels, descriptions and aliases) to
  provide a more consistent experience between Items, Properties and
  EntitySchemas in the future.
Testing and Feedback
Today, we’d love for you to explore EntitySchemas on Test Wikidata
<https://test.wikidata.org/wiki/Wikidata:Main_Page> and provide feedback.
We hope that the new EntitySchema data type will encourage more centralized
discussions around the modelling of specific classes in Wikidata. This new
visibility will allow for more integration of EntitySchemas into the
ecosystem, leading to improved data quality through more consistent
modelling. Ultimately, this should make the reuse of our data easier,
especially for small to medium-sized reusers.
Here is an example we prepared earlier: Q497
<https://test.wikidata.org/wiki/Q497>.
If you encounter any issues, have questions or concerns, or want to provide
feedback, please don’t hesitate to reach out to us on Wikidata talk:Schemas
<https://www.wikidata.org/wiki/Wikidata_talk:Schemas#New_EntitySchema_data_t…>
or leave a comment on this ticket: phab:T332724
<https://phabricator.wikimedia.org/T332724>.
Thanks so much,
Arian
--
Arian Bozorg (he/him)
Junior Product Manager Wikidata
Wikimedia Deutschland e. V. | Tempelhofer Ufer 23-24 | 10963 Berlin
Phone: +49 (0)30-577 11 62-230
https://wikimedia.de
Keep up to date! Current news and exciting stories about Wikimedia,
Wikipedia and Free Knowledge in our newsletter (in German): Subscribe
now <https://www.wikimedia.de/newsletter/>.
Imagine a world in which every single human being can freely share in
the sum of all knowledge. Help us to achieve our vision!
https://spenden.wikimedia.de
Wikimedia Deutschland – Gesellschaft zur Förderung Freien Wissens e. V.
(Society for the Promotion of Free Knowledge). Registered in the register
of associations of the Berlin-Charlottenburg district court under number
23855 B. Recognized as a non-profit organization by the Finanzamt für
Körperschaften I Berlin (tax office for corporations), tax number
27/029/42207.
Hi everyone,
Later this year WikidataCon
(https://www.wikidata.org/wiki/Wikidata:WikidataCon_2023) is happening
in Taipei and online. The call for proposals is now open and I'd love
to see many of you submit talks about cool stuff you're working on as
well as important topics we should discuss as a community. You can
submit proposals at https://pretalx.com/wikidatacon2023/. If you need
any help with your submission please feel free to reach out.
Cheers
Lydia
--
Lydia Pintscher - http://about.me/lydia.pintscher - WD:Q18016466
Portfolio Lead for Wikidata
Wikimedia Deutschland e. V. | Tempelhofer Ufer 23-24 | 10963 Berlin
https://wikimedia.de
The 18th International Workshop on
ONTOLOGY MATCHING
(OM-2023)
http://om2023.ontologymatching.org/
November 6th or 7th, 2023,
International Semantic Web Conference (ISWC) Workshop Program,
M.A.I.C.C., Athens, Greece
BRIEF DESCRIPTION AND OBJECTIVES
Ontology matching is a key interoperability enabler for the Semantic Web,
as well as a useful technique in some classical data integration tasks
dealing with the semantic heterogeneity problem. It takes ontologies
as input and determines as output an alignment, that is, a set of
correspondences between the semantically related entities of those
ontologies.
These correspondences can be used for various tasks, such as ontology
merging, data interlinking, query answering or navigation over knowledge
graphs.
Thus, matching ontologies enables the knowledge and data expressed
with the matched ontologies to interoperate.
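To make the input/output shape described above concrete, here is a deliberately naive sketch: two toy "ontologies" go in, and an alignment (a list of correspondences with a similarity score) comes out. It compares entity labels with Python's standard `difflib` only; the function name `match_ontologies`, the dictionary representation and the 0.8 threshold are assumptions for illustration, and real matchers evaluated in campaigns such as OAEI use far richer structural and semantic evidence.

```python
from difflib import SequenceMatcher


def match_ontologies(source, target, threshold=0.8):
    """Return an alignment as a list of (source_id, target_id, score).

    `source` and `target` map entity identifiers to human-readable labels.
    For each source entity, the best-scoring target entity is kept if its
    label similarity reaches `threshold` (an arbitrary illustrative value).
    """
    alignment = []
    for s_id, s_label in source.items():
        best = None
        for t_id, t_label in target.items():
            # Case-insensitive string similarity in [0, 1].
            score = SequenceMatcher(None, s_label.lower(), t_label.lower()).ratio()
            if score >= threshold and (best is None or score > best[2]):
                best = (s_id, t_id, score)
        if best is not None:
            alignment.append(best)
    return alignment


if __name__ == "__main__":
    # Hypothetical toy ontologies, invented for this example.
    onto_a = {"a:Person": "Person", "a:Writer": "Author"}
    onto_b = {"b:Human": "person", "b:City": "City"}
    for correspondence in match_ontologies(onto_a, onto_b):
        print(correspondence)
```

Note that a purely label-based approach misses correspondences such as "Author" and "Writer", which is one reason the semantic heterogeneity problem needs more than string comparison.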
The workshop has three goals:
1. To bring together leaders from academia, industry and user institutions
   to assess how academic advances are addressing real-world requirements.
   The workshop will strive to improve academic awareness of industrial
   and end-user needs, and thereby direct research towards those needs.
   Simultaneously, the workshop will serve to inform industry and user
   representatives about existing research efforts that may meet their
   requirements. The workshop will also investigate how ontology
   matching technology is going to evolve, especially with respect to
   data interlinking, knowledge graph and web table matching tasks.
2. To conduct an extensive and rigorous evaluation of ontology matching
   and instance matching (link discovery) approaches through
   the OAEI (Ontology Alignment Evaluation Initiative) 2023 campaign:
   http://oaei.ontologymatching.org/2023/
3. To examine similarities to and differences from other techniques and
   usages, whether old, new or emerging, such as web table matching or
   knowledge embeddings.
TOPICS of interest include but are not limited to:
Business and use cases for matching (e.g., big, open, closed data);
Requirements to matching from specific application scenarios;
Formal foundations and frameworks for matching;
Novel matching methods, including link prediction, ontology-based
access;
Matching and knowledge graphs;
Matching and deep learning;
Matching and embeddings;
Matching and big data;
Matching and linked data;
Instance matching, data interlinking and relations between them;
Privacy-aware matching;
Process model matching;
Large-scale and efficient matching techniques;
Matcher selection, combination and tuning;
User involvement (including both technical and organizational aspects);
Explanations in matching;
Social and collaborative matching;
Uncertainty in matching;
Expressive alignments;
Reasoning with alignments;
Alignment coherence and debugging;
Alignment management;
Matching for traditional applications (e.g., data science);
Matching for emerging applications (e.g., web tables, knowledge graphs).
SUBMISSIONS
Contributions to the workshop can be made in the form of technical papers
and posters/statements of interest addressing different issues of ontology
matching, as well as by participating in the OAEI 2023 campaign. Long
technical papers should not exceed 12 pages. Short technical papers should
not exceed 6 pages. Posters/statements of interest should not exceed 3 pages.
All contributions have to be prepared using the CEUR-ART, 1-column style.
An Overleaf page for LaTeX users is available at
https://www.overleaf.com/read/gwhxnqcghhdt,
while an offline version with the style files is available from
http://ceur-ws.org/Vol-XXX/CEURART.zip.
Submissions should be uploaded in PDF format
through the workshop submission site at:
https://www.easychair.org/conferences/?conf=om2023
Contributors to the OAEI 2023 campaign have to follow the campaign
conditions and schedule at http://oaei.ontologymatching.org/2023/.
DATES FOR TECHNICAL PAPERS AND POSTERS:
July 31st, 2023: Deadline for the submission of papers.
August 28th, 2023: Deadline for the notification of
acceptance/rejection.
September 4th, 2023: Workshop camera ready copy submission.
November 6th or 7th, 2023: OM-2023, M.A.I.C.C., Athens, Greece.
Contributions will be refereed by the Program Committee.
Accepted papers will be published in the workshop proceedings as a volume
of CEUR-WS as well as indexed on DBLP.
ORGANIZING COMMITTEE
1. Pavel Shvaiko (main contact)
Trentino Digitale, Italy
2. Jérôme Euzenat
INRIA & Univ. Grenoble Alpes, France
3. Ernesto Jiménez-Ruiz
City, University of London, UK & SIRIUS, University of Oslo, Norway
4. Oktie Hassanzadeh
IBM Research, USA
5. Cássia Trojahn
IRIT, France
PROGRAM COMMITTEE:
Alsayed Algergawy, Jena University, Germany
Manuel Atencia, Universidad de Málaga, Spain
Jiaoyan Chen, University of Oxford, UK
Jérôme David, University Grenoble Alpes & INRIA, France
Gayo Diallo, University of Bordeaux, France
Daniel Faria, INESC-ID&IST, University of Lisbon, Portugal
Alfio Ferrara, University of Milan, Italy
Marko Gulić, University of Rijeka, Croatia
Wei Hu, Nanjing University, China
Ryutaro Ichise, National Institute of Informatics, Japan
Antoine Isaac, Vrije Universiteit Amsterdam & Europeana, Netherlands
Naouel Karam, Fraunhofer, Germany
Prodromos Kolyvakis, EPFL, Switzerland
Patrick Lambrix, Linköpings Universitet, Sweden
Oliver Lehmberg, University of Mannheim, Germany
Fiona McNeill, University of Edinburgh, UK
Hoa Ngo, CSIRO, Australia
George Papadakis, University of Athens, Greece
Catia Pesquita, University of Lisbon, Portugal
Henry Rosales-Méndez, University of Chile, Chile
Booma Sowkarthiga, Microsoft, USA
Kavitha Srinivas, IBM, USA
Giorgos Stoilos, University of Oxford, UK
Valentina Tamma, University of Liverpool, UK
Ludger van Elst, DFKI, Germany
Xingsi Xue, Fujian University of Technology, China
Ondřej Zamazal, Prague University of Economics, Czech Republic
Songmao Zhang, Chinese Academy of Sciences, China
Lu Zhou, TigerGraph, USA
-------------------------------------------------------
More about ontology matching:
http://www.ontologymatching.org/
http://book.ontologymatching.org/
-------------------------------------------------------
Best Regards,
Pavel
-------------------------------------------------------
Pavel Shvaiko, PhD
Trentino Digitale, Italy
http://www.ontologymatching.org/
https://www.trentinodigitale.it/
http://www.dit.unitn.it/~pavel
--
Share capital: Euro 6.433.680,00 - Company register / tax code / VAT no.
00990320228
E-mail: tndigit(a)tndigit.it <mailto:tndigit@tndigit.it> -
www.trentinodigitale.it <http://www.trentinodigitale.it>
Company subject to the direction and coordination of the Provincia Autonoma
di Trento - tax code 00337460224.
This message is addressed exclusively to the recipients named in the
header. It may contain information that is protected and confidential under
current law, and any use other than that for which it was sent is
prohibited. If you have received it in error, please delete it in its
entirety and notify the sender.
Dear everyone,
As presented at last year's WikidataCon
<https://www.youtube.com/watch?v=e_VxTlBNkyk>, Wikimedia Deutschland has
set out to find new ways for collaboration around Wikidata software
development to enhance the diversity of our movement, increase Wikibase’s
scalability and robustness and breathe life into our movement principles of
knowledge equity. With a grant from Arcadia
<https://www.arcadiafund.org.uk/>, a charitable fund administered by Lisbet
Rausing and Peter Baldwin, we will be able to implement such a
collaboration in the next two years.
Today, we are happy to share an exciting update on the progress of this
project with all of you. After spending the last few months in
conversations with the movement groups that were interested in joining such
a partnership, we have now reached a point where we can spread the news
about the future partners and projects that will shape this Wikidata
software collaboration.
Wikimedia Indonesia, the Igbo Wikimedians User Group and Wikimedia
Deutschland will be joining forces to advance the technical capacities of
the movement around Wikidata development and, in doing so, to make the
software and tools more usable by cultures underrepresented in technology,
people of the Global South and speakers of minority languages.
Wikimedia Indonesia, a non-profit organization based in Jakarta, Indonesia
and established in 2008, is dedicated to encouraging the growth,
development & dissemination of knowledge in Indonesian and other languages
spoken in Indonesia. Since then, Wikimedia Indonesia has supported the
development of 14 Wikipedias in the languages spoken in Indonesia, 12
regional Wikimedian communities spread across the country, and two
Wikimedia project-based communities.
For this project, in collaboration with Wikimedia Deutschland, Wikimedia
Indonesia wants to build up a software team of its own over the course of
the next two years. The tools this team develops will hopefully help
under-resourced language communities contribute to the flourishing of their
languages online through lexicographical data, and involve local language
communities in contributing to Lexemes in Wikidata.
Igbo Wikimedians is a group of Wikimedians who are committed to working on
various wiki projects related to the Igbo language
<https://en.wikipedia.org/wiki/Igbo_language> and culture. The user group
organizes projects around community building in the Igbo community and
content improvement for Wikipedia and its sister projects, and established
its own Wikidata hub in 2021.
The Igbo Wikimedians User Group and its Wiki Mentor Africa
<https://m.wikidata.org/wiki/Wikidata:Wiki_Mentor_Africa> program aim to
build up technical capacity in African Wikimedia communities by
mentoring African developers in Wikidata tool development. Wikimedia
Deutschland will support the user group in the implementation of their
project and mentoring program.
Wikimedia Deutschland was founded in 2004 as a members' association
and is located in Berlin, Germany. Wikimedia Deutschland supports
communities like the Wikipedia community, develops software for Wikimedia
projects and the ecosystem of Free Knowledge, and works to improve the
political and legal framework for Wikipedia and for Free Knowledge in
general.
Specifically, Wikimedia Deutschland has been working on the development of
Wikidata since 2012. Since then, an active and vibrant community of
volunteer editors and programmers, re-users, data donors, affiliates and
more has formed around Wikidata.
Wikimedia Deutschland will be responsible for the administrative setup of
those collaborations and the communication with Arcadia. We are also happy
to share our experiences and knowledge about establishing software teams,
software development in the Wikidata/Wikibase environment, the Wikidata
community and providing support for emerging tech communities.
If you want to find out more about the partnership, you can read about it
on our project page on Meta
<https://meta.wikimedia.org/wiki/Software_Collaboration_for_Wikidata>,
where we will keep updating the community on the progress of this
collaboration. If you have any comments, suggestions or questions, please
use the talk page there to get in contact with us.
We are all excited to see those collaborations coming to life!
With kind regards,
Igbo Wikimedians User Group
Wikimedia Indonesia
Wikimedia Deutschland
--
Maria Heuschkel
Project Manager
Software Development
Wikimedia Deutschland e. V. | Tempelhofer Ufer 23-24 | 10963 Berlin
Tel. (030) 219 158 26-0
https://wikimedia.de
*(apologies for cross-posting)*
Hi everyone,
Our next Wikidata+Wikibase office hours
<https://www.wikidata.org/wiki/Wikidata:Events#Office_hours> will be held
on Wednesday, July 12th, 2023, at 16:00 UTC (18:00 Berlin) in the Wikidata
Telegram group <https://t.me/joinchat/IeCRo0j5Uag1qR4Tk8Ftsg>.
The Wikidata and Wikibase office hours are online events where the
development team presents what we have been working on over the past
quarter, and the community is welcome to ask questions and discuss
important issues related to the development of Wikidata and Wikibase.
We hope to see you there.
--
Mohammed Sadat
*Community Communications Manager, Wikidata*
Wikimedia Deutschland e. V. | Tempelhofer Ufer 23-24 | 10963 Berlin
Phone: +49 (0) 30 577 116 2466
https://wikimedia.de
Grab a spot in my calendar for a chat: calendly.com/masssly.
Hi everyone,
A while ago we ran a survey among reusers about the different types of
ontology issues they face when building applications and other projects
using data from Wikidata. The results are now available. More details
here: https://www.wikidata.org/wiki/Wikidata_talk:Ontology_issues_prioritization#…
Cheers
Lydia
--
Lydia Pintscher - http://about.me/lydia.pintscher - WD:Q18016466
Portfolio Lead for Wikidata
Wikimedia Deutschland e. V. | Tempelhofer Ufer 23-24 | 10963 Berlin
https://wikimedia.de
Hi all,
The next Research Showcase, with the theme of *Wikimedia and LGBTQIA+*,
will be live-streamed Wednesday, June 21 at 16:30 UTC. Find your local time
here <https://zonestamp.toolforge.org/1687365012>.
YouTube stream: https://www.youtube.com/watch?v=AOD2ZdxRNfo
You can join the conversation on IRC at #wikimedia-research or on the
YouTube chat.
This month's presentations:
- *Multilingual Contextual Affective Analysis of LGBT People Portrayals
in Wikipedia*
- *Speaker*: Chan Park, Carnegie Mellon University
- *Abstract*: In this talk, I present our research on analyzing the
portrayal of LGBT individuals in their biographies on Wikipedia, with a
particular focus on subtle word connotations and cross-cultural
comparisons. We aim to address two primary research questions: 1) How can
we effectively measure the nuanced connotations of words in multilingual
texts, which reflect sentiments, power dynamics, and agency? 2) How can we
analyze the portrayal of a specific group, such as the LGBT community, and
compare these portrayals across different languages? To answer these
questions, we collect the Multilingual Contextualized Connotation Frames
dataset, comprising 2,700 examples in English, Spanish, and Russian. We
also develop a new multilingual model based on pre-trained multilingual
language models. Additionally, we devise a matching algorithm to construct
a comparison corpus for the target corpus, isolating the attribute of
interest. Finally, we showcase how our developed models and constructed
corpora enable us to conduct cross-cultural analysis of LGBT People
Portrayals on Wikipedia. Our results reveal systematic differences in how
the LGBT community is portrayed across languages, surfacing cultural
differences in narratives and signs of social biases.
- *Paper:* Park, C. Y., Yan, X., Field, A., & Tsvetkov, Y. (2021,
May). Multilingual contextual affective analysis of LGBT people portrayals
in Wikipedia. In Proceedings of the International AAAI Conference on Web
and Social Media (Vol. 15, pp. 479-490).
<https://arxiv.org/pdf/2010.10820.pdf>
- *Visual gender biases in Wikipedia: A systematic evaluation across the
ten most spoken languages*
- *Speaker*: Daniele Metilli, University College London
- *Abstract*: Wikidata Gender Diversity (WiGeDi) is a one-year
project funded through the Wikimedia Research Fund. The project is studying
gender diversity in Wikidata, focusing on marginalized gender identities
such as those of trans and non-binary people, and adopting a queer and
intersectional feminist perspective. The project is organised in three
strands: model, data, and community. First, we are looking at how the
current Wikidata ontology model represents gender, and the extent to which
this representation is inclusive of marginalized gender identities. Second,
we are analysing the data stored in the knowledge base to gather insights
and identify possible gaps and biases. Finally, we are looking at how the
community has handled the move towards the inclusion of a wider spectrum of
gender identities by studying a corpus of user discussions through
computational linguistics methods. This presentation will report on the
current status of the Wikidata Gender Diversity project and the envisioned
outcomes. We will discuss the main challenges that we are facing and the
opportunities that our project will potentially enable, on Wikidata and
beyond.
- *Paper:* Metilli D. & Paolini C. (in press). ‘Non-binary gender
representation in Wikidata’. In: Provo A., Burlingame K. & Watson B.M.
Ethics in Linked Data. Litwin Books. <https://wigedi.com/chapter.pdf>
You can watch our past Research Showcases here:
https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase
Hope you can join us!
Warm regards,
--
*Pablo Aragón (he/him)*
Research Scientist
Wikimedia Foundation
https://research.wikimedia.org