*The First Wikidata Workshop*
Co-located with the 19th International Semantic Web Conference (ISWC
2020).
Date: October 29, 2020
The workshop will be held online in the afternoon, European time.
Website: https://wikidataworkshop.github.io/
== Important dates ==
Papers due: August 10, 2020
Notification of accepted papers: September 11, 2020
Camera-ready papers due: September 21, 2020
Workshop date: October 29, 2020
== Overview ==
Wikidata is an openly available knowledge base, hosted by the Wikimedia
Foundation. It can be accessed and edited by both humans and machines and
acts as a common structured-data repository for several Wikimedia projects,
including Wikipedia, Wiktionary, and Wikisource. It is used in a variety of
applications by researchers and practitioners alike.
In recent years, we have seen an increase in the number of publications
around Wikidata. While there are several dedicated venues for the broader
Wikidata community to meet, none of them focuses on publishing original,
peer-reviewed research. This workshop fills this gap - we hope to provide a
forum to build this fledgling scientific community and promote novel work
and resources that support it.
The workshop seeks original contributions that address the opportunities
and challenges of creating, contributing to, and using a global,
collaborative, open-domain, multilingual knowledge graph such as Wikidata.
We encourage a range of submissions, including novel research, opinion
pieces, and descriptions of systems and resources, which are naturally
linked to Wikidata and its ecosystem, or enabled by it. What we’re less
interested in are works which use Wikidata alongside or in lieu of other
resources to carry out some computational task - unless the work feeds back
into the Wikidata ecosystem, for instance by improving or commenting on
some Wikidata aspect, or suggesting new design features, tools and
practices.
We also encourage submissions on the topic of Abstract Wikipedia,
particularly around collaborative code management, natural language
generation by a community, the abstract representation of knowledge, and
the interaction between Abstract Wikipedia and Wikidata on the one hand,
and between Abstract Wikipedia and the language Wikipedias on the other.
We welcome interdisciplinary work, as well as interesting applications
which shed light on the benefits of Wikidata and discuss areas of
improvement.
The workshop is planned as an interactive half-day event, in which most of
the time will be dedicated to discussion and exchange rather than
lecture-style presentations. For this reason, all accepted papers will be
presented as short talks accompanied by a poster. We are considering online
options in response to ongoing challenges such as travel restrictions and
the Covid-19 pandemic.
== Topics ==
Topics of submissions include, but are not limited to:
- Data quality and vandalism detection in Wikidata
- Referencing in Wikidata
- Anomaly, bias, or novelty detection in Wikidata
- Algorithms for aligning Wikidata with other knowledge graphs
- The Semantic Web and Wikidata
- Community interaction in Wikidata
- Multilingual aspects in Wikidata
- Machine learning approaches to improve data quality in Wikidata
- Tools, bots and datasets for improving or evaluating Wikidata
- Participation, diversity and inclusivity aspects in the Wikidata ecosystem
- Human-bot interaction
- Managing knowledge evolution in Wikidata
- Abstract Wikipedia
== Submission guidelines ==
We welcome the following types of contributions:
- Full research paper: Novel research contributions (7-12 pages)
- Short research paper: Novel research contributions of smaller scope than
full papers (3-6 pages)
- Position paper: Well-argued ideas and opinion pieces, not yet in the
scope of a research contribution (6-8 pages)
- Resource paper: New dataset or other resource directly relevant to
Wikidata, including the publication of that resource (8-12 pages)
- Demo paper: New system critically enabled by Wikidata (6-8 pages)
Submissions must be in PDF or HTML, formatted in the style of the Springer
Lecture Notes in Computer Science (LNCS) series. For details on the LNCS
style, see Springer’s Author Instructions.
The papers will be peer-reviewed by at least two researchers. Accepted
papers will be published as open access papers on CEUR (we only publish to
CEUR if the authors agree to have their papers published).
Papers have to be submitted through EasyChair:
https://easychair.org/conferences/?conf=wikidataworkshop2020
== Proceedings ==
The complete set of papers will be published with the CEUR Workshop
Proceedings (CEUR-WS.org).
== Organizing committee ==
- Lucie-Aimée Kaffee, University of Southampton
- Oana Tifrea-Marciuska, Bloomberg
- Elena Simperl, King’s College London
- Denny Vrandečić, Wikimedia Foundation
== Programme committee ==
- Dan Brickley, Google
- Andrew D. Gordon, Microsoft Research & University of Edinburgh
- Dennis Diefenbach, University Jean Monet
- Aidan Hogan, Universidad de Chile
- Markus Krötzsch, Technische Universität Dresden
- Edgar Meij, Bloomberg
- Claudia Müller-Birn, FU Berlin
- Finn Årup Nielsen, Technical University of Denmark
- Thomas Pellissier Tanon, Télécom ParisTech
- Lydia Pintscher, Wikidata, Wikimedia Deutschland
- Alessandro Piscopo, BBC
- Marco Ponza, University of Pisa
- Simon Razniewski, Max Planck Institute for Informatics
- Miriam Redi, Wikimedia Foundation
- Cristina Sarasua, University of Zurich
- Maria-Esther Vidal, TIB Hannover
- Pavlos Vougiouklis, Huawei Technologies, Edinburgh
- Zainan Victor Zhou, Google
--
Lucie-Aimée Kaffee
Hello all,
As every quarter, the Wikidata development team will host an Office Hour on
July 21st at 16:00 UTC (18:00 CEST), on the Wikidata Telegram channel
<https://t.me/joinchat/AZriqUj5UagVMHXYzfZFvA>.
This session will be a bit special because we will have a guest: Guillaume
Lederrey from WMF's Search Team, who will present what they are working on
at the moment related to the Wikidata Query service: research that they
have been doing around the use of the WDQS, the reasons behind the issues
that we encounter for the past months with keeping the data up to date, and
different future paths for the service.
So if you're interested in the topic, feel free to prepare your questions
for July 21st!
As usual, notes of the discussions will be published onwiki
<https://www.wikidata.org/wiki/Category:Office_hour_notes> after the
meeting.
Cheers,
--
Léa Lacroix
Project Manager Community Communication for Wikidata
Wikimedia Deutschland e.V.
Tempelhofer Ufer 23-24
10963 Berlin
www.wikimedia.de
Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter
der Nummer 23855 B. Als gemeinnützig anerkannt durch das Finanzamt für
Körperschaften I Berlin, Steuernummer 27/029/42207.
Hello everyone,
As part of the Small wiki toolkits
<https://meta.wikimedia.org/wiki/Small_wiki_toolkits> initiative, a Starter
kit <https://meta.wikimedia.org/wiki/Small_wiki_toolkits/Starter_kit> has
been developed for smaller language Wikimedia wikis! This Starter kit lists
resources, tools, and recommendations in technical areas (e.g., templates,
bots, gadgets, etc.) relevant to smaller wikis that are just getting
started. Small wiki contributors can use it to make their community's
workflow easier. You can now use and promote the Starter kit in your wiki
community, and start translating the landing page and its subpages into
any language you want.
If you have any questions, ideas for venues where it should be shared or
wiki pages where it should be linked, or any other suggestions for
improving it further, please share on this talk page
<https://meta.wikimedia.org/wiki/Talk:Small_wiki_toolkits/Starter_kit>.
If you are interested in helping with the Small Wiki Toolkits initiative
and can offer help with running workshops, developing toolkits, or
exchanging problems and challenges in smaller wiki communities, add
yourself as a member here:
https://meta.wikimedia.org/wiki/Small_wiki_toolkits#Members
Cheers,
Srishti
*Srishti Sethi*
Developer Advocate
Wikimedia Foundation <https://wikimediafoundation.org/>
Hi,
The upcoming domain name migration on Wikimedia Toolforge means that
OpenRefine users need to update their Wikidata reconciliation service to
the new endpoint:
https://wdreconcile.toolforge.org/en/api
replacing "en" with any other Wikimedia language code as needed.
The new home page of the service is at:
https://wdreconcile.toolforge.org/
This new endpoint will be available by default in the upcoming release
of OpenRefine (3.4).
For details about why an automatic migration via redirects is sadly not
possible, see this Phabricator ticket:
https://phabricator.wikimedia.org/T254172
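For illustration, a request URL against the new endpoint can be built like
this (a minimal sketch: the "q0" key and payload shape follow the generic
OpenRefine reconciliation API, and the example values are hypothetical):

```python
import json
from urllib.parse import urlencode

def recon_url(lang, label, qid_type=None):
    """Build a reconciliation request URL for the new Toolforge endpoint.
    The "queries" parameter is a JSON object keyed by arbitrary query ids,
    as in the OpenRefine reconciliation API."""
    base = f"https://wdreconcile.toolforge.org/{lang}/api"
    query = {"query": label}
    if qid_type:
        # Optionally restrict matches to instances of this item.
        query["type"] = qid_type
    return base + "?" + urlencode({"queries": json.dumps({"q0": query})})

url = recon_url("en", "Douglas Adams", "Q5")
print(url)
```

Swapping the language code in the first path segment is all that is needed
for other Wikimedia languages.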
Cheers,
Antonin
Hello,
Today the Wikimedia Foundation would like to introduce a new community
blog. It's called "Diff" (diff.wikimedia.org) and is a blog by – and
for – the Wikimedia volunteer community to connect and share
learnings, stories, and ideas from across our movement. We'd like to
encourage you to learn more about Diff and how it can help you in
sharing and learning from your fellow Wikimedians.
Everyone is invited to contribute!
https://diff.wikimedia.org/2020/07/14/welcome-to-diff-a-community-blog-for-…
The name “Diff” is in reference to the wiki interface that displays
the difference between one version and another of a Wikipedia page. It
also reflects the “difference” our communities and movement make in
the world every day.
For some background, Diff builds on lessons and experiences from the
Wikimedia Blog, the Wikimedia Foundation News, and Wikimedia Space;
previous posts from these channels are archived on Diff. The channel
is primarily intended for community-authored posts, in which
volunteers can share their stories, learnings, and ideas with each
other.
Diff offers a simple and accessible editorial process, moderated by
Foundation communications staff and open to volunteers, to encourage
participation from all — especially emerging and under-represented
communities. Additionally, content on Diff can be written and
translated into languages to reach a wide audience. Diff also has a
code of conduct and comments can be flagged and moderated.
Still curious to learn more?
https://diff.wikimedia.org/2020/07/14/welcome-to-diff-a-community-blog-for-…
Yours,
Chris Koerner (he/him)
Community Relations Specialist
Wikimedia Foundation
This is an announcement for a breaking change to the pageterms submodule of
the query API module, which only affects Wikibase repository wikis. If you
do not use that API module, or only use it on client wikis (e. g.
Wikipedias) and not on repository wikis (Wikidata, Wikimedia Commons), you
can ignore this message.
For years, the pageterms
<https://www.wikidata.org/wiki/Special:ApiHelp/query%2Bpageterms> API
module has served a double role: on client wikis, it returned the “terms”
(Label, Description, Aliases) of the Wikidata Item linked to the given
page(s), whereas on repo wikis, it would return the terms of the Item (or
other Entity) on that page itself. For example, querying for the Label of
Wikipedia:Village pump on English Wikipedia
<https://en.wikipedia.org/w/api.php?action=query&prop=pageterms&titles=Wikip…>
would return “Project:Village pump” (the Label of Q16503
<https://www.wikidata.org/wiki/Q16503>), but querying for the Label of
Wikidata:Project chat on Wikidata
<https://www.wikidata.org/w/api.php?action=query&prop=pageterms&titles=Wikid…>
would not return anything, even though that page is linked to the same Item
– you would have to query for the Label of Q16503
<https://www.wikidata.org/w/api.php?action=query&prop=pageterms&titles=Q1650…>
instead. This behavior is inconsistent and also mixes repo and client
concerns in a way that makes the Wikibase code harder to maintain.
To resolve this, we introduced a new entityterms
<https://www.wikidata.org/wiki/Special:ApiHelp/query%2Bentityterms> API
module (a submodule of the query module, just like the pageterms module)
which has the same behavior as the pageterms module currently has for Item
(or other Entity) pages, and which is only available on repo wikis. If you
want to get the terms of Q16503, you can now use
action=query&prop=entityterms&titles=Q16503
<https://www.wikidata.org/w/api.php?action=query&prop=entityterms&titles=Q16…>
instead of action=query&prop=pageterms&titles=Q16503
<https://www.wikidata.org/w/api.php?action=query&prop=pageterms&titles=Q16503>.
(You can also use wbgetentities
<https://www.wikidata.org/wiki/Special:ApiHelp/wbgetentities>, which gives
you much more control over the returned data; pageterms/entityterms may be
faster and can also be combined with other submodules of the query module.)
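As a sketch, the switch amounts to changing only the prop parameter of the
query-API request (the example below uses standard MediaWiki action API
parameters; nothing else about the request changes):

```python
from urllib.parse import urlencode

API = "https://www.wikidata.org/w/api.php"

def terms_url(title, prop):
    """Build an action=query URL fetching terms for the given page.
    Use prop="entityterms" for entity pages on repo wikis; "pageterms"
    will keep returning the terms of the item *linked to* a page."""
    params = {"action": "query", "prop": prop,
              "titles": title, "format": "json"}
    return API + "?" + urlencode(params)

old = terms_url("Q16503", "pageterms")    # repo behavior removed soon
new = terms_url("Q16503", "entityterms")  # replacement on repo wikis
print(new)
```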
On or shortly after 5 August 2020, we will remove the special repo behavior
of the pageterms module, and it will then behave just like it always has on
client wikis, and return the terms of the Item linked to a page, not the
terms of the Item (or other Entity) on a page. (Because the new API module
is already available on Wikidata, and you can start using it immediately,
we are not making this pageterms behavior change available on Test Wikidata
significantly before that date.)
If you have any issue or question, feel free to leave a comment at T257658
<https://phabricator.wikimedia.org/T257658>. For more information, see also
T115117 <https://phabricator.wikimedia.org/T115117>, T255882
<https://phabricator.wikimedia.org/T255882> and T256255
<https://phabricator.wikimedia.org/T256255>.
Cheers,
Lucas
--
Lucas Werkmeister (he/er)
Full Stack Developer
Wikimedia Deutschland e. V. | Tempelhofer Ufer 23-24 | 10963 Berlin
Phone: +49 (0)30 219 158 26-0
https://wikimedia.de
Imagine a world in which every single human being can freely share in the
sum of all knowledge. Help us to achieve our vision!
https://spenden.wikimedia.de
------------------------------------------------------------------------------
FINAL CALL FOR CONTRIBUTIONS
THE SUBMISSION DEADLINE IS ON AUGUST 10TH, 2020
------------------------------------------------------------------------------
The Fifteenth International Workshop on
ONTOLOGY MATCHING
(OM-2020)
http://om2020.ontologymatching.org/
November 2nd or 3rd, 2020,
International Semantic Web Conference (ISWC) Workshop Program,
VIRTUAL CONFERENCE
BRIEF DESCRIPTION AND OBJECTIVES
Ontology matching is a key interoperability enabler for the Semantic Web,
as well as a useful technique in some classical data integration tasks
dealing with the semantic heterogeneity problem. It takes ontologies
as input and determines as output an alignment, that is, a set of
correspondences between the semantically related entities of those
ontologies.
These correspondences can be used for various tasks, such as ontology
merging, data interlinking, query answering or navigation over knowledge
graphs.
Thus, matching ontologies enables the knowledge and data expressed
with the matched ontologies to interoperate.
The workshop has three goals:
1. To bring together leaders from academia, industry and user institutions
to assess how academic advances are addressing real-world requirements.
The workshop will strive to improve academic awareness of industrial
and final user needs, and therefore, direct research towards those needs.
Simultaneously, the workshop will serve to inform industry and user
representatives about existing research efforts that may meet their
requirements. The workshop will also investigate how the ontology
matching technology is going to evolve, especially with respect to
data interlinking, knowledge graph and web table matching tasks.
2. To conduct an extensive and rigorous evaluation of ontology matching
and instance matching (link discovery) approaches through
the OAEI (Ontology Alignment Evaluation Initiative) 2020 campaign:
http://oaei.ontologymatching.org/2020/
3. To examine similarities and differences from other, old, new
and emerging, techniques and usages, such as web table matching
or knowledge embeddings.
This year, in sync with the main conference, we encourage submissions
specifically devoted to: (i) datasets, benchmarks and replication studies,
services, software, methodologies, protocols and measures
(not necessarily related to OAEI), and (ii) application of
the matching technology in real-life scenarios and assessment
of its usefulness to the final users.
TOPICS of interest include but are not limited to:
Business and use cases for matching (e.g., big, open, closed data);
Requirements to matching from specific application scenarios (e.g.,
public sector, homeland security);
Application of matching techniques in real-world scenarios (e.g., in
cloud, with mobile apps);
Formal foundations and frameworks for matching;
Novel matching methods, including link prediction, ontology-based
access;
Matching and knowledge graphs;
Matching and deep learning;
Matching and embeddings;
Matching and big data;
Matching and linked data;
Instance matching, data interlinking and relations between them;
Privacy-aware matching;
Process model matching;
Large-scale and efficient matching techniques;
Matcher selection, combination and tuning;
User involvement (including both technical and organizational aspects);
Explanations in matching;
Social and collaborative matching;
Uncertainty in matching;
Expressive alignments;
Reasoning with alignments;
Alignment coherence and debugging;
Alignment management;
Matching for traditional applications (e.g., data science);
Matching for emerging applications (e.g., web tables, knowledge graphs).
SUBMISSIONS
Contributions to the workshop can be made as technical papers and
posters/statements of interest addressing different issues of ontology
matching, as well as by participating in the OAEI 2020 campaign.
Long technical papers should be at most 12 pages; short technical papers
at most 5 pages. Posters/statements of interest should not exceed 2 pages.
All contributions have to be prepared using the LNCS Style:
http://www.springer.com/computer/lncs?SGWID=0-164-6-793341-0
and should be submitted in PDF format (no later than August 10th, 2020)
through the workshop submission site at:
https://www.easychair.org/conferences/?conf=om2020
Contributors to the OAEI 2020 campaign have to follow the campaign
conditions and schedule at http://oaei.ontologymatching.org/2020/.
DATES FOR TECHNICAL PAPERS AND POSTERS:
August 10th, 2020: Deadline for the submission of papers.
September 11th, 2020: Deadline for the notification of
acceptance/rejection.
September 21st, 2020: Workshop camera ready copy submission.
November 2nd or 3rd, 2020: OM-2020, Virtual Conference.
Contributions will be refereed by the Program Committee.
Accepted papers will be published in the workshop proceedings
as a volume of CEUR-WS as well as indexed on DBLP.
ORGANIZING COMMITTEE
1. Pavel Shvaiko (main contact)
Trentino Digitale, Italy
2. Jérôme Euzenat
INRIA & Univ. Grenoble Alpes, France
3. Ernesto Jiménez-Ruiz
City, University of London, UK & SIRIUS, University of Oslo, Norway
4. Oktie Hassanzadeh
IBM Research, USA
5. Cássia Trojahn
IRIT, France
PROGRAM COMMITTEE (to be completed):
Alsayed Algergawy, Jena University, Germany
Manuel Atencia, INRIA & Univ. Grenoble Alpes, France
Zohra Bellahsene, LIRMM, France
Jiaoyan Chen, University of Oxford, UK
Valerie Cross, Miami University, USA
Jérôme David, University Grenoble Alpes & INRIA, France
Gayo Diallo, University of Bordeaux, France
Daniel Faria, Instituto Gulbenkian de Ciência, Portugal
Alfio Ferrara, University of Milan, Italy
Marko Gulic, University of Rijeka, Croatia
Wei Hu, Nanjing University, China
Ryutaro Ichise, National Institute of Informatics, Japan
Antoine Isaac, Vrije Universiteit Amsterdam & Europeana, Netherlands
Naouel Karam, Fraunhofer, Germany
Prodromos Kolyvakis, EPFL, Switzerland
Patrick Lambrix, Linköpings Universitet, Sweden
Oliver Lehmberg, University of Mannheim, Germany
Majeed Mohammadi, TU Delft, Netherlands
Peter Mork, MITRE, USA
Andriy Nikolov, Metaphacts GmbH, Germany
George Papadakis, University of Athens, Greece
Catia Pesquita, University of Lisbon, Portugal
Henry Rosales-Méndez, University of Chile, Chile
Kavitha Srinivas, IBM, USA
Giorgos Stoilos, Huawei Technologies, Greece
Pedro Szekely, University of Southern California, USA
Ludger van Elst, DFKI, Germany
Xingsi Xue, Fujian University of Technology, China
Ondrej Zamazal, Prague University of Economics, Czech Republic
Songmao Zhang, Chinese Academy of Sciences, China
-------------------------------------------------------
More about ontology matching:
http://www.ontologymatching.org/
http://book.ontologymatching.org/
Best Regards,
Pavel
-------------------------------------------------------
Pavel Shvaiko, PhD
Trentino Digitale, Italy
http://www.ontologymatching.org/
https://www.trentinodigitale.it/
http://www.dit.unitn.it/~pavel
-------------------------------------------------------
Hello all!
The Search Platform team will join the Wikidata office hour on July 21st
at 16:00 UTC [1]. We are looking forward to discussing the Wikidata Query
Service and anything else you might find of interest.
We've been hard at work on the Wikimedia Commons Query Service (WCQS) [2].
This will be a SPARQL endpoint similar to WDQS, but serving the Structured
Data on Commons dataset. Our goal is to open a beta service, hosted on
Wikimedia Cloud Services (WMCS), by the end of July. The service will
require an account on Commons for authentication and will allow federation
with WDQS. We don't have a streaming update process ready yet; for a start,
the data will be reloaded weekly from the Commons dumps.
As part of that work, the dumps for Structured Data on Commons are now
available [3]. Note that the prefix used in the TTL dumps is "wd", which
does not make much sense. We are working with WMDE on renaming the
prefixes, but this is more complex than expected, since "wd" is hardcoded
in more places than it should be. Those prefixes are only valid in the
local context of the dumps, so renaming them is technically a non-breaking
change. That being said, if you start using those dumps, make sure you
don't rely on this prefix, or that you are ready for a rename [4].
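One way to stay robust to such a rename is to read the prefix labels from
the dump's own @prefix declarations instead of hardcoding "wd" (a minimal
sketch; the sample header line below is hypothetical, and real dumps
declare many more prefixes):

```python
import re

def read_prefixes(ttl_text):
    """Collect @prefix declarations from a Turtle dump header, so code
    does not hardcode prefix labels like "wd" that may be renamed."""
    pattern = re.compile(r'@prefix\s+([\w-]*):\s+<([^>]+)>\s*\.')
    return {label: iri for label, iri in pattern.findall(ttl_text)}

def expand(curie, prefixes):
    """Expand a prefixed name (e.g. "wd:Q42") to a full IRI."""
    label, _, local = curie.partition(":")
    return prefixes[label] + local

# Hypothetical dump header line; the actual IRIs may differ.
sample = '@prefix wd: <http://www.wikidata.org/entity/> .\n'
prefixes = read_prefixes(sample)
print(expand("wd:Q42", prefixes))
```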
We are planning to dig more into the data we have to get a better
understanding of the use cases around WDQS [5] (not much content on that
task yet, but it is coming). Some very preliminary analysis indicates that
less than 2% of the queries on WDQS generate more than 90% of the load.
This is definitely something we need to better understand. We will be
working on defining the kind of questions we need to answer, and improving
our data collection to be able to answer those questions.
We have started an internal discussion around "planning for disaster" [6].
We want to better understand the potential failure scenarios around WDQS
and have a plan if that worst case does happen. This will include some
analytics work and some testing to better understand the constraints and
what degraded mode we might still be able to provide in case of
catastrophic failure.
Thanks for reading!
Guillaume
[1] https://www.wikidata.org/wiki/Wikidata:Events#Office_hours
[2] https://phabricator.wikimedia.org/T251488
[3] https://dumps.wikimedia.org/other/wikibase/commonswiki/
[4]
https://dumps.wikimedia.org/other/wikibase/commonswiki/README_commonsrdfdum…
[5] https://phabricator.wikimedia.org/T257045
[6] https://phabricator.wikimedia.org/T257055
--
Guillaume Lederrey
Engineering Manager, Search Platform
Wikimedia Foundation
UTC+1 / CET