Hey folks,
we plan to drop the wb_entity_per_page table sometime soon[0], because
it is simply not required (we will likely always have a programmatic
mapping from entity id to page title) and, as it stands, it does not
support non-numeric entity ids. Because of this, removing it is a
blocker for the Commons metadata work.
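For context, the programmatic mapping is essentially the following rule (a rough Python sketch of the idea, not the actual Wikibase code):

def entity_id_to_page_title(entity_id):
    # On Wikidata, items live in the main namespace under their plain
    # id, and properties live in the "Property:" namespace.
    if entity_id.startswith("Q"):
        return entity_id                  # e.g. "Q64" -> "Q64"
    if entity_id.startswith("P"):
        return "Property:" + entity_id    # e.g. "P31" -> "Property:P31"
    raise ValueError("unexpected entity id: " + entity_id)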
Is anybody using that table for their tools (on Tool Labs)? If so, please
tell us, so that we can give you instructions and a longer grace period
to update your scripts.
Cheers,
Marius
[0]: https://phabricator.wikimedia.org/T95685
Hi folks!
My name is Glorian Yapinus, but you can simply call me Glorian ;). For the
next six months, I will assist Lydia in supporting you all.
As for my educational background, I hold a bachelor's degree in
Information Technology, and I am currently working on my Master's in
Software Engineering and Management.
I am a warm and friendly person, so please do not hesitate to reach out to
me with any queries :-)
Last but not least, I am looking forward to working with you.
Cheers,
Glorian
--
Glorian Yapinus
Product Management Intern for Wikidata
Imagine a world in which every single human being can freely share in the
sum of all knowledge. That's our commitment.
Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
Registered in the register of associations of the Amtsgericht
Berlin-Charlottenburg under number 23855 B. Recognized as charitable by the
Tax Office for Corporations I (Finanzamt für Körperschaften I) Berlin, tax
number 27/029/42207.
Hi, how about a Wikipedia about objects?
Instead of generic articles about, for example, "Ballpoint pen" or "Bic
cristal", it would be "Ballpoint pen Bic cristal 2014". Using Wikidata
properties and doing this for millions of objects would allow people to
have an open, free, universal and structured place to refer to specific
objects.
Project page: https://meta.wikimedia.org/wiki/WikiObject
Greetings,
Hi,
I have had this idea for some time now but never got around to testing it
or writing it down.
DBpedia extracts detailed context information in quads (where possible)
about where each triple came from, including the line number in the wiki
text. Although each DBpedia extractor is independent, this context opens a
small window for combining output from different extractors, such as the
infobox statements we extract from Wikipedia and the very recent citation
extractors we announced [1].
I attach a very small sample from the article about Germany, where I
filtered out the related triples and ordered them by the line number they
were extracted from, e.g.
dbr:Germany dbo:populationTotal "82175700"^^xsd:nonNegativeInteger
<http://en.wikipedia.org/wiki/Germany?oldid=736355524#absolute-line=66&template=Infobox_country&property=population_estimate&split=1&wikiTextSize=10&plainTextSize=10&valueSize=8> .

<https://www.destatis.de/DE/PresseService/Presse/Pressemitteilungen/2016/08/PD16_295_12411pdf.pdf;jsessionid=996EC2DF0A8D510CF89FDCBC74DBAE9F.cae2?__blob=publicationFile> dbp:isCitedBy dbr:Germany
<http://en.wikipedia.org/wiki/Germany?oldid=736355524#absolute-line=66> .
Looking at the Wikipedia article, we see:

|population_estimate = 82,175,700<ref>{{cite web|url=https://www.destatis.de/DE/PresseService/Presse/Pressemitteilungen/2016/08/PD16_295_12411pdf.pdf;jsessionid=996EC2DF0A8D510CF89FDCBC74DBAE9F.cae2?__blob=publicationFile|title=Population at 82.2 million at the end of 2015 – population increase due to high immigration|date=26 August 2016|work=destatis.de}}</ref>
Could this approach be a good candidate for reference suggestions in
Wikidata? (This particular statement already has a reference, but the
anthem and GDP in the attachment do not, for example.)
There are many things that can be done to improve the matching, but before
getting into details I would like to see whether this idea is worth
exploring further.
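To make the matching step concrete, here is a minimal Python sketch,
assuming the quads are already parsed into (subject, predicate, object,
context) tuples with prefixed names as plain strings; it groups extractor
output by page and absolute-line and pairs statements with citations from
the same wikitext line:

from collections import defaultdict
from urllib.parse import parse_qs

def parse_context(context_iri):
    # Split a provenance IRI like the ones above into the page URL and
    # the key=value fields carried in its fragment (absolute-line,
    # template, property, ...).
    url, _, fragment = context_iri.partition("#")
    return url, {k: v[0] for k, v in parse_qs(fragment).items()}

def candidate_references(quads):
    # Group quads by (page, absolute-line): an infobox statement and a
    # dbp:isCitedBy triple extracted from the same wikitext line form a
    # candidate (statement, reference) pair.
    by_line = defaultdict(lambda: ([], []))
    for s, p, o, ctx in quads:
        url, fields = parse_context(ctx)
        line = fields.get("absolute-line")
        if line is None:
            continue
        statements, citations = by_line[(url, line)]
        if p == "dbp:isCitedBy":
            citations.append(s)  # the subject is the cited source URL
        else:
            statements.append((s, p, o))
    return {key: pair for key, pair in by_line.items()
            if pair[0] and pair[1]}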
Cheers,
Dimitris
[1] http://www.mail-archive.com/dbpedia-discussion%40lists.sourceforge.net/msg07739.html
--
Dimitris Kontokostas
Department of Computer Science, University of Leipzig & DBpedia Association
Projects: http://dbpedia.org, http://rdfunit.aksw.org,
http://aligned-project.eu
Homepage: http://aksw.org/DimitrisKontokostas
Research Group: AKSW/KILT http://aksw.org/Groups/KILT
Hello folks,
The Wikidata development team is currently working on tools to improve *list
creation on Wikipedia*, based on Wikidata data.
In order to understand what could be useful for you and why, we offer
*three examples of user scenarios
<https://www.wikidata.org/wiki/Wikidata:List_generation_input>*, in which
you may recognize some of your current practices: how you currently edit
lists on Wikipedia, which tools or processes you use, and what could be
improved.
You can answer some short questions and comment on our assumptions on each
related talk page. This input is very important to help us understand how
you edit lists on Wikipedia, and which tools could be useful for you.
Thanks in advance to all of you who take a few minutes to answer our
questions!
Jan & Léa
--
Léa Lacroix
Community Communication Manager for Wikidata
Wikimedia Deutschland e.V.
Tempelhofer Ufer 23-24
10963 Berlin
www.wikimedia.de
Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
Registered in the register of associations of the Amtsgericht
Berlin-Charlottenburg under number 23855 Nz. Recognized as charitable by the
Tax Office for Corporations I (Finanzamt für Körperschaften I) Berlin, tax
number 27/029/42207.
Hi,
I've written some code to process the Wikidata dump, following the
Wikidata Toolkit examples.
In processItemDocument, I extract the target entity id of the 'instance
of' property for the current item. However, I cannot find a way to get the
label of the target entity, given that I have the entity id but not its
EntityDocument. Help would be appreciated :)
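One possible workaround (a minimal Python sketch against the public
wbgetentities web API, not the toolkit's own mechanism) would be to
collect the target ids during the dump pass and resolve their labels in
batches afterwards:

import requests

API = "https://www.wikidata.org/w/api.php"

def fetch_labels(entity_ids, language="en"):
    # wbgetentities accepts up to 50 ids per request.
    params = {
        "action": "wbgetentities",
        "ids": "|".join(entity_ids),
        "props": "labels",
        "languages": language,
        "format": "json",
    }
    entities = requests.get(API, params=params).json().get("entities", {})
    return {eid: ent.get("labels", {}).get(language, {}).get("value")
            for eid, ent in entities.items()}

print(fetch_labels(["Q5", "Q42"]))  # {'Q5': 'human', 'Q42': 'Douglas Adams'}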
-Sumit Asthana,
B.Tech Final Year,
Dept. of CSE,
IIT Patna
Hoi,
Jura1 created a wonderful list of people who died in Brazil in 2015 [1]. It
is a page that updates regularly from Wikidata thanks to ListeriaBot.
Obviously, there may be a few more entries to come, because I am falling
ever further behind in my quest to register deaths in 2015.
I have copied his work and created a page for people who died in the
Netherlands in 2015 [2]. It is trivially easy to do, the result looks
great, and the same approach can be used for any country on any Wikipedia.
The Dutch Wikipedia has indicated that it nowadays maintains important
metadata at Wikidata. I am really happy that we can showcase this work. It
is important work because, as someone reminded me at some stage, it is
part of what amounts to the policy on living people...
Thanks,
GerardM
[1] https://www.wikidata.org/wiki/User:Jura1/Recent_deaths_in_Brazil
[2] https://www.wikidata.org/wiki/User:Jura1/Recent_deaths_in_the_Netherlands
Everyone,
The submission deadline for the below is soon, but the bar is low - a
paragraph or two describing what you're up to, and any challenges that
you're facing. Our main goal is to get a sense of who's doing what in this
space, and to discuss prospects for helping each other.
If you'd like to be involved in the conversation, but won't be able to
make it to Costa Rica, please let me know.
Best,
Joel.
----
The Use of Wikis in Biodiversity Informatics
Workshop to be held at TDWG 2016 (Dec. 5-9) in Santa Clara de San Carlos, Costa Rica
Instructions for submitting abstracts are at
https://mbgserv18.mobot.org/ocs/index.php/tdwg/tdwg2016/schedConf/cfp
Abstracts due: Sept. 6, 2016
Wiki technologies, in particular those built on top of Semantic MediaWiki
and Wikidata, are being used to store, curate, query, integrate, and
reason over a range of biodiversity-related data. This workshop will
comprise a selection of talks describing some of these uses, followed by a
discussion of gaps in the wiki ecology, and opportunities that might exist
to fill those gaps through coordinated research and development.
I know it's been mentioned on this list before, but it would be
incredibly useful to have incremental dumps of Wikidata, as downloading
the current dumps can now take several hours over a poor-bandwidth
Internet connection.
Here's my proposal:
* the incremental dumps should have exactly the same format as the
current JSON dumps, with two exceptions:
** entries which are unchanged since the previous dump (as determined by
their "modified" timestamp) should be omitted
** entries which have been deleted since the previous dump should have
stub entries of the form {"id": "Q123", "deleted": true}
I would imagine that these dumps would be vastly smaller than the standard
dumps and, for the many re-users who only want to know about changed data,
just as useful, with a fraction of the download time and, in many cases,
without significant modification of their tools. Producing them would need
only a small amount of processing time and an insignificant amount of
extra disk storage on the servers, yet could save considerable amounts of
Internet bandwidth.
This difference-file format should be easy to generate using slight
tweaks to the existing dump code, but, if needed, I can easily write a
simple Python script to take two existing dump files and generate the
differences between them in the format above. Please drop me an email,
or reply here, if you would like me to write this.
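To illustrate, here is a minimal sketch of such a script, assuming gzipped
dumps laid out like the current ones (one entity per line inside a single
JSON array) and keyed on the "modified" timestamp as proposed above:

import gzip
import json

def iter_entities(path):
    # The current JSON dumps keep one entity per line inside one big
    # JSON array, so they can be streamed line by line.
    with gzip.open(path, "rt", encoding="utf-8") as f:
        for line in f:
            line = line.strip().rstrip(",")
            if line and line not in ("[", "]"):
                yield json.loads(line)

def write_incremental_dump(old_path, new_path, out_path):
    # The first pass keeps only id -> "modified" for the old dump, so
    # full entities of both dumps are never in memory at the same time.
    old_modified = {e["id"]: e.get("modified")
                    for e in iter_entities(old_path)}
    seen = set()
    first = True
    with open(out_path, "w", encoding="utf-8") as out:
        out.write("[\n")
        for entity in iter_entities(new_path):
            eid = entity["id"]
            seen.add(eid)
            if eid in old_modified and \
                    old_modified[eid] == entity.get("modified"):
                continue  # unchanged since the previous dump: omit
            out.write(("" if first else ",\n") + json.dumps(entity))
            first = False
        # Stub entries for everything deleted since the previous dump.
        for eid in sorted(old_modified.keys() - seen):
            out.write(("" if first else ",\n")
                      + json.dumps({"id": eid, "deleted": True}))
            first = False
        out.write("\n]\n")

A production version would of course write the output compressed as well.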
Kind regards,
Neil