Wikidata July 2016

wikidata@lists.wikimedia.org

39 participants
28 discussions

weekly summary #218
by Lydia Pintscher 18 Jul '16

18 Jul '16

Hey folks :) Here is your summary of what happened around Wikidata over the past week. Events <https://www.wikidata.org/wiki/Wikidata:Events>/Press/Blogs <https://www.wikidata.org/wiki/Wikidata:Press_coverage> - From Damascus to Berlin: A very special internship at Wikimedia Deutschland <https://blog.wikimedia.de/2016/07/12/from-damascus-to-berlin-a-very-special…> - Wikidata als Universalbibliographie: ein Kommentar <http://libreas.eu/ausgabe29/02voss/> - First image, good image? <http://magnusmanske.de/wordpress/?p=399> Other Noteworthy Stuff - Since Pokémon is all the rage at the moment here is a short reminder that we have WikiProject Pokémon <https://www.wikidata.org/wiki/Wikidata:WikiProject_Pok%C3%A9mon> for them - TIB is looking for a Wikimedian in Residence in the Open Science Lab <https://www.tib.eu/de/die-tib/karriere-und-ausbildung/stellenangebote/detai…> - The code for the Primary Sources Tool has been moved from the Google to the Wikidata organisation on github. <https://github.com/Wikidata/primarysources> - The ISCB competition <https://en.wikipedia.org/wiki/Wikipedia:WikiProject_Computational_Biology/I…> for 2016 has been announced - Use Wikipedia “article main images” to find candidate images for Wikidata <https://tools.wmflabs.org/fist/wdfist/index.html?depth=3&language=en&projec…> - StrepHit fact extraction agent v.1.1-beta has been released <https://github.com/Wikidata/StrepHit> Did you know? - Newest properties <https://www.wikidata.org/wiki/Special:ListProperties>: GCatholic church ID <https://www.wikidata.org/wiki/Property:P2971>, GoodReads book ID <https://www.wikidata.org/wiki/Property:P2969>, QUDT unit ID <https://www.wikidata.org/wiki/Property:P2968>, Queensland Heritage Register ID <https://www.wikidata.org/wiki/Property:P2967>, National Library of Wales Archives and Manuscripts <https://www.wikidata.org/wiki/Property:P2966>, EU River Basin District code <https://www.wikidata.org/wiki/Property:P2965>, right to vote <https://www.wikidata.org/wiki/Property:P2964>, GoodReads author ID <https://www.wikidata.org/wiki/Property:P2963>, title of chess player <https://www.wikidata.org/wiki/Property:P2962>, BVPH ID <https://www.wikidata.org/wiki/Property:P2961> - Query examples: letters with more than two forms <https://query.wikidata.org/#SELECT%20%3Fletter%20%3FletterLabel%20%28COUNT%…> (source <https://twitter.com/WikidataFacts/status/752829801954680832>), metro stations <https://query.wikidata.org/#%23defaultView%3AMap%0ASELECT%20DISTINCT%20%3Fi…> (source <https://twitter.com/PoulpyWP/status/752936275599032321>), railway incidents <https://query.wikidata.org/embed.html#%23defaultView%3ATimeline%0ASELECT%20…> (source <https://twitter.com/innovimax/status/753140210960101376>), pyramids in Egypt <https://query.wikidata.org/#%23defaultView%3AMap%0ASELECT%20%3Fitem%20%3Fit…> (source <https://twitter.com/PoulpyWP/status/753287958648721409>), women described as wife <https://query.wikidata.org/#SELECT%20DISTINCT%20%3Fwife%20%3FwifeLabel%20%3…> and men described as husband <https://query.wikidata.org/#SELECT%20DISTINCT%20%3Fhusband%20%3FhusbandLabe…> (source <https://twitter.com/WikidataFacts/status/753243808800858112>), neuroinformatics coauthor network <https://query.wikidata.org/#%23defaultView%3AGraph%0Aselect%20%3Fauthor1%20…> (source <https://twitter.com/fnielsen/status/753298419586834432>), nationalities of people with an article in the Bavarian Wikipedia <https://query.wikidata.org/#select%20%3Fnation%20%28count%28distinct%20%3Fp…> (source <https://twitter.com/vrandezo/status/753313818000756736>), Irish general elections and their winners <https://query.wikidata.org/#%23defaultView%3ATimeline%0ASELECT%20%3Fdate%20…> (source <https://twitter.com/vrandezo/status/753652225247842304>), types of historical monuments <https://query.wikidata.org/#SELECT%20DISTINCT%20%3Fidentifier%20%28SAMPLE%2…> (source <https://twitter.com/PoulpyWP/status/753644019683618816>), Alpine four-thousanders <https://query.wikidata.org/#SELECT%20%3Fitem%20%3FitemLabel%20%3FmassifLabe…> (source <https://twitter.com/PoulpyWP/status/754745959112904704>), Alpine peaks <https://query.wikidata.org/#SELECT%20%3Fitem%20%3FitemLabel%20%3FmassifLabe…> (source <https://twitter.com/PoulpyWP/status/754762471735517184>), language statements that point to a country instead of a language <https://query.wikidata.org/#SELECT%20DISTINCT%20%3Fwork%20%3FworkLabel%20%3…> (source <https://twitter.com/WikidataFacts/status/754972015585558528>) - Newest database reports: list of people who died on their birthday <https://www.wikidata.org/wiki/Wikidata:WikiProject_Q5/lists/people_who_died…> Development - Map layers are coming soon to the Query Service <https://twitter.com/JonasMKress/status/752949261696991232> - A lot of clean-up under the hood for the user interface - More interviews with editors in preparation for automated list generation - Fixed a bug where forms on the mobile site looked broken ( phabricator:T138413 <https://phabricator.wikimedia.org/T138413>) - More work on Citoid integration for easier reference adding - Added Cape Verdean Creole (phabricator:T127435 <https://phabricator.wikimedia.org/T127435>) You can see all open tickets related to Wikidata here <https://phabricator.wikimedia.org/maniphest/query/4RotIcw5oINo/#R>. Monthly Tasks - Hack on one of these <https://phabricator.wikimedia.org/maniphest/query/R8GRzX1eH0tb/#R>. - Help develop the next summary here! <https://www.wikidata.org/wiki/Wikidata:Status_updates/Next> - Contribute to a Showcase item <https://www.wikidata.org/wiki/Wikidata:Showcase_items> - Help translate <https://www.wikidata.org/wiki/Special:LanguageStats> or proofread pages in your own language! - Help merge identical items <https://www.wikidata.org/wiki/User:Pasleim/projectmerge> across Wikimedia projects. - Add labels, in your own language(s), for the new properties listed above. Anything to add? Please share! :) Cheers Lydia -- Lydia Pintscher - http://about.me/lydia.pintscher Product Manager for Wikidata Wikimedia Deutschland e.V. Tempelhofer Ufer 23-24 10963 Berlin www.wikimedia.de Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V. Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das Finanzamt für Körperschaften I Berlin, Steuernummer 27/029/42207.

1 0

Creating humans with no sitelink
by Loic Dachary 17 Jul '16

17 Jul '16

Hi, Volker Kohaupt https://www.wikidata.org/wiki/Q17147547 was created because he is an author of https://www.wikidata.org/wiki/Q13716155. Is this ok ? It does not seem to satisfy the first notability criterion for https://www.wikidata.org/wiki/Wikidata:Notability (no sitelink). But maybe that's an exception ? Cheers -- Loïc Dachary, Artisan Logiciel Libre

2 2

How many languages supports Wikibase/Wikidata?
by Jan Macura 16 Jul '16

16 Jul '16

Hi all, looking into [1] I read that Wikidata supports 358 languages. Is it still true? For example, I tried to add label in language coded as "nan" (defined in ISO 639-3) and it worked. However it didn't worked for e.g. "arb", which is also part of the ISO 639-3 standard. So how many? Thanks Jan [1] VRANDEČIĆ, Denny, KRÖTZSCH, Markus. Wikidata: A Free Collaborative Knowledgebase. *Communications of the ACM*. 2014-10, Vol. 57 No. 10, 7885. DOI 10.1145/2629489. http://cacm.acm.org/magazines/2014/10/178785-wikidata /fulltext

4 3

[WDQS] SPARQL sorting and collation
by Envel Le Hir 16 Jul '16

16 Jul '16

Hello, I have a SPARQL query that returns French labels of people with the family name (P734) Labrousse (Q25273100), sorting them by label: http://tinyurl.com/hq44ea8 The problem is that French rules for sorting are not applied: Élisabeth Labrousse and Émile Labrousse should be between Audran Labrousse and Ernest Labrousse, and not at the end of the results. This seems conform to SPARQL specifications (ordering is undefined for literals with language tags): https://www.w3.org/TR/2013/REC-sparql11-query-20130321/#modOrderBy Some SPARQL engines like Dydra use language tags to sort strings: http://blog.dydra.com/2015/05/06/collation It seems that Blazegraph should be able to do the same thing (using ICU library), but the documentation is old (yep, 2013 is old ! :p) and I don't know how WDQS is configured: https://wiki.blazegraph.com/wiki/index.php/Unicode Is there a solution to use French (or other languages) sorting in WDQS? Thanks, Envel

3 2

Wikidata property for items about software
by Loic Dachary 15 Jul '16

15 Jul '16

Hi, I created "Wikidata property for items about software" https://www.wikidata.org/wiki/Q25857383 modeled after "Wikidata property for items about works" https://www.wikidata.org/wiki/Q18618644. It is my understanding that it will automagically display all properties that are an instance of "Wikidata property for items about software" in the "Software" part (which does not exist yet) of https://www.wikidata.org/wiki/Wikidata:List_of_properties/all There also is an existing "Wikidata property for software" https://www.wikidata.org/wiki/Q21126229 which seems identical but worded slightly differently. I realize creating a new item with an identical purpose was a mistake but I don't know how to remove this new item. And I don't understand why properties that are instance of "Wikidata property for software" do not show in https://www.wikidata.org/wiki/Wikidata:List_of_properties/all. I would very much appreciate any advice you may have on this topic :-) Cheers -- Loïc Dachary, Artisan Logiciel Libre

2 2

Serialization issues from Wikidata to SPARQL triple store
by Sebastian Burgstaller 13 Jul '16

13 Jul '16

Hi all! First of all, let me say that we all love the SPARQL endpoint, it's a great service and it has become essential to how we interact with Wikidata and run our bots. Great job by Stas and others! I am also aware that it is still in beta mode. There is just one issue, which plagues us and I have filed a bug report regarding that in Sep 2015 (https://phabricator.wikimedia.org/T112397), so the issue got alleviated, but it turned out that it did not get fully resolved: -Occassionally, data written to an item in Wikidata via the API does not make it into the triple store. (Frequency of the issue is hard to determine) -It is a crucial issue because it can lead to data inconsistency by creating duplicate items or incorrect properties/values on items. -It seems to happen while the SPARQL endpoint is under high load (just my impression) How data is affected: -New data does not make it into the triple store -Updates to and merges of items do not make it to the triple store, so 'ghost items' are returned which have actually been merged or queries show/miss resutls/items incorreclty because freshly added/deleted data has not been completely serialized. Example: item https://www.wikidata.org/wiki/Q416356, a protein, recently got added protein domains via the 'has part' property. This did not show up in SPARQL queries and a DESCRIBE query for that item returned that these triples were not there indeed. (item has been modified, so it is fine now.) A solution seems to be to modify the item as this seems to trigger re-serialization. But this is certainly not practical for larger imports. Furthermore, as long as such an item does not get modified, data could be missing/ghosting from/in the triple store for weeks or even months. And it turns out to be quite difficult to determine how much of a certain import effort finally made it into the triple store, if you do not want to iterate through all items modified and check if everything is in the triple store, which would take significant amounts of time. Could you maybe give us more info on the status of this issue and if we could do something to help alleviating it? Thank you! Sebastian (sebotic) -- Sebastian Burgstaller-Muehlbacher, PhD Research Associate Andrew Su Lab MEM-216, Department of Molecular and Experimental Medicine The Scripps Research Institute 10550 North Torrey Pines Road La Jolla, CA 92037

2 1

Drag'n'drop Gadget functionality issues
by Brill Lyle 13 Jul '16

13 Jul '16

Hello -- There is a new *Drag'n'drop gadget* available in Wikidata > Preferences > Gadgets https://www.wikidata.org/wiki/Special:Preferences#mw-prefsection-gadgets * Drag'n'drop*: Add statements and references from Wikidata or Wikipedia by dragging and dropping them. Please note that there are issues with the gadget. I would like to evangelize this gadget but for my purposes it is not functioning ---> i.e., I drag the reference and get a shadow image of text and even after waiting 5 minutes the reference does not apply; refresh clears the attempted addition It might be a browser issue, but I have tried it on both Mac and PC, as well as on Chrome, Firefox, Safari, and SeaMonkey with exactly the same unsuccessful results. I have talked with Magnus about this (he is not having the problems I have had). It might also be a Wikidata response issue, but I think that issue was resolved. Currently if you use Wiki Markup and want to use one of the four Cite templates via the RefToolbar these templates I don't think will transfer - Cite books: not configured. Will get error message - I have not tested Cite web, Cite journal, Cite news, but assume none of these are configured to be captured by this tool. Obviously to configure the tool to work with these RefToolbar citation templates would be significant amount of time & effort. But if this tool was fully functional and robust -- and was able to transfer ALL of the piped data -- it would allow for a great interoperability of citations between Wikipedia and Wikidata. I don't build citations without using templates, as my assumption is that templates are more machine readable and more useful -- and are more consistent -- but obviously others are using different approaches. I assume bare urls are probably the most transferrable. But those didn't work for me either, so..... I really appreciate the fact that this gadget is available and the hard work it took to create it. Magnus has been very patient and kind offlist trying to problem-solve the issues I have had. I just wanted to follow up and provide this information, as it seems an important tool for us citation-focused editors. Best, - Erika *Erika Herzog* Wikipedia *User:BrillLyle <https://en.wikipedia.org/wiki/User:BrillLyle>*

3 4

Software API : {{P:part of}} or {{P:API}} ?
by Loic Dachary 13 Jul '16

13 Jul '16

Hi, An API such as the OpenStreetMap API ( https://www.wikidata.org/wiki/Q25822543 ) is a subclass of the API item https://www.wikidata.org/wiki/Q16519, it makes sense to me that the OpenStreetMap API is a part of ( https://www.wikidata.org/wiki/Property:P361 ) OpenStreetmap ( https://www.wikidata.org/wiki/Q936 ). Another way of looking at it would be to have an "API" property which would link OpenStreetmap to its API. I'm new to wikidata and would very much appreciate your advice on this. Cheers -- Loïc Dachary, Artisan Logiciel Libre

4 8

Problems with Serbian on Wikidata
by Smolenski Nikola 13 Jul '16

13 Jul '16

Something very weird is going on with Serbian language on Wikidata, so I wanted to draw more attention to it. As always, this is probably applicable to Chinese etc. as well. It used to be that, if someone visits Wikidata from Serbia, Serbian language did not appear in the list of languages for adding label and description. This is described in https://phabricator.wikimedia.org/T121747 However, as of right now, if someone visits Wikidata from Serbia, he will get Serbian language twice: in Cyrillic (српски) and Latin (srpski) variant. To my knowledge, it has never been discussed to conclusion whether there should be independent labels for the variants. Either way, one of the consequences of this is that all the Serbian labels that have been entered so far are invisible in this list. It gets even funnier if you go to https://www.wikidata.org/wiki/Q3711?uselang=sr-el since now you get Serbian three times: as "srpski (latinica)‎", "Serbian (Cyrillic script)" and "српски". It appears that the first is sr-el, the second is sr-ec and the third is the new sr-cyrl. I didn't want to play with editing, since this is a mess already. It appears that this is caused by an attempt to fix T121747 while simultaneously changing Serbian-language codes (https://phabricator.wikimedia.org/T117845). Either way, I believe it warrants more attention.

1 1

Primary Sources Tool code has been moved to Wikidata org
by Lydia Pintscher 12 Jul '16

12 Jul '16

Hey folks :) Based on requests here Denny and I have worked on getting the Primary Sources Tool code moved from the Google to the Wikidata organisation. This has now happened and it is available at https://github.com/Wikidata/primarysources from now on. I hope this will lead to more contributions from more people as I believe it is an important part of Wikidata's data flow. Cheers Lydia -- Lydia Pintscher - http://about.me/lydia.pintscher Product Manager for Wikidata Wikimedia Deutschland e.V. Tempelhofer Ufer 23-24 10963 Berlin www.wikimedia.de Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V. Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das Finanzamt für Körperschaften I Berlin, Steuernummer 27/029/42207.

4 3

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

Wikidata July 2016