Wikidata October 2017

wikidata@lists.wikimedia.org

64 participants
36 discussions

Wiki PageID
by Gintautas Sulskus 05 Oct '18

Hi,

I have a couple of questions regarding the Wiki Page ID. Does it always stay unique for the page, where the page itself is just a placeholder for any kind of information that might change over time?

Consider the following cases:

1. The first time someone creates the page "Moon", it is assigned ID=1. If at some point the page is renamed to "The_Moon", the ID=1 remains intact. Is this correct?
2. Suppose we have the page "Moon" with ID=1, and someone creates a second page "The_Moon" with ID=2. Is it possible that the page "Moon" is transformed into a redirect, so that "Moon" redirects to "The_Moon"?
3. Is it possible for the page "Moon" to become a category "Category:Moon" with the same ID=1?

Thanks,
Gintas
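One way to check these cases empirically is to ask the MediaWiki API for the page ID behind a title, before and after a rename. The following is only a minimal sketch: the choice of English Wikipedia as the endpoint, the helper names, and the sample response (which reuses the ID=1 from the question) are assumptions for illustration, not part of the original message.

```python
import json
from urllib.parse import urlencode

# Assumption: English Wikipedia; any MediaWiki api.php endpoint works the same way.
API_URL = "https://en.wikipedia.org/w/api.php"


def build_pageid_query(title):
    """Build an action=query URL that returns the page ID for a title.

    redirects=1 asks the API to follow a redirect, so the ID returned
    belongs to the redirect target rather than the redirect page itself.
    """
    params = {"action": "query", "titles": title, "redirects": 1, "format": "json"}
    return API_URL + "?" + urlencode(params)


def extract_page_ids(response_text):
    """Map each returned page title to its page ID."""
    data = json.loads(response_text)
    pages = data.get("query", {}).get("pages", {})
    return {p["title"]: p["pageid"] for p in pages.values() if "pageid" in p}


# Hypothetical response body, shaped like the API's default JSON output.
SAMPLE = '{"query": {"pages": {"1": {"pageid": 1, "ns": 0, "title": "Moon"}}}}'

print(build_pageid_query("Moon"))
print(extract_page_ids(SAMPLE))
```

Fetching the URL for the old and the new title and comparing the returned IDs shows directly which page kept which ID.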

Wikidata HDT dump
by Laura Morales 03 Oct '18

Hello everyone,

I'd like to ask if Wikidata could please offer an HDT [1] dump along with the already available Turtle dump [2]. HDT is a binary format to store RDF data, which is pretty useful because it can be queried from the command line, it can be used as a Jena/Fuseki source, and it also uses orders of magnitude less space to store the same data.

The problem is that it's very impractical to generate an HDT file, because the current implementation requires a lot of RAM to convert a file. For Wikidata it will probably require a machine with 100-200GB of RAM. This is unfeasible for me because I don't have such a machine, but if you guys have one to share, I can help set up the rdf2hdt software required to convert Wikidata Turtle to HDT.

Thank you.

[1] http://www.rdfhdt.org/
[2] https://dumps.wikimedia.org/wikidatawiki/entities/

360 panoramic views
by Thad Guidry 19 Dec '17

Hi folks!

So we at Schema.org like the idea of your new Panorama View property, as noted here and gaining interest: https://github.com/schemaorg/schemaorg/issues/1768

We also thought a bit about that and are thinking of an additional property (a subtype for us) to hold a 360 panoramic view (a complete spherical projection view, also known as photo bubbles, photo spheres, etc.).

Could any seasoned Wikidata folks create that 360 panoramic view property proposal for us? :)

Thad
+ThadGuidry <https://www.google.com/+ThadGuidry>

Coordinate precision in Wikidata, RDF & query service
by Stas Malyshev 06 Nov '17

Hi!

I would like to initiate a discussion about coordinate precision in Wikidata and the Query Service. The reason is that right now we do not have any limit on precision: coordinates are basically doubles, which allows specifying over-precise coordinates and makes it harder to compare them, both among themselves within Wikidata and with outside services.

From the precision description in [1], we would rarely need more than the third or fourth digit after the decimal point. However, we have in the database coordinates like:

Point(13.366666666 41.766666666)

which pretends to specify it with sub-millimeter accuracy, for an entity that describes a municipality [2]! We do have precision on values (e.g. the above has a specified precision of "arcseconds"), so it may be just a formatting issue, but even arcsecond precision looks somewhat over-precise for a city. And it may be a bit challenging to convert DMS precision to DD precision.

But the bigger question is whether we should store over-precise coordinates in the database at all, or whether we should round them on export or inside the data. The formulae that are used to calculate distances have, for obvious reasons, limited precision, and direct comparisons can't take precision into account, which may make such coordinates very hard to work with.

Should we maybe just put a limit on how precise the coordinates we put into RDF and the query service are? Would four decimals after the dot be enough? According to [4], this is what a commercial GPS device can provide. If not, why, and which accuracy would be appropriate?

We do export the precision of the coordinate as wikibase:geoPrecision [3], and we currently have 258060 distinct values for it. This is very weird. I am not sure precision is useful in this form. Can anybody tell me any use case for this number now? If not, maybe we should change how we represent it. I'm also not sure where these come from, as we only have 13 options in the UI. Bots?

[1] https://en.wikipedia.org/wiki/Decimal_degrees
[2] https://www.wikidata.org/wiki/Q116746
[3] https://www.mediawiki.org/wiki/Wikibase/Indexing/RDF_Dump_Format#Globe_coor…
[4] https://gis.stackexchange.com/questions/8650/measuring-accuracy-of-latitude…

--
Stas Malyshev
smalyshev(a)wikimedia.org
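To make the arithmetic behind these numbers concrete, here is a rough sketch of how many meters a given number of decimal places corresponds to, how an arcsecond precision maps to decimal digits, and what rounding the example point to four decimals looks like. The figure of about 111.32 km per degree is an approximation (it varies with latitude and direction), and all function names are illustrative, not anything Wikidata or the query service actually uses.

```python
import math

# Assumption: one degree is roughly 111.32 km on the ground (true near the
# equator; the exact value depends on latitude and direction).
METERS_PER_DEGREE = 111_320.0

ARCSECOND_DEG = 1 / 3600  # one arcsecond expressed in decimal degrees


def decimal_resolution_m(decimals):
    """Approximate ground distance covered by the last decimal digit."""
    return METERS_PER_DEGREE * 10 ** (-decimals)


def decimals_for_precision(precision_deg):
    """Smallest number of decimals whose step is no coarser than the precision."""
    return max(0, math.ceil(-math.log10(precision_deg)))


def round_point(lon, lat, decimals):
    """Round both coordinates of a point to a fixed number of decimals."""
    return (round(lon, decimals), round(lat, decimals))


for d in (3, 4, 9):
    print(d, "decimals ~", decimal_resolution_m(d), "m")

print("arcsecond needs", decimals_for_precision(ARCSECOND_DEG), "decimals")
print(round_point(13.366666666, 41.766666666, 4))
```

Under this approximation, four decimals correspond to roughly 11 m and nine decimals to roughly 0.1 mm, which is where the "sub-millimeter" remark comes from; an arcsecond is about 0.00028 degrees, so four decimals are enough to represent it.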

Do you use the Wikidata entity dump dcatap.rdf?
by Marius Hoch 03 Nov '17

Hi folks, is anyone using the Wikidata entity dump dcatap.rdf at https://dumps.wikimedia.org/wikidatawiki/entities/dcatap.rdf? It is very rarely used and is thus causing us a (probably) undue maintenance burden, because of which we plan to remove it. If anyone is making use of it, please speak up so that we can keep it or find a viable alternative. Cheers, Marius

How to get direct link to image
by Laura Morales 01 Nov '17

- Wikidata entry: https://www.wikidata.org/wiki/Q161234
- "logo image" property pointing to: https://commons.wikimedia.org/wiki/File:0_A.D._logo.png

However, that's an HTML page. How do I get a reference to the .png file? In this case: https://upload.wikimedia.org/wikipedia/commons/1/1c/0_A.D._logo.png

Thanks.
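Two common ways to get from a Commons file name to the file itself can be sketched as follows. The first builds a Special:FilePath URL, which Commons redirects to the actual file. The second reconstructs the upload.wikimedia.org path on the assumption that the two directory levels are the first one and first two hex digits of the MD5 hash of the file name (with spaces replaced by underscores); that layout is an assumption about how the URL in the question is derived, and the function names are made up for this example.

```python
import hashlib
from urllib.parse import quote


def file_path_url(file_name):
    """URL that Commons redirects to the actual file (Special:FilePath)."""
    name = file_name.replace(" ", "_")
    return "https://commons.wikimedia.org/wiki/Special:FilePath/" + quote(name)


def hashed_upload_url(file_name):
    """Guess the direct upload URL from the MD5 hash of the file name.

    Assumption: the directory levels are digest[0] and digest[:2] of the
    MD5 of the underscore-normalised name.
    """
    name = file_name.replace(" ", "_")
    digest = hashlib.md5(name.encode("utf-8")).hexdigest()
    return "https://upload.wikimedia.org/wikipedia/commons/{}/{}/{}".format(
        digest[0], digest[:2], quote(name)
    )


print(file_path_url("0_A.D._logo.png"))
print(hashed_upload_url("0_A.D._logo.png"))
```

The Special:FilePath form is the safer choice when the storage layout should not be relied on; the hashed form avoids the extra redirect.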

Wikimedia Blog - Wikidata at Five
by Andrew Lih 01 Nov '17

Here’s a piece I wrote with Rob Fernandez for the Wikimedia blog about Wikidata at five and Wikidatacon. https://blog.wikimedia.org/2017/10/30/wikidata-fifth-birthday/ -Andrew

WDCM: Wikidata usage in Wikivoyage
by Goran Milovanovic 31 Oct '17

Hi,

responding to Yaroslav Blanter's following observation on this mailing list:

"However, when I look at the statistics of usage, http://wdcm.wmflabs.org/WDCM_UsageDashboard/ I see that Wikivoyage allegedly uses, in particular, genes, humans (quite a lot, actually), and scientific articles. How could this be? I am pretty sure it does not use any of these."

Please note that the *Wikidata item usage per semantic category in each project type* chart that you have referred to in a later message has a logarithmic y-scale (there's a note explaining this immediately below the title of the chart). Also, even from the chart that you were referring to, you can see that Wikivoyage projects taken together make no use of the categories Gene and Scientific Article. The usage of the logarithmic y-axis there is a necessity; otherwise we could not offer a comparison across the project types (because the differences in usage statistics are huge).

Here's my suggestion on how to obtain more readable (and more precise) information:

- go to the WDCM Usage Dashboard: http://wdcm.wmflabs.org/WDCM_UsageDashboard/
- Tab: Dashboard, and then Tab: Tabs/Crosstabs
- Enter only: _Wikivoyage in the "Search projects:" field, and select all semantic categories in the "Search categories:" field
- Click "Apply Selection"

What you should be able to learn from the results is that on all Wikivoyage projects taken together the total usage of Q5 (Human) is 26, and that no items from the Gene (Q7187) or Scientific Article (Q13442814) category are used there at all.

Important reminder. The usage statistic in WDCM has the following semantics:

- pick an item;
- count on how many pages in a particular project that item is used;
- sum up the counts to obtain the usage statistic for that particular item in the particular project.

All WDCM Dashboards have a section titled "Description" which provides this and similarly important definitions, as well as (hopefully) simple descriptions of the respective dashboard's functionality.

Hope this helps.

Best,
Goran

Goran S. Milovanović, PhD
Data Analyst, Software Department
Wikimedia Deutschland

------------------------------------------------
"It's not the size of the dog in the fight, it's the size of the fight in the dog." - Mark Twain
------------------------------------------------
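The usage statistic described above can be sketched in a few lines. This is only an illustration of the stated semantics (count the distinct pages per project on which an item is used, then sum over the items and projects of interest); the records, project names and function names are hypothetical and are not real WDCM data or code.

```python
from collections import defaultdict

# Hypothetical (project, page, item) usage records; not real WDCM data.
RECORDS = [
    ("enwikivoyage", "Berlin", "Q42"),
    ("enwikivoyage", "Berlin", "Q64"),
    ("enwikivoyage", "Paris", "Q42"),
    ("dewikivoyage", "Berlin", "Q64"),
    ("enwikivoyage", "Berlin", "Q42"),  # duplicate record, counted once
]


def item_usage(records):
    """Number of distinct pages per (project, item) pair."""
    pages = defaultdict(set)
    for project, page, item in records:
        pages[(project, item)].add(page)
    return {key: len(page_set) for key, page_set in pages.items()}


def total_usage(usage, items, project_suffix):
    """Sum the per-project counts for a set of items over matching projects."""
    return sum(
        count
        for (project, item), count in usage.items()
        if item in items and project.endswith(project_suffix)
    )


usage = item_usage(RECORDS)
print(usage)
print(total_usage(usage, {"Q42"}, "wikivoyage"))
```

Because pages are collected in a set, an item that appears several times on the same page still counts as one use of that item on that page.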

Wikidata Concepts Monitor
by Lydia Pintscher 31 Oct '17

Hey folks :)

As you might already have seen in the birthday presents list, there is another birthday present: the Wikidata Concepts Monitor (WDCM - http://wdcm.wmflabs.org). It is a tool that enables you to browse and build an understanding of the way Wikidata is used across the Wikimedia projects.

Here’s the technical gist behind it: Currently 789 projects have client-side Wikidata usage tracking enabled, which allowed us to build a system that counts the number of pages using a particular Wikidata item per project. The count data were subjected to statistical modeling (1) by an unsupervised statistical learning algorithm - https://en.wikipedia.org/wiki/Latent_Dirichlet_allocation (2) that is typically used in distributional semantics - https://en.wikipedia.org/wiki/Distributional_semantics (3) to discover the most natural groupings of Wikidata items in 14 semantic categories - https://en.wikipedia.org/wiki/Topic_model (4) in respect to the way they are used across the Wikimedia universe by the respective communities.

We hope for the WDCM system to become a tool that helps you discover. Beyond Wikidata’s syntax and semantics we are now beginning to learn about its pragmatics: the way Wikidata items cluster in respect to how they are used is not necessarily the same as the way they go together in the Wikidata formal ontology. WDCM is the first step towards building an understanding of the highly complicated structure of Wikidata usage.

This system can help you discover which Wikidata client projects are similar and in what respect, which semantic categories of items are used more or less frequently across 789 projects, how items connect in respect to how similarly they are used by our communities, what the most popular items per project are, and many more (hopefully) interesting things.

Check out the WDCM and don’t forget to let us know what you think on the WDCM Wikidata project discussion page!
I'd love to hear about any cool or interesting things you find in the visualizations. https://www.wikidata.org/wiki/Wikidata:Wikidata_Concepts_Monitor Thanks to Goran who put in a lot of time to get this up and running and everyone who helped him. Cheers Lydia -- Lydia Pintscher - http://about.me/lydia.pintscher Product Manager for Wikidata Wikimedia Deutschland e.V. Tempelhofer Ufer 23-24 10963 Berlin www.wikimedia.de Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V. Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das Finanzamt für Körperschaften I Berlin, Steuernummer 27/029/42207.

Weekly Summary #284
by Léa Lacroix 30 Oct '17

Hello all, I'm back in your mailboxes for the Weekly Summary :) Discussions - New request for comments: Defining account creators <https://www.wikidata.org/wiki/Wikidata:Requests_for_comment/Defining_accoun…> Events <https://www.wikidata.org/wiki/Special:MyLanguage/Wikidata:Events>/ Press/Blogs <https://www.wikidata.org/wiki/Special:MyLanguage/Wikidata:Press_coverage> - Past: WikiArabia 2017 <https://meta.wikimedia.org/wiki/WikiArabia/2017>, Cairo, Egypt, 23-25 October - Past: WikidataCon <https://www.wikidata.org/wiki/Wikidata:WikidataCon_2017>, Berlin, 28-29 October. You can find the slides, notes <https://www.wikidata.org/wiki/Wikidata:WikidataCon_2017/Documentation/Proce…> and video recordings <https://media.ccc.de/b/conferences/wikidatacon/2017> for most of the sessions - Past: Wikidata's fifth birthday <https://www.wikidata.org/wiki/Wikidata:Fifth_Birthday> - Upcoming: Using Wikidata to create a multi-lingual multi-dialectal dictionary for Arabic dialects <http://www.aiccsa.net/AICCSA2017/images/final-detailed-program-17102017.pdf> *in* the 14th ACS/IEEE International Conference on Computer Systems and Applications (AICCSA 2017), Yasmine Hammamet, Tunisia, 30 October-3 November. - Upcoming : Wikidata and Wiki Loves Monuments editathon in Berlin, 24-26 November (in German <https://de.wikipedia.org/wiki/Wikipedia:WLM-Wikidata-Editathon_2017>). We're looking for trainers to explain the basic use of QuickStatements, Open Refine and Mix'n'Match. If you want to join, please contact MB-one <https://www.wikidata.org/wiki/User:MB-one> Wikidata's birthday The fifth anniversary of Wikidata was celebrated, as every year, with a lot of presents, events and stories. You can find an overview on the birthday page <https://www.wikidata.org/wiki/Wikidata:Fifth_Birthday>. 
- *Birthday presents* - Search index now contains statements <https://phabricator.wikimedia.org/T175199> with instance of (P31) <https://www.wikidata.org/wiki/Property:P31> and subclass of (P279) <https://www.wikidata.org/wiki/Property:P279>. These can be used to influence search rankings <https://phabricator.wikimedia.org/T148411> and in the future also for matching. - inventaire.io CC0 Wikidata-ready NDJSON and TTL dumps are now accessible at dumps.inventaire.io - You can now move claims to new items with moveClaim.js <https://www.wikidata.org/wiki/User:Mat%C4%9Bj_Such%C3%A1nek/moveClaim.js>. You can do so by typing "new". - Prefix search on Wikidata (wbsearchentities API) now is using ElasticSearch as backend <https://phabricator.wikimedia.org/T175741>. This should improve search quality and also is more flexible and tunable. - Wikidata becomes a proper citizen of the linked open data web <https://lists.wikimedia.org/pipermail/wikidata/2017-October/011314.html> - graves.wiki – visualization of grave locations stored in Wikidata <http://graves.wiki/> - Wikidata Concepts Monitor (WDCM) <http://wdcm.wmflabs.org/> - Wikidata semantic topics and usage statistics - Happy Birthday to You, Dear Wikidata <https://lucaswerkmeister.de/music/happy-birthday-dear-wikidata-2017/> – recording and improvisation (Lucas Werkmeister <https://www.wikidata.org/wiki/User:Lucas_Werkmeister>) - You can now create items from the command-line <https://github.com/maxlath/wikidata-cli/blob/master/docs/write_operations.m…> - Crochet your own cute structured data bee <https://commons.wikimedia.org/wiki/File:Structured_Data_Bee_-_crochet_patte…> to express your appreciation for structured data in the Wikimedia movement! And ask her/him/it to join the Wikimedia Cuteness Association (Q29169245) <https://www.wikidata.org/wiki/Q29169245>. 
- Wikidata Query Service UI: Geoshapes from Commons are now displayed in the Map view: Constituencies for the election to the German Bundestag 2017, with winning candidate and party <https://query.wikidata.org/embed.html#%23defaultView%3AMap%0A%23%20constitu…> - Wikidata Query Service UI: The query service now shows you code examples for how to use your query in many programming languages (example query <https://query.wikidata.org/#%23Cats%0ASELECT%20%3Fitem%20%3FitemLabel%20%0A…> ) - Dungeon of Knowledge <https://www.johl.io/dungeonofknowledge/> ( source <https://github.com/johl/dungeonofknowledge>), a roguelike game where you explore a dungeon full of bits of Wikidata wisdom - Simple WD <http://tools.wmflabs.org/simplewd/> an experiment of a simple frontend API on top of Wikidata using JSON-LD and schema.org. - mapview <https://www.wikidata.org/wiki/User:Aude/mapview.js> widget/user script on Wikidata - Images fragments for an item depicted on several artworks <http://zone47.com/crotos/lab/cropper/p180iiif.php?q=Q79746&l=en> - Histropedia Query Timeline tool <http://histropedia.com/showcase/wikidata-viewer.html?> - Demo of cool new features launched today for WikidataCon :) - Q42 (Q42395533) <https://www.wikidata.org/wiki/Q42395533> - the first item about a notable item! - SQID <https://tools.wmflabs.org/sqid/> now suggests statements that we are missing (example page, log in to see suggestions) <https://tools.wmflabs.org/sqid/#/view?id=Q1249802> -- more on Wikidata:WikiProject_Reasoning <https://www.wikidata.org/wiki/Wikidata:WikiProject_Reasoning> soon - From labs VM to Wikibase Query Service in 2 minutes (using Docker images and docker-compose <https://github.com/addshore/wikibase-docker>) - *Stories and reflections* - Wishes for the year to come, by Tpt <https://www.wikidata.org/wiki/User:Tpt/Fifth_Birthday> - Happy Birthday Wikidata! 
by the Gene Wiki Team <http://sulab.org/2017/10/happy-birthday-wikidata/> - Some random thoughts on the occasion of Wikidata's fifth birthday, by PKM <https://www.wikidata.org/wiki/User:PKM/bday5> - Wishes and a personal story, by Spinster <https://medium.com/@sandrafauconnier/happy-5th-birthday-wikidata-176fb85f79…> - Happy Birthday, Wikidata by RolandUnger from Wikivoyage <https://www.wikidata.org/wiki/User:RolandUnger/Happy_Birthday,_Wikidata%21> - Happy Birthday, Wikidata by Katherine Maher <https://commons.wikimedia.org/wiki/File:Happy_Birthday_Wikidata%21.webm> - Two years of Wikidata experience from ArthurPSmith ArthurPSmith <https://www.wikidata.org/wiki/User:ArthurPSmith/5th_Birthday> - Happy Birthday wishes from Wikimedia Deutschland <https://commons.wikimedia.org/wiki/File:Wikidata_WMDE.webm> - Message from the development team <https://www.wikidata.org/wiki/Wikidata:Fifth_Birthday/Lydia> Did you know? - Newest properties <https://www.wikidata.org/wiki/Special:ListProperties>: Biblioteca Nacional de México ID <https://www.wikidata.org/wiki/Property:P4440>, MNCARS artist ID <https://www.wikidata.org/wiki/Property:P4439>, BFI Film and TV ID <https://www.wikidata.org/wiki/Property:P4438>, FPB rating <https://www.wikidata.org/wiki/Property:P4437>, The Coptic Library ID <https://www.wikidata.org/wiki/Property:P4436>, snap package <https://www.wikidata.org/wiki/Property:P4435>, LesBiographies.com ID <https://www.wikidata.org/wiki/Property:P4434>, Indian Foundation for Butterflies ID <https://www.wikidata.org/wiki/Property:P4433>, AKL Online Artist ID <https://www.wikidata.org/wiki/Property:P4432>, Google Doodle <https://www.wikidata.org/wiki/Property:P4431>, New York City Parks Monument ID <https://www.wikidata.org/wiki/Property:P4430>, Pro14 player ID <https://www.wikidata.org/wiki/Property:P4429>, implementation of <https://www.wikidata.org/wiki/Property:P4428>, GACS ID <https://www.wikidata.org/wiki/Property:P4427>, Y-DNA Haplogroup 
<https://www.wikidata.org/wiki/Property:P4426>, mtDNA haplogroup <https://www.wikidata.org/wiki/Property:P4425>, mandates <https://www.wikidata.org/wiki/Property:P4424>, Portuguese lighthouse ID <https://www.wikidata.org/wiki/Property:P4423>, U.S. Ski and Snowboard Hall of Fame athlete ID <https://www.wikidata.org/wiki/Property:P4422>, Sportbox.ru ID <https://www.wikidata.org/wiki/Property:P4421>, VNDB ID <https://www.wikidata.org/wiki/Property:P4420>, Videolectures ID <https://www.wikidata.org/wiki/Property:P4419>, New Zealand Sports Hall of Fame ID <https://www.wikidata.org/wiki/Property:P4418>, rfpl.org player ID <https://www.wikidata.org/wiki/Property:P4417>, Panthéon des sports du Québec ID <https://www.wikidata.org/wiki/Property:P4416>, Sport Australia Hall of Fame inductee ID <https://www.wikidata.org/wiki/Property:P4415>, New Brunswick Sports Hall of Fame athlete ID <https://www.wikidata.org/wiki/Property:P4414>, Manitoba Sports Hall of Fame athlete ID <https://www.wikidata.org/wiki/Property:P4413>, Ontario Sports Hall of Fame athlete ID <https://www.wikidata.org/wiki/Property:P4412>, Quora username <https://www.wikidata.org/wiki/Property:P4411>, Women's Basketball Hall of Fame ID <https://www.wikidata.org/wiki/Property:P4410> - Query examples: - Organic acids with images <https://query.wikidata.org/#%23defaultView%3AImageGrid%0ASELECT%20%3Fcompou…> (source <https://twitter.com/egonwillighagen/status/924640136301875201>) - People who are their father's father <https://query.wikidata.org/#SELECT%20%3FaLabel%20%3FbLabel%20WHERE%20%7B%0A…> (source <https://twitter.com/L3viathan2142/status/924701259583705090>) - 2017 German federal election results by district with geoshapes and color depending on the % of votes <https://query.wikidata.org/#%23defaultView%3AMap%0A%23%20constituencies%20f…> (source <https://twitter.com/WikidataFacts/status/924640696186884096>) - Cities with most first performances of works 
<https://query.wikidata.org/#%23%20cities%20with%20most%20first%20performanc…> (source <https://twitter.com/WikidataFacts/status/924417988207497216>) - Count of women vs. guys named “John” in the UK parliament <https://query.wikidata.org/#%23%20UK%20parliaments%20with%20count%20of%20Jo…> (source <https://twitter.com/JeanFred/status/924370535508832256>) - Percentage of popes who died in Rome <https://query.wikidata.org/#%23%20%25%20of%20popes%20who%20died%20in%20Rome…> (source <https://twitter.com/WikidataFacts/status/924198828240199680>) - First sites registered at UNESCO for each country <https://query.wikidata.org/#%23defaultView%3AMap%0ASELECT%20DISTINCT%20%3Fi…> (source <https://twitter.com/PoulpyWP/status/922563280043835392>) Development - Continued working on persistent storage of edits on Lexeme pages - Fixed an encoding problem in the SVG download of query results in the query service (phabricator:T178564 <https://phabricator.wikimedia.org/T178564>) - Fixed a problem with dispatching changes to Wikipedia and co. that sometimes led to single wikis falling way behind. (T179060 <https://phabricator.wikimedia.org/T179060>) - Added full URIs for external identifiers to the RDF export - You can leave feedback on how the improved fulltext search result page <https://lists.wikimedia.org/pipermail/wikidata-tech/2017-October/001180.html> should look You can see all open tickets related to Wikidata here <https://phabricator.wikimedia.org/maniphest/query/4RotIcw5oINo/#R>. Monthly Tasks - Add labels, in your own language(s), for the new properties listed above. - Comment on property proposals: all open proposals <https://www.wikidata.org/wiki/Wikidata:Property_proposal/Overview> - Suggested and open tasks <https://www.wikidata.org/wiki/Wikidata:Contribute/Suggested_and_open_tasks> ! - Contribute to a Showcase item <https://www.wikidata.org/wiki/Special:MyLanguage/Wikidata:Showcase_items> . 
- Help translate <https://www.wikidata.org/wiki/Special:LanguageStats> or proofread the interface and documentation pages, in your own language! - Help merge identical items <https://www.wikidata.org/wiki/User:Pasleim/projectmerge> across Wikimedia projects. - Help write the next summary! <https://www.wikidata.org/wiki/Wikidata:Status_updates/Next> -- Léa Lacroix Project Manager Community Communication for Wikidata Wikimedia Deutschland e.V. Tempelhofer Ufer 23-24 10963 Berlin www.wikimedia.de Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V. Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das Finanzamt für Körperschaften I Berlin, Steuernummer 27/029/42207.
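For readers who want to try the prefix search mentioned among the birthday presents, the following is a small sketch of calling the wbsearchentities API and reading its result. The helper names are made up for this example, and the sample response is a hand-written, trimmed illustration of the response shape rather than real output.

```python
import json
from urllib.parse import urlencode

API_URL = "https://www.wikidata.org/w/api.php"


def build_search_url(prefix, language="en", limit=5):
    """Build a wbsearchentities request URL for a label prefix."""
    params = {
        "action": "wbsearchentities",
        "search": prefix,
        "language": language,
        "limit": limit,
        "format": "json",
    }
    return API_URL + "?" + urlencode(params)


def extract_matches(response_text):
    """Return (id, label) pairs from a wbsearchentities JSON response."""
    data = json.loads(response_text)
    return [(hit["id"], hit.get("label", "")) for hit in data.get("search", [])]


# Hand-written sample in the shape of a response; not real API output.
SAMPLE = '{"search": [{"id": "Q42", "label": "Douglas Adams"}], "success": 1}'

print(build_search_url("Douglas Ad"))
print(extract_matches(SAMPLE))
```

Fetching the printed URL with any HTTP client returns the JSON that `extract_matches` expects.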
