Can someone give an explanation of why the development of units is so
difficult, or what the problem seems to be? Is there anything other people
can do? It seems to me like this has serious feature creep...
https://phabricator.wikimedia.org/T77977
Hey folks :)
We enabled data access for Meta last night. Welcome to Wikidata,
Meta! Questions and coordination are happening at
https://www.wikidata.org/wiki/Wikidata:Meta-Wiki
Cheers
Lydia
--
Lydia Pintscher - http://about.me/lydia.pintscher
Product Manager for Wikidata
Wikimedia Deutschland e.V.
Tempelhofer Ufer 23-24
10963 Berlin
www.wikimedia.de
Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
Registered in the register of associations of the Amtsgericht
Berlin-Charlottenburg under number 23855 Nz. Recognized as charitable by
the Finanzamt für Körperschaften I Berlin, tax number 27/681/51985.
Hey,
There have been several discussions regarding the quality of information in
Wikidata. I wanted to work on the quality of Wikidata, but we don't have
any good source of information to see where we are ahead and where we are
behind. So I thought the best thing I could do is make something that shows
people, in detail, exactly how well sourced our data is. So here we have:
http://tools.wmflabs.org/wd-analyst/index.php
You can give only a property (let's say P31) and it gives you the four most
used values plus an analysis of sources and overall quality (check this out
<http://tools.wmflabs.org/wd-analyst/index.php?p=P31>),
and then you can see that about 33% of them are sourced, of which 29.1%
are based on Wikipedia.
You can also give a property and multiple values you want to compare.
Let's say you want to compare P27:Q183 (country of citizenship: Germany)
and P27:Q30 (US). Check this out
<http://tools.wmflabs.org/wd-analyst/index.php?p=P27&q=Q30|Q183>. You can
see US biographies are more abundant (300K versus 200K) but German
biographies are more descriptive (3.8 descriptions per item versus 3.2).
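The URL scheme in the examples above can be sketched with a small helper
(a hypothetical convenience function, not part of the tool itself; it only
assumes the `p` and `q` parameters shown in the example links):

```python
from urllib.parse import urlencode

# Base URL of the wd-analyst tool, as given above.
BASE = "http://tools.wmflabs.org/wd-analyst/index.php"

def analyst_url(prop, values=()):
    """Build a wd-analyst query URL: a property ID plus optional
    pipe-separated values, as in ?p=P27&q=Q30|Q183."""
    params = {"p": prop}
    if values:
        params["q"] = "|".join(values)
    # safe="|" keeps the pipe separator unescaped, matching the examples
    return BASE + "?" + urlencode(params, safe="|")
```

For instance, `analyst_url("P27", ["Q30", "Q183"])` reproduces the
citizenship comparison link above.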
One important note: compare P31:Q5 (a trivial statement), where 46% are
not sourced at all and 49% are based on Wikipedia, *but* get these
statistics for the population property (P1082
<http://tools.wmflabs.org/wd-analyst/index.php?p=P1082>). It's not a
trivial statement and we need to be careful about it. It turns out there
is slightly more than one reference per statement and only 4% of them are
based on Wikipedia. So we can relax and enjoy this highly sourced data.
Requests:
- Please tell me whether you want this tool at all
- Please suggest more ways to analyze and catch unsourced material
Future plan (if you agree to keep using this tool):
- Support more datatypes (e.g. date of birth based on year, coordinates)
- Sitelink-based and reference-based analysis (to check how many
articles of, let's say, the Chinese Wikipedia are unsourced)
- Free-style analysis: there is a database behind this tool that can be
used for many more applications. You can get the most unsourced statements
of P31 and then go and fix them. I'm trying to build a playground for
this kind of task.
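One way to catch unsourced statements, as suggested above, is a query
against the public SPARQL endpoint at query.wikidata.org. This is only a
sketch: the `p:` and `prov:` prefixes are the standard ones the service
predeclares, and the LIMIT keeps the query cheap (counting every P31
statement would likely time out):

```python
import urllib.parse

# Items with a P31 statement that carries no reference at all.
UNSOURCED_P31 = """
SELECT ?item WHERE {
  ?item p:P31 ?statement .
  FILTER NOT EXISTS { ?statement prov:wasDerivedFrom ?ref . }
}
LIMIT 100
"""

def request_url(query):
    """Build a GET request URL asking the endpoint for JSON results."""
    return ("https://query.wikidata.org/sparql?" +
            urllib.parse.urlencode({"query": query, "format": "json"}))
```

Fetching `request_url(UNSOURCED_P31)` with any HTTP client returns a JSON
result set of items whose P31 statements could use a reference.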
I hope you like this and rock on!
<http://tools.wmflabs.org/wd-analyst/index.php?p=P136&q=Q11399>
Best
Hi all,
there is some good news: I updated the Miga Classes and Properties
Browser, which collects several statistics about classes and properties
used in Wikidata. In the future it will be updated monthly.
You can find it here: http://tools.wmflabs.org/wikidata-exports/miga/
Hint: Since Miga uses WebSQL, the browser does not run in Internet
Explorer or Mozilla Firefox.
Best regards,
Markus
I'd like to share an application that I'm developing for technology
demonstrations, entitled WikiBrowser. It is a web application that
leverages the structure of Wikidata to semantically navigate Wikipedia
articles. It is being developed in Java using technologies such as
Spring Boot, Spring Cloud, and Cloud Foundry. This web application is
live at http://WikiBrowser.io and the code is open source and located in
my GitHub repository. There is a brief video that shows features of
WikiBrowser on my most recent blog post at http://JavaFXpert.com and I
hope that you'll take WikiBrowser for a spin!
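The core idea — using Wikidata's structure to hop between Wikipedia
articles — can be sketched in a few lines. This is not WikiBrowser's
actual code (that is Java/Spring); it is a minimal illustration assuming
only the entity JSON format that Wikidata serves via Special:EntityData:

```python
def linked_items(entity):
    """Return the set of item IDs this entity links to via its
    wikibase-item claims (the edges one could navigate along)."""
    linked = set()
    for claims in entity.get("claims", {}).values():
        for claim in claims:
            snak = claim.get("mainsnak", {})
            # novalue/somevalue snaks carry no datavalue, so guard for it
            if snak.get("datatype") == "wikibase-item" and "datavalue" in snak:
                linked.add(snak["datavalue"]["value"]["id"])
    return linked

def enwiki_title(entity):
    """Return the English Wikipedia article title for an entity, if any."""
    return entity.get("sitelinks", {}).get("enwiki", {}).get("title")
```

Navigating then means: resolve an item, show its article title, and offer
the items returned by `linked_items` as the next hops.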
Regards,
James Weaver
Developer Advocate
Pivotal Software
http://twitter.com/JavaFXpert
Hey,
the new query example dialog has just been released on query.wikidata.org.
It looks like this:
[inline image: screenshot of the query example dialog]
It has a cool feature to filter queries via a tag cloud:
[inline image: screenshot of the tag cloud filter]
The sample queries are parsed from this wiki page
<https://www.mediawiki.org/wiki/Wikibase/Indexing/SPARQL_Query_Examples#US_p…>.
When a query's items or properties are annotated via the Q template, they
will be shown in the tag cloud.
Please feel free to add new fancy queries!
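For anyone sharing a new query, a short link can be built programmatically;
this sketch assumes the service's convention of appending the URL-encoded
query after `#` (the sample query itself is a hypothetical example):

```python
from urllib.parse import quote

# A sample query: humans (Q5) with country of citizenship (P27) Germany (Q183).
SPARQL = """SELECT ?person WHERE {
  ?person wdt:P31 wd:Q5 ;
          wdt:P27 wd:Q183 .
} LIMIT 10"""

def share_link(query):
    """Build a shareable query.wikidata.org link for a SPARQL query."""
    return "https://query.wikidata.org/#" + quote(query)
```

Opening `share_link(SPARQL)` in a browser loads the query editor with the
query prefilled.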
Cheers,
Jonas
--
Jonas Kress
Software Developer
Wikimedia Deutschland e. V. | Tempelhofer Ufer 23-24 | 10963 Berlin
Phone: +49 (0)30 219 158 26-0
http://wikimedia.de
Imagine a world in which every single human being can freely share in
the sum of all knowledge. That's our commitment.
Markus,
On Wed, Dec 9, 2015 at 10:32 AM, Markus Krötzsch <
markus(a)semantic-mediawiki.org> wrote:
> What this page suggested was that Freebase being shut down means that
> Google will use Wikidata as a source. Note that the short intro text on the
> page did not say anything else about the subject, so I am surprised that
> this sufficed to convince you about the truth of that claim (it seems that
> other things I write with more support don't have this effect). Anyway, I
> am really sorry to hear that this quickly-written intro on the web has
> misled you. When I wrote this after Google had made their Freebase
> announcement last year, I really believed that this was the obvious
> implication. However, I was jumping to conclusions there without having
> first-hand evidence. I guess many people did the same. I fixed the
> statement now.
>
> To be clear: I am not saying that Google is not using Wikidata. I just
> don't know. However, if you make a little effort, there is a lot of
> evidence that Google is not using Wikidata as a source, even when it could.
> For example, population numbers are off, even in cases where they refer to
> the same source and time, and Google also shows many statements and sources
> that are not in Wikidata at all (and not even in Primary Sources).
>
> I still don't see any problem if Google would be using Wikidata, but
> that's another discussion.
>
> You mention "multiple sources".
> {{Which}}?
>
> Markus
>
For the record, here is what your university webpage used to say.[1]
---o0o---
Wikidata is the free, collaborative knowledge base behind Wikipedia and
many other Wikimedia projects. The Web site has been online since late 2012
and has since become an important data provider for Wikipedias in all
languages. Tens of thousands of users have contributed statements about
millions of entities. In December 2013, Google announced that their own
collaboratively edited knowledge base, Freebase, is to be discontinued in
favour of Wikidata*, which gives Wikidata a prominent role as an input for
Google Knowledge Graph*. The research group Knowledge Systems is working in
close cooperation with the development team behind Wikidata, and provides,
e.g., the regular Wikidata RDF-Exports.
Development of Wikidata started in April 2012 with a team of developers
based in the Berlin offices of Wikimedia Germany. The project was heavily
inspired by Semantic MediaWiki and Markus Krötzsch has been acting as an
architectural advisor to the project since its inception.
---o0o---
You were well placed to know. The source I quoted in the op-ed was a
different one though, a snippet from an IRC chat[2].
---o0o---
16:33:55 <dennyvrandecic> also, Wikidata is not a free ticket into the Knowledge Graph as Freebase was
16:34:07 <dennyvrandecic> it is just one source among many
16:34:27 <Lydia_WMDE> i think we really need to highlight this
16:34:30 <dennyvrandecic> benestar: actually I think that companies editing Wikidata might be very beneficial
...
---o0o---
As a Google employee working on Wikidata, Denny can be presumed to know
what is and isn't a source for the Knowledge Graph.
Noam Shapiro in SEJ commented on the above IRC chat, saying:[3]
---o0o---
As one of the insiders notes above, “Wikidata is not a free ticket into the
Knowledge Graph as Freebase was.” It may very well be that the direct
relationship observed between Freebase and the Knowledge Graph will not be
replicated in Wikidata’s relationship with the Knowledge Graph. That being
said, *it is still “one source among many,” and likely an important one*.
After all, the Knowledge Graph thrives on the existence of structured data,
and - especially in the absence of Freebase - that is exactly what Wikidata
provides.
---o0o---
In May of this year, Tony Edward published an article in Search Engine Land
titled *"Leveraging Wikidata to gain a Google Knowledge Graph result"*.[4]
---o0o---
Back in December 2014, Google announced that it would be shutting down
Freebase <http://wiki.freebase.com/wiki/Main_Page>, a repository of
structured data that helps power Google’s Knowledge Graph, and working to
migrate all its data to Wikidata.
But how does Wikidata measure up? *How can marketers leverage Wikidata to
help a business become an entity and gain a Knowledge Graph result? I have
personally had success* with gaining Knowledge Graph entries for my clients
and myself. Below, I have outlined the steps you can take to both gain and
enhance a Knowledge Graph result. [...]
---o0o---
Another article in Search Engine Land, by Barry Schwartz, reporting on the
closure of Freebase:[5]
---o0o---
This means that the data won’t be lost but instead will be transferred to
Wikimedia Foundation’s project Wikidata, which will have its own API so
that developers who want to retrieve facts automatically, as they did
with Freebase, can still do so. *This would include Google also pulling
data from Wikidata, to help power its Knowledge Graph.*
---o0o---
There are more articles like that ... I actually only came across your
university web page *after* I'd written the op-ed.
One other point. Denny said today on the Kurier talk page in the German
Wikipedia that he stands by his opinion, quoted earlier in this thread,
that Wikidata, being under the CC0 licence, must not import data from
Share-Alike sources. It would be irresponsible to do so, he said.[6]
If Wikidata with its CC0 licence must not import data from Share-Alike
sources, then I don't understand why there are mass imports from Wikipedia,
which is a Share-Alike source.
Andreas
P.S. Markus, your crossposts to Wikimedia-l still don't arrive there. Are
you a registered member of Wikimedia-l?
[1] https://archive.is/O8h8K
[2] https://archive.is/LoQXX#selection-2479.0-2519.74
[3]
http://www.searchenginejournal.com/wikidata-meets-google-knowledge-graph/13…
[4]
http://searchengineland.com/leveraging-wikidata-gain-google-knowledge-graph…
[5]
http://searchengineland.com/google-close-freebase-helped-feed-knowledge-gra…
[6] https://archive.is/bu9Io#selection-12005.450-12005.662