Hi,
TL;DR: Did anybody consider using the Wikidata items of Wikipedia templates to
store multilingual template parameter mappings?
Full explanation:
As in many other projects in the Wikimedia world, templates are one of the
biggest challenges in developing the ContentTranslation extension.
Translating a template between languages is tedious: many templates are
language-specific, many others have a corresponding template but
incompatible parameters, and even when the parameters are compatible, there
is usually no convenient mapping between them. Some work in that direction was
done in DBpedia, but AFAIK it's far from complete.
In ContentTranslation we have a simplistic mechanism for mapping template
parameters between pairs of languages, with a proof of concept for three
templates. We can extend it to more templates, but the question is how
well it can scale.
Some templates shouldn't need such mapping at all - they should pull their
data from Wikidata. This is gradually being done for infoboxes in some
languages, and it's great.
But not all templates can be easily mapped to Wikidata data - for example,
reference templates, various IPA and language templates, quotation
formatting, and so on. For these, parameter mapping could be useful, but
doing it for a single language pair at a time doesn't seem robust, and it
reminds me of the old way in which interlanguage links were stored.
So, did anybody consider using the Wikidata items of templates to store
multilingual template parameter mappings?
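To make the idea concrete, here is a minimal Python sketch of what such a mapping could look like if it were stored as structured data on a template's Wikidata item. Everything below is hypothetical: the layout, the template names, the parameter names, and the translate_call helper are illustrative only, not an existing schema.

# Hypothetical sketch only: a language-neutral parameter mapping that could
# live on a template's Wikidata item. Template names, parameter names and the
# layout below are made up for illustration.
CITE_WEB_MAPPING = {
    "template": {"en": "Cite web", "es": "Cita web"},
    "params": {
        "url":   {"en": "url",   "es": "url"},
        "title": {"en": "title", "es": "título"},
        "date":  {"en": "date",  "es": "fecha"},
    },
}

def translate_call(params, src, dst, mapping=CITE_WEB_MAPPING):
    """Rename a template call's parameters from language src to language dst."""
    reverse = {names[src]: key for key, names in mapping["params"].items()}
    translated = {}
    for name, value in params.items():
        neutral = reverse.get(name)
        if neutral is None:
            continue  # unknown parameter; a real tool would flag it for review
        translated[mapping["params"][neutral][dst]] = value
    return translated

# Prints {'url': 'http://example.org', 'título': 'Example'}
print(translate_call({"url": "http://example.org", "title": "Example"}, "en", "es"))

A translation tool could then fetch the mapping once per template and apply it to any language pair, instead of maintaining one table per pair.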
--
Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי
http://aharoni.wordpress.com
“We're living in pieces,
I want to live in peace.” – T. Moore
Hi everyone,
I'd just like to announce another experimental Wikidata SPARQL endpoint
[1], kindly provided by the folks at SpazioDati [2].
It contains both the simplified and the complete dumps, as per [3].
Each dump file is stored under a different named graph.
We are collecting the query logs, and will share the most frequent queries.
Cheers!
[1] http://wikisparql.org/
[2] http://spaziodati.eu/en/
[3] http://tools.wmflabs.org/wikidata-exports/rdf/exports/20150223/
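For anyone who wants to try it programmatically, here is a small Python sketch using the standard SPARQL protocol over HTTP. The /sparql path and the query vocabulary are assumptions on my part; check the endpoint's front page and the loaded graphs from [3] before relying on them.

# Minimal sketch: query the endpoint over the SPARQL protocol with plain HTTP.
# The exact endpoint path and the vocabulary loaded into the graphs are
# assumptions; adjust them after inspecting [1] and [3].
import requests

ENDPOINT = "http://wikisparql.org/sparql"  # assumed path under [1]

QUERY = """
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
SELECT ?item ?label WHERE {
  ?item rdfs:label ?label .
  FILTER(LANG(?label) = "en")
}
LIMIT 10
"""

response = requests.get(
    ENDPOINT,
    params={"query": QUERY},
    headers={"Accept": "application/sparql-results+json"},
    timeout=30,
)
response.raise_for_status()
for row in response.json()["results"]["bindings"]:
    print(row["item"]["value"], row["label"]["value"])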
On 3/21/15 1:00 PM, wikidata-l-request(a)lists.wikimedia.org wrote:
> On 3/20/15 2:08 PM, Markus Kroetzsch wrote:
>> Dear all,
>>
>> Thanks to the people at the Center of Semantic Web Research in Chile
>> [1], we have a very first public SPARQL endpoint for Wikidata running.
>> This is very preliminary, so do not rely on it in applications and
>> expect things to fail, but you may still enjoy some things.
>>
>> http://milenio.dcc.uchile.cl/sparql
> You have a SPARQL endpoint that provides access to Wikidata dumps loaded
> into an RDF-compliant RDBMS (in this case a Virtuoso RDBMS instance). I
> emphasize "a" because "the first" isn't accurate.
>
> There are other endpoints that provide access to Wikidata dumps:
>
> [1] http://lod.openlinksw.com/sparql -- 61 billion+ RDF triples culled
> from across the LOD Cloud (if you look up Wikidata URIs that are objects
> of owl:sameAs relations you'll end up in Wikidata's own Linked Data Space)
>
> [2] http://wikidata.metaphacts.com/sparql -- another endpoint I
> discovered yesterday.
--
Marco Fossati
http://about.me/marco.fossati
Twitter: @hjfocs
Skype: hell_j
Hi all,
I am pleased to announce that the Freebase-Wikidata mappings are now publicly
available.
http://github.com/Samsung/KnowledgeSharingPlatform
Google already provides a mapping between Freebase and Wikidata
(https://developers.google.com/freebase/data); however, it may not be kept
up to date. We extracted a set of identity relations between Freebase and
Wikidata entities using Wikipedia links, and we also tested several
algorithms for finding matching entity pairs. Although this approach cannot
identify every matching entity in the two datasets, it should be a useful
resource for understanding how their instances relate. The source code for
extracting the data will also be shared soon.
The data is serialised in the N-Triples format, and its details are as
follows (a small parsing sketch follows the list):
- Total 4,395,258 triples (same entity pairs)
- Updated: February 13, 2015
- Data Format: N-Triples RDF
- License: CC0
- File size: 236 MB (zipped) / 2.5 GB (uncompressed)
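In case it helps, here is a minimal Python sketch for streaming the uncompressed file. It assumes each line is a plain N-Triples statement linking a Freebase entity URI to a Wikidata entity URI; the file name and the predicate are not something I have verified, so inspect a few lines first.

# Minimal sketch: stream (subject, object) URI pairs out of an N-Triples file.
# Assumes simple <s> <p> <o> . lines with URI objects; literals are skipped.
import re

TRIPLE = re.compile(r'^<([^>]+)>\s+<([^>]+)>\s+<([^>]+)>\s*\.\s*$')

def read_mappings(path):
    """Yield (freebase_uri, wikidata_uri) pairs from an uncompressed .nt file."""
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            match = TRIPLE.match(line)
            if match:
                subject, _predicate, obj = match.groups()
                yield subject, obj

# Example usage (the file name is hypothetical):
# for freebase_uri, wikidata_uri in read_mappings("freebase-wikidata.nt"):
#     print(freebase_uri, "<->", wikidata_uri)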
Feel free to ask me if you have any questions.
Cheers,
Haklae Kim
Senior Engineer
Samsung Electronics Co., Ltd.
scot.kim(a)samsung.com / haklaekim(a)gmail.com
--
Dr.Dr. Haklae Kim
Semantic Web and Open Data Hacker
Open Knowledge Foundation Korea
http://thedatahub.kr · http://getthedata.kr · http://blogweb.co.kr
Tel: +82-(0)10-3201-0714
Who's Who in the World's 27th Edition - 2010
IBC 2000 Outstanding Scientists - 2010
Hi all --
Have we considered separating in some way (in the UI, and possibly the data
model) properties which track identifiers in external databases vs.
properties that describe the item using Wikidata-internal links? As more
and more external identifiers are added, it's easy to get lost in them
while looking for the right property to describe an item.
We're effectively already doing this with Wikimedia identifiers by calling
them "sitelinks", and it seems like a logical extension of that concept to
group other kinds of external identifiers in their own section, rather than
having CANTIC, BIBSYS, Freebase, or even DMOZ identifiers mixed together
with the primary descriptors of an author or a work, for example.
Thanks,
Erik
Hello all,
Wikidata consists of millions of individual data items, which is great. In
order to facilitate modeling the interactions between these items, we
hereby suggest using OWL-based ontologies (
http://en.wikipedia.org/wiki/Web_Ontology_Language).
We think that using ontologies brings several advantages:
- Looking at an ontology (which could be generated collaboratively, e.g. on
webprotege.stanford.edu) gives a very clear overview of how data is
interconnected. This would allow for modeling even very large and/or
complex interactions.
- Laying out a data integration project as an ontology first, before
actually integrating the data into WD, makes property proposals easier: the
ontology and its properties could be designed first and then generated as a
whole, with all of its properties and classes.
- Data could be queried/exported from WD based on an ontology by simply
selecting the whole ontology or parts of it (a small sketch of such an
ontology fragment follows below).
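To illustrate (this is not the actual draft), a tiny OWL fragment of the kind we have in mind could be generated with rdflib along these lines; the namespace, classes, and the associatedWith property below are placeholders, not the proposed Wikidata properties.

# Toy sketch of an OWL ontology fragment built with rdflib. The namespace,
# the classes and the associatedWith property are placeholders only.
from rdflib import Graph, Literal, Namespace
from rdflib.namespace import OWL, RDF, RDFS

EX = Namespace("http://example.org/gene-disease#")
g = Graph()
g.bind("ex", EX)
g.bind("owl", OWL)

g.add((EX.Gene, RDF.type, OWL.Class))
g.add((EX.Disease, RDF.type, OWL.Class))
g.add((EX.associatedWith, RDF.type, OWL.ObjectProperty))
g.add((EX.associatedWith, RDFS.domain, EX.Gene))
g.add((EX.associatedWith, RDFS.range, EX.Disease))
g.add((EX.associatedWith, RDFS.label, Literal("associated with", lang="en")))

print(g.serialize(format="turtle"))

Such a fragment could be drafted collaboratively (e.g. in WebProtégé) and then translated into concrete property proposals on WD.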
This approach has been suggested and discussed by Benjamin Good, Elvira
Mitraka, Andra Wagmeester, Andrew Su, and me. As an example, we put together
draft properties for gene-disease interactions, which allows the WD
community to discuss this approach. A preliminary version can be found
here:
https://www.wikidata.org/wiki/User:ProteinBoxBot/GeneDiseaseIteraction_Disc…
Best regards,
Sebastian
Hi Everyone,
2015 is an election year for the Board of Trustees of the Wikimedia
Foundation <https://wikimediafoundation.org/wiki/Board_of_Trustees> as well
as for the Funds Dissemination Committee
<https://meta.wikimedia.org/wiki/Grants:APG/Funds_Dissemination_Committee>.
As you may recall, the Board has three directly-elected members who serve
for two years. Currently they are Phoebe Ayers (Phoebe
<https://meta.wikimedia.org/wiki/User:Phoebe>), Samuel Klein (SJ
<https://meta.wikimedia.org/wiki/User:Sj>) and María Sefidari (Raystorm
<https://meta.wikimedia.org/wiki/User:Raystorm>). As in past years, we
rely on an effective election committee to coordinate the elections for us,
along with staff support and a Board liaison. Not only do they guarantee
that the election is overseen by an independent body, but they also make
sure that the tremendous amount of work that needs to be done is taken care
of. My job, as this year's Board liaison, is to coordinate the formation of
this committee and to support it in its work while serving as the
primary point of contact with the Board regarding the process.
This is a call for volunteers to serve on the election committee. If you
feel that you can contribute to this committee, please email James
Alexander (Jalexander(a)wikimedia.org) and give a small summary of why you
think you would be able to help out with this process.
The Committee is responsible for planning and maintaining virtually every
aspect of the Board election. For example, the Committee plans the type of
voting, suffrage criteria, and criteria for candidacy, helps to draft and
organize all of the official election pages on Meta, verifies that
candidates and voters meet the criteria, audits votes to ensure there are
no duplicate votes or other problems, et cetera. You can expect that this
work will take an average of 5-10 hours a week, with a few periods of
relative quiet and a few periods of heavy work during and after each
election (the FDC and Board elections are planned to be separate this year).
If you decide to join the committee, you will have to identify to the
Wikimedia Foundation
<https://meta.wikimedia.org/wiki/Steward_handbook/email_templates#Notificati…>
because of the personal information you will have access to, and you must be
at least 18 years of age. In addition, you cannot be part of the election
committee if you are planning to be a candidate or are planning to support
any candidate publicly.
To ensure we get going as quickly as possible, committee members will start
to be seated as soon as we have 4-5 good candidates, with an anticipated
first meeting on Friday, April 10th (or soon after, depending on committee
availability). The deadline for volunteers, however, is Friday, April 17th
at 12:00 UTC.
The committee and staff will soon be setting up the election pages, and the
call for candidates, led by a letter from the Board, will be going out
shortly. If you're interested in running for either the Board or the
FDC, I encourage you to read up on prior elections
<https://meta.wikimedia.org/wiki/Wikimedia_Foundation_elections_2013> and on
the groups themselves to prepare your statements!
Regards,
Alice.
--
Alice Wiegand
Board of Trustees
Wikimedia Foundation
Support Free Knowledge: https://wikimediafoundation.org/wiki/Donate