Hello, this is my first post here :)
For medium-sized queries that time out on a SPARQL endpoint, you can add
these clauses and it will sometimes give a result:
ORDER BY RAND()
LIMIT 100
You will only obtain some results in no specific order, but that is
better than nothing. It doesn't work every time, but it works more often
than without it or with only a LIMIT. Given that it is a hacky technique, it
should be the last thing to try, after other query optimisations.
For instance, when I search for all items with a GND ID (P227) but
neither P31 nor P279, I get a timeout even when I add only a LIMIT. When I
add these two clauses, I get a result.
Cheers,
~ Seb35
With rand:
https://query.wikidata.org/#SELECT%20DISTINCT%20%3Fitem%20%3FitemLabel%20%3…
Without rand:
https://query.wikidata.org/#SELECT%20DISTINCT%20%3Fitem%20%3FitemLabel%20%3…
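In case those shortened links stop working, here is a minimal sketch of the
kind of query described above. The SELECT clause matches what is visible in
the links, but the WHERE clause and the label service are reconstructed from
the description, so the exact linked queries may differ:

SELECT DISTINCT ?item ?itemLabel WHERE {
  ?item wdt:P227 ?gndId .                              # has a GND ID
  FILTER NOT EXISTS { ?item wdt:P31 ?class . }         # no "instance of"
  FILTER NOT EXISTS { ?item wdt:P279 ?superclass . }   # no "subclass of"
  SERVICE wikibase:label { bd:serviceParam wikibase:language "en" . }
}
ORDER BY RAND()   # the two extra clauses: return an arbitrary sample
LIMIT 100         # of at most 100 results instead of the full set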
Oh boy. I thought I had a few things figured out with Wikidata...until
I read through the Property Talk discussions for
https://www.wikidata.org/wiki/Property_talk:P2236
(with its mentions of Freebase mapping)
So...
I've been adding a few bits of Schema.org mapping into Wikidata today, and
stumbled upon some things that made me rethink my approach...lolol.
QUESTION:
How do I state that a Wikidata Entity (not a Property) such as
place of birth https://www.wikidata.org/wiki/Q1322263
is the same concept or idea as a Schema.org property
http://schema.org/birthPlace ?
I thought I could use P2236 above... but then it seems it's for WD
Properties, not Entities (subjects)?
SOLUTION? Perhaps we could adopt a best practice of treating
http://schema.org/birthPlace as an actual Identifier for the place of birth
concept
https://www.wikidata.org/wiki/Q1322263
...while reserving the WD Property place of birth
https://www.wikidata.org/wiki/Property:P19 to use equivalent property
https://www.wikidata.org/wiki/Property:P1628 ?
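(To make the proposal concrete: it amounts to attaching the URI
http://schema.org/birthPlace to both wd:Q1322263 and wd:P19, via two
different properties. Once such statements existed, a rough sanity check on
the query service could look like the sketch below; the generic ?anyProperty
is there precisely because the item-side property is the open question.)

SELECT ?subject ?anyProperty ?mapping WHERE {
  VALUES ?subject { wd:Q1322263 wd:P19 }   # the WD Entity and the WD Property
  ?subject ?anyProperty ?mapping .
  FILTER(STR(?mapping) = "http://schema.org/birthPlace")
}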
TPT and Denny didn't leave enough notes in there on what to do about
mapping cases for external vocabularies that are also loosely treated as
metadata dictionaries for the general web and for developers, like
Schema.org is.
Thoughts on the SOLUTION proposed ?
Thad
+ThadGuidry <https://www.google.com/+ThadGuidry>
Hey folks :)
Amir and others have worked hard over the past months to bring ORES to
Wikidata. The goal is to use machine learning to make it easier to spot
potentially bad edits. ORES is now available as a beta feature on Wikidata.
Once you have enabled it, some edits in recent changes and your watchlist
will show up in a different color or have a little "r" in front of
them. These edits are judged as potentially bad and should probably get
more review. In your preferences you can adjust how harshly ORES should
judge. You can also filter your watchlist/recent changes to only show
potentially bad edits. Patrolled edits won't be shown as potentially bad.
This should be a huge step towards making it easier to find and fight
vandalism on Wikidata.
Cheers
Lydia
--
Lydia Pintscher - http://about.me/lydia.pintscher
Product Manager for Wikidata
Wikimedia Deutschland e.V.
Tempelhofer Ufer 23-24
10963 Berlin
www.wikimedia.de
Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
Registered in the register of associations of the Amtsgericht
Berlin-Charlottenburg under number 23855 Nz. Recognized as a charitable
organization by the Finanzamt für Körperschaften I Berlin, tax number
27/029/42207.
Hey folks :)
Jonas has written a user script to show an image on an item. The goal is to
make it easier to see what the item is about and also spot potential
vandalism/data quality issues. It'd be great if you could give it a try to
see if this is something we should explore further. More details at
https://www.wikidata.org/wiki/Wikidata:Project_chat#Header_image_on_items
Cheers
Lydia
--
Lydia Pintscher - http://about.me/lydia.pintscher
Product Manager for Wikidata
Wikimedia Deutschland e.V.
Tempelhofer Ufer 23-24
10963 Berlin
www.wikimedia.de
Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
Hey :)
Amir has written a very nice little user script that adds a small icon
next to a statement. Clicking on it leads you to a query for items with the
same statement. So if you are on an item about a cat, you can click the icon
on the "instance of" statement and then find all other cats. More details
here:
https://www.wikidata.org/wiki/Wikidata:Project_chat#Do_you_want_to_see_stat…
Please also leave feedback there. If you like it, it can be turned into a gadget
later. (I don't think it is a good idea to put it in the default UI at this
point.)
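For anyone wondering what such a query looks like: for the cat example it
would presumably be roughly the following (a sketch assuming the clicked
statement is "instance of" (P31) = house cat (Q146); the script's actual
generated query may differ):

SELECT ?item ?itemLabel WHERE {
  ?item wdt:P31 wd:Q146 .   # items sharing the same "instance of: house cat" statement
  SERVICE wikibase:label { bd:serviceParam wikibase:language "en" . }
}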
Cheers
Lydia
--
Lydia Pintscher - http://about.me/lydia.pintscher
Product Manager for Wikidata
Wikimedia Deutschland e.V.
Tempelhofer Ufer 23-24
10963 Berlin
www.wikimedia.de
Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
[Feel free to blame me if you read this more than once]
To whom it may interest,
Full of delight, I would like to announce the first beta release of
*StrepHit*:
https://github.com/Wikidata/StrepHit
TL;DR: StrepHit is an intelligent reading agent that understands text
and translates it into *referenced* Wikidata statements.
It is an IEG project funded by the Wikimedia Foundation.
Key features:
- web spiders to harvest a collection of documents (corpus) from reliable
sources
- automatic corpus analysis to identify the most meaningful verbs
- sentence and semi-structured data extraction
- a machine learning classifier trained via crowdsourcing
- *supervised and rule-based fact extraction from text*
- Natural Language Processing utilities
- parallel processing
You can find all the details here:
https://meta.wikimedia.org/wiki/Grants:IEG/StrepHit:_Wikidata_Statements_Va…
If you like it, star it on GitHub!
Best,
Marco
I'm confused by this from today's Wikidata weekly summary:
- New request for comments: Semi-automatic Addition of References to
Wikidata Statements - feedback on the Primary Sources Tool
<https://www.wikidata.org/wiki/Wikidata:Requests_for_comment/Semi-automatic_…>
First of all, the title makes no sense, because "semi-automatic addition of
references to Wikidata statements" is one of the main things that the tool
can't currently do. If a statement already exists, you'll almost always end
up with a duplicate statement rather than the desired behavior of simply
adding the reference to the existing statement.
Second, I'm not sure who "Hjfocs" is (why does everyone have to make up
fake wikinames?), but why are they asking for more feedback when there's
been *ample* feedback already? There hasn't been an issue with getting
people to test the tool or provide feedback based on the testing. The issue
has been with getting anyone to *act* on the feedback. Everything is a)
"too hard," or b) "beyond our resources," or depends on something in
category a or b, or is incompatible with the arbitrary implementation
scheme chosen, or some other excuse.
We're 12-18+ months into the project, depending on how you measure, and not
only is the tool not usable yet, but it's no longer improving, so I think
it's time to take a step back and ask some fundamental questions.
- Are the current data pipeline and front-end gadget the right approach and
the right technology for this task? Can they be fixed to be suitable for
users?
- If so, should Google continue to have sole responsibility for it or
should it be transferred to the Wikidata team or someone else who'll
actually work on it?
- If not, what should the data pipeline and tooling look like to make
maximum use of the Freebase data?
The whole project needs a reboot.
Tom