Wikidata July 2012

wikidata@lists.wikimedia.org

29 participants
31 discussions

Re: [Wikidata-l] MerlIwBot
by Snaevar 12 Jul '12

12 Jul '12

The bot operator of MerlIwBot is Merlissimo. His homewiki is on the german wikipedia. You can contact him on this userpage: http://de.wikipedia.org/wiki/Benutzer_Diskussion:Merlissimo --- Snaevar ----- Original Message ----- From: Bináris Sent: 07/11/12 08:39 PM To: Discussion list for the Wikidata project. Subject: [Wikidata-l] MerlIwBot Could http://wikidata-test-repo.wikimedia.de/w/index.php?title=User:MerlIwBot please create a user page that leads to a real user to talk to? TIA -- Bináris

2 1

[Wikidata-l] New York City event on July 19th
by Sumana Harihareswara 11 Jul '12

11 Jul '12

>From your last weekly digest: > we will have a talk at the New York Times, open to the public, on Thursday, July 19th, at 7pm I did some web searching and looked on https://meta.wikimedia.org/wiki/Wikidata/Events and didn't see more information on this -- could you give a few more details so I can forward this to NYC acquaintances? I don't see it at https://www.nytimes.com/marketing/timesopen/ -- is it part of TimesOpen? Thanks! -- Sumana Harihareswara Engineering Community Manager Wikimedia Foundation

3 2

[Wikidata-l] MerlIwBot
by Bináris 11 Jul '12

11 Jul '12

Could http://wikidata-test-repo.wikimedia.de/w/index.php?title=User:MerlIwBotplea… create a user page that leads to a real user to talk to? TIA -- Bináris

1 0

[Wikidata-l] weekly summary #13
by Lydia Pintscher 11 Jul '12

11 Jul '12

Heya folks, Here is the Wikidata summary of the week before 2012-07-06. The wiki version is at http://meta.wikimedia.org/wiki/Wikidata/Status_updates/2012_07_01 I've been away for a week of vacation but it looks like the rest of the team did well without me for a bit :P = Development = * Sprint 8 has finished and Sprint 9 has started, a two-week sprint that will go on throughout Wikimania * Most of the items linked are now shown with their label instead of the cryptic ID * Switching languages is getting better and better * Search is working * A doxygen documentation is online at http://wikidata-docs.wikimedia.de/ See http://meta.wikimedia.org/wiki/Wikidata/Development/Current_sprint for what we’re working on next. = Discussions/Press = * The Wikidata logo submission has been closed as planned, and the voting has started. So far we had more than 500 votes, and the voting is open until next week. The winner of the contest will be announced at Wikimania. = Events = see http://meta.wikimedia.org/wiki/Wikidata/Events * Wikipedia Academy * Wikipedia Meetup NYC * upcoming: Wikimania with several talks * On a short notice, we will have a talk at the New York Times, open to the public, on Thursday, July 19th, at 7pm Anything to add? Please share! :) Cheers Lydia -- Lydia Pintscher - http://about.me/lydia.pintscher Community Communications for Wikidata Wikimedia Deutschland e.V. Obentrautstr. 72 10963 Berlin www.wikimedia.de Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V. Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das Finanzamt für Körperschaften I Berlin, Steuernummer 27/681/51985.

2 3

[Wikidata-l] DBpedia usage in the bbc
by Michael Smethurst 05 Jul '12

05 Jul '12

Hello A few notes on the BBC's use of DBpedia which Dan thought might be of interest to this list: Not sure how familiar you are with bbc web stuff so a brief introduction <skip-me> We have a large and somewhat sprawling website with 2 main sections: news article related stuff (including sports) and programme related stuff (tv and radio). In between these sections are various other domain specific bits (http://www.bbc.co.uk/music, http://www.bbc.co.uk/food, http://www.bbc.co.uk/nature etc) In the main we have actual content / data for news articles and programmes. Most of the other bits of co.uk are really just different ways of cutting this content / new aggregations. Because we don't have data for these domains we borrow from elsewhere (mostly from the LOD cloud). So /music is based on a backbone of musicbrainz data, /nature is based on numerous data sources (open and not so open) all tied together with dbpedia identifiers... In the main we don't really use dbpedia as a data source but rather as a source of identifiers to triangulate with other data sources So for example, we have 2 tools for "tagging" programmes with dbpedia identifiers. Short clips are tagged with one tool using dbpedia information resource uris, full episodes are tagged with another tool using dbpedia non-information resource uris (< don't ask) Taking /music as an example: because it's based on musicbrainz and because musicbrainz includes wikipedia uris for artists we can easily derive dbpedia uris (of whatever flavour) and query the programme systems for programmes tagged with that artist's dbpedia uri </skip-me> === some problems we've found when using dbpedia === 1. it's not really intended for use for data extraction. The semantics of extraction depend on the infobox data and this isn't always applied correctly. So http://en.wikipedia.org/wiki/Fox_News_Channel and http://en.wikipedia.org/wiki/Fox_News_Channel_controversies share the same main infobox meaning dbpedia sees them both as tv channels 2. wikipedia tends to conflate many objects into a single item / page. Eg http://en.wikipedia.org/wiki/Penny_Lane has composer details, duration details and release information conflating composition with recording with release 3. the data extraction is a bit flakey in parts. Mainly because it's been done by a small team and it covers so many different domains. 4. wikipedia doesn't do redirects properly. So http://en.wikipedia.org/wiki/Spring_watch and http://en.wikipedia.org/wiki/Autumn_watch are based on the same data / return the same content and are flagged as a redirect internally but they don't actually 30x. This is confusing for editorial staff knowing which uri to "tag" with 5. wikipedia uris are derived from the article title. If the article title changes the uri changes. Dbpedia uris are derived from wikipedia uris so they also change when wikipedia uris / titles change. This has caused us no end of upsets. An example: bbc.co.uk/nature uses wiki|dbpedia uri slugs. So http://en.wikipedia.org/wiki/Stoat on wikipedia is http://www.bbc.co.uk/nature/life/Stoat on bbc.co.uk Apparently people in the UK call stoats stoats and people in the US call them ermine (or the other way round) which lead to an edit war on wikipedia which caused the dbpedia uri to flip repeatedly and our aggregations to break. We've had similar problems with music artists (can't quite remember the details but seem to remember some arguments about how the "and" should appear in Florence and the Machine http://en.wikipedia.org/wiki/Florence_and_the_Machine 6. Titles do change often enough to cause us problems. Particularly names for people Nic (cced) has done some work on dbpedia lite (http://dbpedialite.org/) which aims to provide stable identifiers for dbpedia concepts based on (I think) wikipedia table row identifiers (which wikimedia do claim are guaranteed) 7. wikipedia has a policy that aims toward one outbound link per infobox. So for a person or organisation page eg they tend to settle on that person / orgs's homepage and not their social media accounts or web presence(s) elsewhere. Which makes dbpedia less useful as an identifier triangulation point === end of problems (at least the one's I can remember) === So I think we'd be interested in wikidata for 2 (maybe 3) reasons: 1. as a source of data for domains where there's no established (open) authority (eg the equivalent of musicbrainz for films) 2. as a better, more stable source of identifiers to triangulate to other data sources ?3?. Possibly as a place to contribute of some of our data (eg we're donating our classical music data to musicbrainz; there may be data we have that would be useful to wikidata) Have glanced quickly at the proposed wikidata uri scheme (http://meta.wikimedia.org/wiki/Wikidata/Notes/URI_scheme#Proposal_for_Wikid ata) and <snip> http://{site}.wikidata.org/item/{Title} is a semi-persistent convenience URI for the item about the article Title on the selected site Semi-persistent refers to the fact that Wikipedia titles can change over time, although this happens rarely </snip> Not sure on the definition of infrequently but I know it's caused us problems. Wondering if the id in http://wikidata.org/id/Q{id} is the wikipedia row ID (as used by dbpedialite)? Also wondering why there's a different set of URIs for machine-readable access rather than just using content negotiation? Cheers Michael http://www.bbc.co.uk/ This e-mail (and any attachments) is confidential and may contain personal views which are not the views of the BBC unless specifically stated. If you have received it in error, please delete it from your system. Do not use, copy or disclose the information in any way nor act in reliance on it and notify the sender immediately. Please note that the BBC monitors e-mails sent or received. Further communication will signify your consent to this.

6 9

[Wikidata-l] DBpedia usage in the bbc - selected highlights
by Michael Hopwood 05 Jul '12

05 Jul '12

Hello Michael, Nicholas et list, I hope you don't mind me jumping in here with a few comments on selected highlights of this thread. >>> Taking /music as an example... I wonder if you have looked at book data? I am working on issues to do with linked (open?) book data and it would be useful to compare notes. >>> wikipedia tends to conflate... composition with recording with release... On the other hand, data does exist that separates these (and more!) entity types out very clearly, and it's potentially highly *linked* but it's unlikely to be *open*. See: http://www.ddex.net/ddex-present - ddex descriptive data schemas, but also note the links there to IDs for -names (ISNI) -compositions (ISWC) -recordings (ISRC) -releases (GRid) These are all industry-standard IDs, and thus pretty stable. Maybe a starting point? >>> ...domains where there's no established (open) authority (eg the equivalent of musicbrainz for films)... EIDR? http://eidr.org/ - " EIDR is operated on a non-profit cost-recovery basis..." but maybe you get the stability and granularity you pay for? Plus; "... EIDR is founded on the principle of open participation and welcomes all ecosystem players (commercial and non-profit) to join the Registry as registrant, lookup user or even a promoter. The Registry is intended to provide a foundational namespace for A/V objects that can be leveraged by participant in the eco-system to further their own business needs and offerings." - http://eidr.org/resources/ Cheers, Michael

2 1

Re: [Wikidata-l] DBpedia usage in the bbc Re: DBpedia usage in the bbc - selected highlights - selected highlights
by Yury Katkov 05 Jul '12

05 Jul '12

Re: [Wikidata-l] DBpedia usage in the bbc - selected highlights

1 0

[Wikidata-l] Fwd: MTSR 2012 - call for papers
by Manuel Palomo Duarte 03 Jul '12

03 Jul '12

Hello everyone. Sorry for the off-topic, but I'm sure in the list there are people interested in this conference on meda-data and semantics technologies. King regards. ---------- Forwarded message ---------- From: <conference.mtsr2012(a)uca.es> Date: 2012/7/2 Subject: MTSR 2012 - call for papers To: manuel.palomo(a)uca.es ## MTSR 2012: Metadata and Semantics Research Conference November 28-30, 2012 University of Cádiz, Spain - Edificio Constitución 1812 http://mtsr2012.uca.es Following the success of the first five editions (MTSR'05, MTSR'07, MTSR'09, MTSR'10 and MTSR'11), the sixth International Conference on Metadata and Semantics Research (MTSR'12) aims to bring together researchers and practitioners that share a common interest in metadata, its representation, its semantics and its diverse applications to Information Systems. ## Scope and topics Contributions are welcome on every topic related to Metadata and their relationships with Ontologies, Semantic Web, Knowledge Management and Software Engineering, such as: ### I. Foundations * Typology of metadata and metadata uses * The value and cost of metadata * Quality evaluation in the use of Metadata * Metadata reusability * New or revised metadata schemas or application profilesv * Metadata standardization * Empirical studies on metadata and/or ontologies usage ### II. Languages and Frameworks for Metadata Management * SGML, XML, UML in theory and practice * Languages and Frameworks for Ontology Management * Metadata and the Semantic Web * Metadata and Knowledge Management * Metadata and Software Engineering * Metadata application of Semantic Web technologies * Ontologies and Ontology-based Knowledge Management Systems ### III. Case Studies * Metadata and ontologies for librarianship, management of historical archives and archeological research * Metadata and ontologies for the design of innovative products and processes * Metadata and ontologies for health, biological and clinical information management * Metadata and ontologies in finance, tourism and public administrations * Metadata and ontologies in industry * Metadata and ontologies in education * Metadata and ontologies in agriculture, food and environment ### IV. Technological Issues * Technologies for Metadata and ontology storage * Technologies for Metadata and ontology integration * Technologies for Metadata extraction and navigation, querying and editing of ontologies * Technologies for Learning Objects management ## Paper submission Interested authors can submit to EasyChair [2] either full papers (12 pages) or short papers (6 pages) reporting complete or ongoing research respectively. Papers should be original and not previously submitted to other Conference or Journals. The main Conference will be preceded by a Workshop/Tutorial Day, which aims at presenting new topics, frontiers and ongoing researches in the Metadata, Ontology and Semantic Research fields. Proceedings will be published by Springer in the CCIS book series [3]. In addition, best papers will be selected for publishing revised versions at several International Journals covering the diversity of topics. Authors of accepted papers will be asked to register to the Conference and present their work in the form of either oral presentation or poster presentation. Looking forward to see you in Cadiz [4], the oldest continuously-inhabited city in the Southwestern Europe. ## Important Dates * *2012, April 15th*: Title and abstract (500 words) submission - not mandatory * *2012, July 20th*: Paper submission * *2012, Sept 1st*: Acceptance/rejection notification * *2012, Sept 15st*: Camera-ready papers due * *2012, Nov 28-30th*: Conference at University of Cadiz ## Program Chairs * Juan Manuel Dodero, University of Cádiz, Spain * Pythagoras P. Karampiperis, NCSR Demokritos, Greece ## Organization Chairs * Manuel Palomo Duarte, University of Cádiz, Spain * Iván Ruiz Rube, University of Cádiz, Spain * Giannis Stoitsis, IEEE ## Steering Committee * Miguel-Ángel Sicilia, University of Alcalá, Spain * Nikos Manouselis, Agro-Know Technologies, Greece * Fabio Sartori, Università degli Studi di Milano-Bicocca, Italy ## Local Organization Committee: * Antonio Balderas Alberico, University of Cádiz, Spain * Daniel Crespo Bernal, University of Cádiz, Spain * Nuria Hurtado Rodríguez, University of Cádiz, Spain * Manuel Palomo Duarte, University of Cádiz, Spain * Iván Ruiz Rube, University of Cádiz, Spain * José Tomás Tocino García, University of Cádiz, Spain. ## Program Committee members * Nikos Palavitsinis, Greek Research & Technoogy Network, Greece * Rajendra Akerkar, Western Norway Research Institute (NORWAY) * Arif Altun, Hacettepe University (TURKEY) * Luis Anido Rifón (SPAIN) * Petek Askar, Izmir University of Economics (TURKEY) *(to be confirmed)* * Ioannis N. Athanasiadis, Democritus University of Thrace (GREECE) * Tomaž Bartol, University of Ljubljana (SLOVENIA) * Paolo Bouquet, University of Trento (ITALY) *(to be confirmed)* * Gerhard Budin, University of Vienna (AUSTRIA) * Kürsat Cagiltay, METU (TURKEY) *(to be confirmed)* * Caterina Caracciolo, Food and Agriculture Organization of the United Nations (ITALY) * Artem Chebotko, University of Texas - Pan American (USA) * Stavros Christodoulakis, Technical University of Crete (GREECE) *(to be confirmed)* * Rafael Corchuelo, University of Seville (SPAIN) * Constantina Costopoulou, Agricultural University of Athens (GREECE) * Sally Jo Cunningham, Waikato University (NEW ZEALAND) * Erdogan Dogdu, TOBB Teknoloji ve Ekonomi University (TURKEY) * Emmanouel Garoufallou, Dept. of Library Science and Information Systems, TEI of Thessaloniki (GREECE) * Inigo San Gil, Long Term Ecological Research Network (USA) *(to be confirmed)* * Nikos Houssos, National Documentation Centre (GREECE) * Carlos A. Iglesias, Universidad Politecnica de Madrid (SPAIN) * Pankaj Jaiswal, Oregon State University (USA) * Dimitris Kanellopoulos, University of Patras (GREECE) * Johannes Keizer, Food and Agriculture Organization of the United Nations (ITALY) *(to be confirmed)* * Christian Kop, University of Klangenfurt (AUSTRIA) * José Emilio Labra Gayo, University of Oviedo (SPAIN) * Manuel Lama Penín, Universidade de Santiago de Compostela (SPAIN) * Nikos Manouselis, Agro-Know Technologies (GREECE) * William Moen, University of North Texas (USA) * Xavier Ochoa, Centro de Tecnologías de Información Guayaquil (ECUADOR) * Matteo Palmonari, University of Milano-Bicocca (ITALY) * Laura Papaleo, University of Genova (ITALY) * Ricardo Colomo-Palacios, Universidad Carlos III, (SPAIN) * Marios Poulos, Ionian University (GREECE) * T. V. Prabhakar, Indian Institute of Technology Kanpur (INDIA) * Salvador Sanchez, University of Alcalá (SPAIN) * Pınar Senkul, METU (TURKEY) * Cleo Sgouropoulou, Technological Educational Institute of Athens (GREECE) * Aida Slavic, UDC Consortium (THE NETHERLANDS) *(to be confirmed)* * Shigeo Sugimoto, University of Tsukuba (JAPAN) * Hussein Suleman, University of Cape Town (SOUTH AFRICA) *(to be confirmed)* * Giovanni Tummarello, National University of Ireland (IRELAND) *(to be confirmed)* * Emma Tonkin, University of Bath (UNITED KINGDOM) * Hakan Tüzün, Hacettepe University (TURKEY) *(to be confirmed)* * Murat Osman Ünalır, Ege University (TURKEY) * Telmo Zarraonandia, Universidad Carlos III de Madrid (SPAIN) ## Social Media * Twitter [5] * Facebook [6] * ResearchGate [7] [1]: http://mtsr2012.uca.es [2]: https://www.easychair.org/conferences/?conf=mtsr2012 [3]: http://www.springer.com/series/7899 [4]: http://en.wikipedia.org/wiki/Cadiz [5]: https://twitter.com/#!/mtsr2012 [6]: http://www.facebook.com/groups/163806333647142/ [7]: http://www.researchgate.net/conference/Metadata_and_Semantics_Research/ -- Prof. Manuel Palomo Duarte, PhD Software Process Improvement and Formal Methods group (SPI&FM). Degree Coordinator for Computer Science. Department of Computer Science. Escuela Superior de Ingenieria. C/ Chile, 1 11002 - Cadiz (Spain) University of Cadiz http://neptuno.uca.es/~mpalomo Tlf: (+34) 956 015483 Mobile phone: (+34) 649 280080 Mobile phone from University network: 45483 Fax: (+34) 956 015139 Aviso legal: Este mensaje (incluyendo los ficheros adjuntos) puede contener información confidencial, dirigida a un destinatario y objetivo específico. Si usted no es el destinatario del mismo le pido disculpas, y le pido que elimine este correo, evitando cualquier divulgación, copia o distribución de su contenido, así como desarrollar o ejecutar cualquier acción basada en el mismo. -- Legal Notice: This message (including the attached files) contains confidential information, directed to a specific addressee and objective. In case you are not the addressee of the same, I apologize. And I ask you to delete this mail, and not to resend, copy or distribute its content, as well as develop or execute any action based on the same.

1 0

[Wikidata-l] Wikidata logo vote
by Lydia Pintscher 03 Jul '12

03 Jul '12

Hi folks! We've gotten a lot of great proposals for the Wikidata logo and now it's time to chose. All the details about the vote are here: http://blog.wikimedia.de/2012/07/03/wikidata-logo-its-time-to-pick-a-winner/ I'd be delighted if you all took part in the vote and we decide on a great logo for Wikidata. Cheers Lydia -- Lydia Pintscher - http://about.me/lydia.pintscher Community Communications for Wikidata Wikimedia Deutschland e.V. Obentrautstr. 72 10963 Berlin www.wikimedia.de Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V. Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter der Nummer 23855 Nz. Als gemeinnützig anerkannt durch das Finanzamt für Körperschaften I Berlin, Steuernummer 27/681/51985.

1 0

[Wikidata-l] Kayaking anyone?
by John Erling Blad 02 Jul '12

02 Jul '12

http://www.youtube.com/watch?v=hgy8c0GcFu8

2 1

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

Wikidata July 2012