New subject: [Wikidata-l] What is the point of labels?

6 Jun 2014

David,
I am not familiar with Wiktionary and its datamodel. But your summary 
looks like SKOS [1] would be a good fit. Also for your proposal to 
extend the Wikidata datamodel. In short, SKOS distinguishes between 
concepts (they carry the semantics ~ Q item) and labels (they are, well, 
just labels). Concepts and labels are connected via a handful of 
properties, e.g. skos:prefLabel or skos:altLabel. In ordinary SKOS 
labels are simple strings but in SKOS-XL (also part of the spec) they 
are objects (and thus can have properties and relations to other labels 
(or anything) etc.).

Furthermore, SKOS is extensible, i.e. it is based on RDF and one can 
define subclasses of skos:concept and skos-xl:label and one can define 
subproperties of skos:prefLabel and skos:altLabel with particular 
semantics, which might be relevant for Wikidata. With this some 
label-like wikidata-properties could be defined as subproperties of, 
say, skos:altLabel to have them show up in pick lists etc.

just my 2 cents,
   michael

[1] The spec: http://www.w3.org/TR/skos-reference/
       The primer: http://www.w3.org/TR/2009/NOTE-skos-primer-20090818

On 06.06.2014 14:00, wikidata-l-request(a)lists.wikimedia.org wrote:
...
  Message: 3
 Date: Thu, 5 Jun 2014 16:28:30 +0200
 From: David Cuenca&lt;dacuetu(a)gmail.com&gt;
 To: "Discussion list for the Wikidata project."
 	&lt;wikidata-l(a)lists.wikimedia.org&gt;
 Subject: [Wikidata-l] What is the point of labels?
 Message-ID:
 	&lt;CAJBSGSoO60AsQbUFkmefqvpE_miwFYxO2vs8jSeq0p0D82JChg(a)mail.gmail.com&gt;
 Content-Type: text/plain; charset="utf-8"

 When I drafted the functional structure that is appearing on items [1],
 Gerard pointed out that it is drifting into the lexical area. That made me
 think that while useful to have lexical data as an independent item as we
 discussed last year for Wiktionary, the current structure "q item <label>
 string" doesn't seem to be compatible with that wish, or at least it would
 be more difficult to maintain the same label twice. And it is not just one
 label per item, there are many, and each one might have different lexical
 properties.

 For more efficiency, it seems that we would need statements like "q item
 <label> lexical item" to reflect that separation, but that adds further
 complexity, because according to the latest Wikidata:Wiktionary proposal
 [2], the "lexical item" (W) also contains senses/meanings (S). This is
 recurrent, as we already have Q items as the basis for meaning... or at
 least a concept that is more or less shared among languages. The only
 difference between "Q items" and the proposed "S items" is that S
items
 represent only one of the lexeme meanings for one particular language, but
 other than that they have the same nature as Q items (it should be possible
 to add "subclass of" and other statements to them).

 Labels, aliases, and name properties are just normal statements where one
 of them is preferred, I have been wondering why don't we treat them as
 such... That way we could have some coherence, and have both "Q items" and
 "S items" as the units of meaning/sense and later on move the labels
 (lexemes), which now are strings, to the lexical items ("W items" in the
 example on the page Wikidata:Wiktionary).

 Summing up, labels in their current form make complete sense now, but when
 considered together with lexical information, it seems that it would be
 convenient to treat all of them as statements that later on could link with
 "W items". And as Joe pointed out, there are many more properties that are
 equivalent to a label, just more specific, and that now don't show up in
 the suggester, nor up above of the page where they should.

 I know that Wiktionary is still in the future and that there are many other
 priorities on the way, however since the representation of the items is
 being re-considered, I think it is a good moment to think about how to move
 little by little in the right direction. I also would like to point out
 that by keeping lexical information in wikidata, its complexity is going to
 increase inevitably. If new users already struggling to understand it now,
 I cannot imagine how will they cope with added elements...

 Micru

 [1]http://lists.wikimedia.org/pipermail/wikidata-l/2014-June/003941.html
 [2]https://www.wikidata.org/wiki/Wikidata:Wiktionary 

-- 

Dr. Michael Erdmann    |    erdmann(a)diqa-pm.com    |   +49 151 6140 1790
DIQA Projektmanagement GmbH | Pfinztalstr. 90 | 76227 Karlsruhe, Germany
Handelsregister: Amtsgericht Mannheim HRB 715454 | USt-IdNr: DE283037270
Geschäftsführer: Dr. Michael Erdmann,  Dipl.-Wirtsch.-Inf. Daniel Hansch

This email may contain confidential information. If you are not the 
intended recipient please notify the sender immediately and delete this 
email. Any unauthorized copying, disclosure or distribution of this 
email is strictly forbidden.

Re: [Wikidata-l] What is the point of labels?