If the page redirect titles exist in Wikipedia, they are valid in Wikidata
as data, regardless of how they rate in *your view* of "quality". If
cleanup needs to be done, it should be done in the source first.
Evaluating the value of a specific "alias" for a Wikidata item is a
judgment that should be based entirely on a *referenceable* data source.
Wikidata aliases (as well as descriptions and preferred labels) are
completely arbitrary and unreferenced and, in my judgment, worthless
without a primary source or a clearly defined semantic relationship.
Judgmental curation of Wikidata is, in fact, not that useful. Wikidata
should simply seek to represent data *as it exists* (errors or not) in the
primary source.
Furthermore, you apparently do not see why skos:hiddenLabel exists. Whether
you feel that it is worthwhile is not relevant to its primary function,
which is to facilitate searching. (see
)
And it is not difficult to argue that searching in Wikidata could use
improvement.
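To make the searching point concrete, here is a minimal sketch of how a hidden label can feed a search index without ever being displayed. The Concept class, the "ex:1" identifier, and the sample labels are all hypothetical and only illustrate the idea; nothing here is Wikidata's actual search code.

```python
from dataclasses import dataclass, field


@dataclass
class Concept:
    """Hypothetical stand-in for an item carrying SKOS-style labels."""
    concept_id: str
    pref_label: str
    alt_labels: list = field(default_factory=list)
    hidden_labels: list = field(default_factory=list)

    def display_labels(self):
        # Hidden labels are deliberately excluded from anything shown to users.
        return [self.pref_label] + list(self.alt_labels)


def build_search_index(concepts):
    """Map every label, hidden ones included, to the ids of matching concepts."""
    index = {}
    for concept in concepts:
        labels = [concept.pref_label] + concept.alt_labels + concept.hidden_labels
        for label in labels:
            index.setdefault(label.casefold(), set()).add(concept.concept_id)
    return index


def search(index, text):
    """Case-insensitive exact-match lookup; returns a sorted list of ids."""
    return sorted(index.get(text.casefold(), set()))


# Illustrative data only: a misspelling stored as a hidden label.
river = Concept("ex:1", "Mississippi", ["Mississippi River"], ["Missisippi"])
index = build_search_index([river])
```

With this layout, search(index, "missisippi") finds the concept, while river.display_labels() never exposes the misspelling, so the UI stays uncluttered.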
On 16 March 2016 at 13:00, <wikidata-tech-request(a)lists.wikimedia.org>
wrote:
Today's Topics:
1. Re: Wikipedia Page Redirect Titles in Wikidata (Lydia Pintscher)
----------------------------------------------------------------------
Message: 1
Date: Tue, 15 Mar 2016 16:49:40 +0000
From: Lydia Pintscher <Lydia.Pintscher(a)wikimedia.de>
To: wikidata-tech(a)lists.wikimedia.org
Subject: Re: [Wikidata-tech] Wikipedia Page Redirect Titles in
Wikidata
On Sat, Mar 12, 2016 at 2:14 PM Christopher Johnson <
christopher.johnson(a)wikimedia.de> wrote:
Hi,
I am developing a scientific terms thesaurus and have discovered that
existing Wikipedia "page redirect titles" provide a useful way to resolve
an odd or archaic form to a "canonical" term label as it is represented by
the Wikipedia page title (aka Wikidata "sitelink"). For example,
https://en.wikipedia.org//w/api.php?action=query&format=xml&prop=re…
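A minimal sketch of such a lookup, under two assumptions that are not in the original message: that the truncated URL above uses the MediaWiki API's prop=redirects module, and that JSON is requested instead of XML for easier parsing. The function names are made up for illustration, and continuation of long result sets is not handled.

```python
from urllib.parse import urlencode

API_ENDPOINT = "https://en.wikipedia.org/w/api.php"


def redirects_query_url(title, limit=500):
    """Build a query URL listing the redirects that point at `title`."""
    params = {
        "action": "query",
        "format": "json",      # assumption: JSON rather than the XML used above
        "prop": "redirects",   # assumption: the module behind the truncated URL
        "titles": title,
        "rdlimit": limit,
    }
    return API_ENDPOINT + "?" + urlencode(params)


def redirect_titles(response):
    """Collect redirect titles from an already decoded JSON response."""
    titles = []
    for page in response.get("query", {}).get("pages", {}).values():
        for redirect in page.get("redirects", []):
            titles.append(redirect["title"])
    return titles


# Hand-written sample in the default JSON layout, not real API output.
sample_response = {
    "query": {
        "pages": {
            "1": {
                "title": "Sulfur",
                "redirects": [{"title": "Sulphur"}, {"title": "Brimstone"}],
            }
        }
    }
}
```

Fetching the URL and decoding the body is left out on purpose, so the sketch runs without network access; redirect_titles(sample_response) shows the parsing step on its own.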
In Wikidata, these "page redirect titles" are not represented in the data
model except very inconsistently and sparsely as skos:altLabel (or
"alias"). My use case is that I would like to be able to query Wikidata
for these page redirect titles in order to resolve odd multilingual
names to a single concept.
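As a rough illustration of that use case, the sketch below builds a SPARQL query that resolves an alias to the items carrying it, assuming an endpoint that exposes aliases as skos:altLabel. The helper name alias_lookup_query is hypothetical, matching is exact only, and the language tag is assumed to come from a trusted source.

```python
def alias_lookup_query(alias, language="en"):
    """Return a SPARQL query selecting items whose skos:altLabel equals `alias`."""
    # Escape backslashes and double quotes so the alias is a valid string literal.
    escaped = alias.replace("\\", "\\\\").replace('"', '\\"')
    return (
        "PREFIX skos: <http://www.w3.org/2004/02/skos/core#>\n"
        "SELECT ?item WHERE {\n"
        f'  ?item skos:altLabel "{escaped}"@{language} .\n'
        "}"
    )


query = alias_lookup_query("Sulphur")
```

The resulting string can be sent to whichever SPARQL endpoint is in use; an alias that was never recorded simply returns no rows, which is exactly the sparseness problem described above.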
My question is: if I were to create a bot that imported all "page
redirect titles" for a given sitelink and created them with the
skos:altLabel property en masse, would this be a valid semantic
relationship? Or should it rather be represented as
?sitelink owl:sameAs <page redirect URI>? Or both?
Furthermore, in some cases (e.g. misspellings), skos:hiddenLabel may be
more appropriate, but this has no definition in the data model. There
would potentially be a lot of clutter in the UI without a hiddenLabel
alias property. Also, there are no types for page redirects in Wikipedia,
afaik.
Additional value for searching in the Wikidata UI could probably be
obtained from indexing these alternate page titles as well.
There are several points to address:
1) Should redirects from Wikipedia be imported as aliases on Wikidata? No.
This has been done before and created a massive amount of cleanup work
because the redirects contained a lot of meaningless misspellings and
more. Please do not import them to Wikidata without approval through the
bot approval process and clear quality control.
2) Should we allow more fine-grained distinction between real aliases and
misspellings in the UI and datamodel? No. I don't believe this is worth the
complexity and resulting discussions/edit wars and more.
Cheers
Lydia
--
Lydia Pintscher -
http://about.me/lydia.pintscher
Product Manager for Wikidata
Wikimedia Deutschland e.V.
Tempelhofer Ufer 23-24
10963 Berlin
www.wikimedia.de
Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e. V.
(Society for the Promotion of Free Knowledge). Registered in the register
of associations of the Amtsgericht Berlin-Charlottenburg under number
23855 Nz. Recognized as a charitable organization by the Finanzamt für
Körperschaften I Berlin, tax number 27/029/42207.