Hoi, There are many valid possibilities to describe something that is not a language and language used may represent a language that does not have a language code. There is a standard for indicating languages; it allows for something like "US-American Spanish" by combining a country and a language code. This is well known.
The problem with everything that has not been recognised / standardised / defined as a language is that it is highly political. The practical side is that we can use an x in a code to indicate a special use. However, then calling it a language is problematic because a language ought to mean that its understanding is mutually exclusive.
Calling it a language code and use "expressed in" would imho work for any form of language. When Wiktionaries content is imported in Wikidata, we first have to have these languages codes agreed on. To first import the bulk is no problem. It puts pressure on the resolution of such issues and that is not half bad. Thanks, GerardM
On 7 April 2017 at 01:34, Denny Vrandečić vrandecic@gmail.com wrote:
On Thu, Apr 6, 2017, 16:16 Stas Malyshev smalyshev@wikimedia.org wrote:
Hi!
- use Q-Items instead of UserLanguageCodes for Multilingual texts (which
would be quite a migration)
I foresee that might be a bit of a problem for external tools consuming this data - how they would figure out what language it is if it's doesn't have a code? We could of course generate fake codes like mis-x-q12345, maybe that would work.
Q-items for languages already have a property to state their language code. It's just an extra hop away.
I don't think restricting Wiktionary4Wikidata support to the list of languages with a UserLanguageCode is a viable solution, which would happen if we implement the data model as currently suggested, if I understand it correctly.
Aren't we limiting it right now this way in Wikidata?
For labels and descriptions of items yes, and I think that was sensible. It might be time to revisit that decision though.
But for supporting Wiktionary that would be extremely limiting. French Wiktionary supports words in more than a thousand languages currently. Limiting the supported languages of the lemmas is, IMHO, unacceptable.
-- Stas Malyshev smalyshev@wikimedia.org
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata