Am 07.04.2017 um 01:34 schrieb Denny Vrandečić:
> I foresee that might be a bit of a problem for external tools consuming
> this data - how they would figure out what language it is if it's
> doesn't have a code? We could of course generate fake codes like
> mis-x-q12345, maybe that would work.
>
> Q-items for languages already have a property to state their language code. It's
> just an extra hop away.
We want ISO codes (or rather, IANA language subtags [1]), so we can use them in
HTML lang attributes, and in RDF literals. This allows interoperability with
standard tools.
For this reason, I also favor a mixed approach, that allows standard language
tags to be used whenever possible. I have some ideas on how that could work, but
no definite plan yet.
Something like de+Q1980305 could work; when generating HTML or RDF, we'd just
drop the suffix. For transligual entries (e.g. the for number symbol i), we
could use e.g. mis+Q1140046.
[1]
https://www.iana.org/assignments/language-subtag- registry/language-subtag- registry
--
Daniel Kinzler
Principal Platform Engineer
Wikimedia Deutschland
Gesellschaft zur Förderung Freien Wissens e.V.
_______________________________________________
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata