Am 07.04.2017 um 01:34 schrieb Denny Vrandečić:
I foresee that might be a bit of a problem for external tools consuming this data - how they would figure out what language it is if it's doesn't have a code? We could of course generate fake codes like mis-x-q12345, maybe that would work.
Q-items for languages already have a property to state their language code. It's just an extra hop away.
We want ISO codes (or rather, IANA language subtags [1]), so we can use them in HTML lang attributes, and in RDF literals. This allows interoperability with standard tools.
For this reason, I also favor a mixed approach, that allows standard language tags to be used whenever possible. I have some ideas on how that could work, but no definite plan yet.
Something like de+Q1980305 could work; when generating HTML or RDF, we'd just drop the suffix. For transligual entries (e.g. the for number symbol i), we could use e.g. mis+Q1140046.
[1] https://www.iana.org/assignments/language-subtag-registry/language-subtag-re...