Re: [Wikidata] Languages in Wikidata4Wiktionary

8 Apr 2017

      Am 07.04.2017 um 01:34 schrieb Denny Vrandečić:
...
I foresee that might be a bit of a problem for external tools consuming
this data - how they would figure out what language it is if it's
doesn't have a code? We could of course generate fake codes like
mis-x-q12345, maybe that would work.

Q-items for languages already have a property to state their language code. It's
just an extra hop away.
We want ISO codes (or rather, IANA language subtags [1]), so we can use them in
HTML lang attributes, and in RDF literals. This allows interoperability with
standard tools.
For this reason, I also favor a mixed approach, that allows standard language
tags to be used whenever possible. I have some ideas on how that could work, but
no definite plan yet.
Something like de+Q1980305 could work; when generating HTML or RDF, we'd just
drop the suffix. For transligual entries (e.g. the for number symbol i), we
could use e.g. mis+Q1140046.
[1]
https://www.iana.org/assignments/language-subtag-registry/language-subtag-re...
-- 
Daniel Kinzler
Principal Platform Engineer

Wikimedia Deutschland
Gesellschaft zur Förderung Freien Wissens e.V.

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

Re: [Wikidata] Languages in Wikidata4Wiktionary