Hi,
For Wikimedia Incubator, a list of ISO 639-1/3 language codes would be useful. I generated a PHP array based on http://sil.org/iso639-3/iso-639-3_20110525.tab, and I wanted to make an extension of that (which would be enabled on Incubator). FYI: the array contains 7706 ISO 639-3 codes, besides the ISO 639-1 codes.
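As a rough illustration of how such an array could be built from the tab file, here is a minimal Python sketch (not the PHP array itself). The column names (`Id`, `Part1`, `Scope`, `Ref_Name`, ...) are an assumption about the header row of the SIL file, and `load_iso639` and the inline three-row sample are hypothetical stand-ins for the real download.

```python
import csv
import io

# Tiny inline sample standing in for the SIL .tab file. The column names
# are an assumption about that file's header row.
SAMPLE_TAB = (
    "Id\tPart2B\tPart2T\tPart1\tScope\tLanguage_Type\tRef_Name\tComment\n"
    "ara\tara\tara\tar\tM\tL\tArabic\t\n"
    "arb\t\t\t\tI\tL\tStandard Arabic\t\n"
    "nld\tdut\tnld\tnl\tI\tL\tDutch\t\n"
)


def load_iso639(tab_text):
    """Return a dict keyed by ISO 639-3 code, with ISO 639-1 aliases."""
    by_code = {}
    reader = csv.DictReader(
        io.StringIO(tab_text), delimiter="\t", quoting=csv.QUOTE_NONE
    )
    for row in reader:
        entry = {
            "Part1": row["Part1"] or None,  # two-letter code, if any
            "Scope": row["Scope"],          # e.g. "I" or "M"
            "Name": row["Ref_Name"],
        }
        by_code[row["Id"]] = entry
        # Make the same entry reachable through its ISO 639-1 code.
        if entry["Part1"]:
            by_code[entry["Part1"]] = entry
    return by_code


codes = load_iso639(SAMPLE_TAB)
print(codes["ar"]["Name"])
```

Keying the same entry under both the three-letter and the two-letter code is only one possible design; separate lookup tables would work as well.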
Now, such a list already ships with the Babel extension, and I don't like duplication, so I was thinking: maybe it could be added to core? Besides these two extensions, it could provide language names for {{#language}}, which only works for languages known in MediaWiki's Names.php (unless the CLDR extension is enabled and the second parameter is used).
Actually, CLDR itself is incomplete, and our CLDR extension has extra language names that we added to it (which makes a third extension that could benefit from this).
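To make the idea of a shared fallback concrete, here is a small Python sketch of a name lookup that tries several sources in turn. The lookup order, the function name `language_name`, and all three tables are assumptions made purely for illustration; they are tiny hypothetical stand-ins, not MediaWiki's actual data or behaviour.

```python
# All three tables are tiny hypothetical stand-ins, not real data dumps.
NAMES_PHP = {"nl": "Nederlands"}               # names known to core
CLDR_NAMES = {"nl": "Dutch", "ar": "Arabic"}   # names from the CLDR extension
ISO639_NAMES = {"nl": "Dutch", "ar": "Arabic", "arb": "Standard Arabic"}


def language_name(code):
    """Return the first name found, or None if no source knows the code."""
    # Assumed order: core first, then CLDR, then the full ISO 639 list.
    for source in (NAMES_PHP, CLDR_NAMES, ISO639_NAMES):
        if code in source:
            return source[code]
    return None


print(language_name("arb"))
```

With a complete ISO 639 list as the last source, a code that neither core nor CLDR knows would still resolve to a name.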
And maybe there are more use cases... What do you think?
Regards, SPQRobin
Hoi, How are you going to deal with things like macrolanguages? As you know, some are valid for wiki use (like ara - ar). Thanks, Gerard
On 31 August 2011 18:05, Robin Pepermans robinp.1273@gmail.com wrote:
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
Each code has a scope. For macrolanguages this is listed as 'Scope' => 'M' in the array. For language names this doesn't matter. For Incubator, this is how I will get the data, so I will adapt the extension to find a good solution for this. (Currently it allows macrolanguages, as the data is not yet available, and I put a template on the page when a certain macrolanguage is *not* allowed, e.g. http://incubator.wikimedia.org/wiki/Wp/rom.)
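A use case that needs to treat macrolanguages separately could then check the scope field. The following Python sketch assumes entries shaped like the array described above; the allow-list `ALLOWED_MACROLANGUAGES` and both helper functions are hypothetical and only illustrate the idea of allowing some macrolanguages (such as ara) while rejecting others (such as rom).

```python
# Hypothetical entries in the shape described above (a 'Scope' per code).
ISO639 = {
    "ara": {"Scope": "M", "Name": "Arabic"},
    "arb": {"Scope": "I", "Name": "Standard Arabic"},
    "rom": {"Scope": "M", "Name": "Romany"},
    "nld": {"Scope": "I", "Name": "Dutch"},
}

# Assumed allow-list: macrolanguage codes that are accepted anyway.
ALLOWED_MACROLANGUAGES = {"ara"}


def is_macrolanguage(code):
    """True if the code is known and has scope 'M'."""
    entry = ISO639.get(code)
    return entry is not None and entry["Scope"] == "M"


def is_allowed(code):
    """Accept known codes, except macrolanguages not on the allow-list."""
    if code not in ISO639:
        return False
    return not is_macrolanguage(code) or code in ALLOWED_MACROLANGUAGES


for code in ("ara", "rom", "nld"):
    print(code, is_allowed(code))
```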
But this email is just about whether the ISO 639 data should be included in MediaWiki. Issues with macrolanguages are not really relevant; any use case that needs to handle macrolanguages separately would have the data available anyway.
2011/8/31 Gerard Meijssen gerard.meijssen@gmail.com:
On 1 September 2011 01:38, Robin Pepermans robinp.1273@gmail.com wrote:
On this general topic, I made a tool on the Toolserver a few years ago that pulls in lots of language code data from official sources, plus de facto data from MediaWiki sites and APIs, and provides various ways to query it via JSON.
It got a bit messy with just me hacking on it. It's used behind the scenes in some Wiktionary stuff, but I don't think anybody else took to it. It, or some of its modi operandi, may or may not be of interest.
Andrew Dunbar (hippietrail)