Hi Stephen,
in your CSV [1] I count: 312 WP-languages
so WP-languages is between 277, 294, 312 and 360 ;-)
Hi Denny,
use case is the 'Hydrographic Dictionary' (HD) of the 'International Hydrographic Organization' (IHO). 80 nations as members of IHO. Content of HD are nautical terms and a corresponding description/definition.
The idea is to free the HD, and to share it with WD/WP, so crowdsourcing can help to improve and append it, and to translate it to more languages :-)
We prepare an editing tool. For to select a language, we will use a list. This list should contain the WP-languages. The IHO will decide, which of this languages should be prioritized.
My hope is to find a such list... (yes, I knew: the WP-language-list is 'in process' but I hope it is stable for some days/weeks and I find a valid and complete list somewhere in the WM-space)
Best regards, Markus
Am 30.09.2016 um 20:45 schrieb Denny Vrandečić:
Markus, it really depends on what you mean with "a list of all Wikimedia languages". That is why you get different numbers.
Usually, you will have a use case for this list, and depending on that use case you should select the languages you really care about. Besides looking good in some marketing terms, having a complete list from a random collection is barely a real use case. Which languages do you want to list, what is this list for, and what will happen if a specific language is within that list or not?
Just my 2c
On Fri, Sep 30, 2016 at 10:04 AM Stephen Niedzielski <sniedzielski@wikimedia.org mailto:sniedzielski@wikimedia.org> wrote:
I apologize but I'm coming into this thread a little late. FWIW, I wanted to mention that this is what we use to generate a list of languages[0] for the Wikipedia Android app. Less to your question but similar, we also generate localized main page title and site name[1], and namespace titles[2]. There's also a related thread in T121936[3] discussing the localized name of all Wikimedia sites. [0] https://wikistats.wmflabs.org/api.php?action=dump&table=wikipedias&format=csv&s=good [1] https://es.wikipedia.org/w/api.php?action=query&meta=siteinfo&format=json&siprop=general [2] https://fr.wikipedia.org/w/api.php?action=query&meta=siteinfo&format=json&siprop=namespaces [3] https://phabricator.wikimedia.org/T121936 On Fri, Sep 30, 2016 at 3:21 AM, Markus Bärlocher <markus.baerlocher@lau-net.de <mailto:markus.baerlocher@lau-net.de>> wrote: Thanks Denny for this: > It's based on the query by Jérémie a few mails back. > http://tinyurl.com/h2zky8p Yes this helps very much :-) But there are some inconsistencies: 360 WP-languages from your query 294 WP-languages in https://meta.wikimedia.org/wiki/List_of_Wikipedias 277 WP-languages in https://de.wikipedia.org/wiki/Wikipedia:Sprachen 73 'localname' in the query are empty 9 'name' in the query are double (ell, gsw, lzh, mrj, nan, ron, rup, sgs, vro) There are some problems with characters/charset when I copy&paste the query result to Excel: Burmese, Gothic, Northern Thai, shows only squares as 'local language' But copying after the squares from Excel to Thunderbird works: Burmese မြန်မာဘာသာစကား mya Gothic language 𐌲𐌿𐍄𐌰𐍂𐌰𐌶𐌳𐌰 got Northern Thai ᨣᩴᩤᨾᩮᩥᩬᨦ nod All the rest works fine :-) It would be good to work out a valid and complete table of the WP-languages... Best regards, Markus > Until now I have a CSV with 294 languages in two columns: > - name of language in English > - name of language local in local writing system > > I look for: > - ISO-639-3 short code (corresponding to the names) > > Best regards, > Markus