[Foundation-l] Languages and numbers

M. Williamson node.ue at gmail.com
Sun Jun 26 22:30:14 UTC 2011


Some of these actually already have Wikipedias:

Meadow Mari
Yakut (aka Sakha)
Lak
Balkar (aka Karachay-Balkar)
Yiddish, Eastern (= "standard" Yiddish, "Western Yiddish" is the one we are
missing but it has much fewer speakers; according to Ethnologue there are
only 5,400 around the world)

In addition, in another message you stated that we probably had Wikipedias
in every Sinitic language that was distinct enough from Mandarin to receive
an own Wikipedia; Min Bei has 10.3 million speakers and does not have a
Wikipedia and is definitely far removed from Mandarin; Xiang is also
probably deserving of its own Wikipedia and has 30 million+ speakers.


2011/6/24 Milos Rancic <millosh at gmail.com>

> While preparing Missing Wikipedias [1], I've got numbers of speakers and
> languages by area and country with chapter not covered by Wikipedias.
>
> Numbers are preliminary, some of them should be corrected. I didn't
> exclude Han languages, which mostly shouldn't be counted, and similar.
> Note, also, that every language should be analyzed separately. Many
> languages are spoken not just inside of one country.
>
> Please, fix errors and comment.
>
> * * *
>
> Areas. They approximate the usual definitions of areas, but they are
> different because of linguistic corrections.
>
> * Afro-Asiatic Area: Area where Afro-Asiatic languages are dominant.
> North Africa + Middle East + Sudan, Ethiopia, Eritrea and Somalia - Iran.
> * Europe: Europe (including Caucasus) includes Turkey.
> * South Asia: South Asia + Iran. Dominantly Indo-European and Dravidian
> languages.
> * Sub-Saharan Africa: The rest of Africa.
> * Polynesia, Australia and Oceania: Includes Malaysia and Taiwan
> (Taiwanese languages not covered in Wikipedias are dominantly
> Austronesian.)
> * East Asia: Han China "China (Central)", Korea and Japan.
> * South-East Asia: Includes non-Han south China "China (South)".
> * Latin America: Parts of America where Spanish and Portuguese are
> official languages.
> * Anglo-French America: Parts of America where English, French and Dutch
> are official languages.
> * North Asia: Asian part of former USSR, Mongolia and non-Han northern
> and western China "China (North)".
>
> The first column is number of speakers, the second number of languages,
> the third is area.
>
> 399259294 592 South Asia
> 353676706 1805 Sub-Saharan Africa
> 221855457 253 Afro-Asiatic Area
> 138979263 2198 Polynesia, Australia and Oceania
> 107363760 37 East Asia
> 99260271 447 South-East Asia
> 47901185 143 Europe
> 30361602 724 Latin America
> 8481452 227 Anglo-French America
> 3724384 45 North Asia
>
> * * *
>
> Countries with chapters. (Numbers are not fully correct, as they include
> some languages removed in the list below this one.)
>
> If any chapter (or interested group) is interested in full list of
> missing languages, I'll provide it by request before completing the
> work. I suppose that some chapters are interested in languages with less
> than 100K of speakers, as well.
>
> 296,097,274 349 India
> 71,356,176 681 Indonesia
> 46,676,395 157 Philippines
> 7,819,010 9 Germany
> 7,994,871 76 Russian Federation
> 5,386,580 5 Serbia
> 4,785,299 6 South Africa
> 2,841,300 17 Israel
> 1,139,750 4 Ukraine
> 1,085,931 125 United States
> 832,000 3 Netherlands
> 705,967 70 Canada
> 472,470 1 Czech Republic
> 375,704 17 Taiwan
> 313,642 6 Chile
> 246,900 3 United Kingdom
> 200,500 4 Spain
> 191,430 5 Poland
> 151,240 7 Sweden
> 132,809 12 Argentina
> 86,390 155 Australia
> 50,000 1 France
> 30,000 1 Hungary
> 29,980 4 Switzerland
> 17,460 5 Finland
> 15,000 1 Portugal
> 10,500 2 Norway
> 5,000 1 Denmark
> 4,500 1 Estonia
>
> Languages with more than million or more than 100,000 of speakers
> without Wikipedia and with chapter in the country:
>
> India (more than million)
> 38261000 Awadhi
> 34700000 Maithili
> 17500000 Chhattisgarhi
> 13000000 Magahi
> 13000000 Haryanvi
> 12800000 Deccan
> 10400000 Malvi
> 9500000 Kanauji
> 9000000 Dhundari
> 7760000 Bagheli
> 6970000 Varhadi-Nagpuri
> 6170900 Santali
> 6000000 Lambadi
> 5622600 Marwari
> 5000000 Mewati
> 4730000 Hadothi
> 4004490 Konkani
> 3900000 Merwari
> 3800000 Mina
> 3633900 Konkani, Goan
> 3000000 Shekhawati
> 3000000 Godwari
> 2920000 Garhwali
> 2680000 Indian Sign Language
> 2360000 Kumaoni
> 2110000 Dogri
> 2100000 Bagri
> 2094200 Kurux
> 2000000 Mewari
> 1970000 Sadri
> 1950000 Tulu
> 1950000 Gondi, Northern
> 1930000 Waddar
> 1710000 Wagdi
> 1700000 Kangri
> 1580000 Khandesi
> 1560280 Mundari
> 1543300 Bodo
> 1500000 Ho
> 1430000 Nimadi
> 1391000 Meitei
> 1300000 Bhili
> 1200000 Vasavi
> 1150000 Bhilali
> 1045000 Panjabi, Mirpur
> 1000000 Pahari, Mahasu
>
> Indonesia (more than million)
> 13600900 Madura
> 5530000 Minangkabau
> 3930000 Musi
> 3502300 Banjar
> 3330000 Bali
> 2700000 Betawi
> 2350000 Malay, Central
> 2100000 Sasak
> 2000000 Batak Toba
> 1880000 Malay, Makassar
> 1600000 Makasar
> 1200000 Batak Simalungun
> 1200000 Batak Dairi
> 1100000 Batak Mandailing
> 1000000 Malay, Jambi
>
> Philippines (more than 100k)
> 5770000 Hiligaynon
> 2500000 Bicolano, Central
> 1900000 Bicolano, Albay
> 1062000 Tausug
> 1000000 Maguindanao
> 776000 Maranao
> 639000 Capiznon
> 540000 Bontoc, Central
> 500000 Ibanag
> 395000 Inakeanon
> 378000 Kinaray-a
> 350000 Masbatenyo
> 345000 Surigaonon
> 319000 Sama, Southern
> 293000 Chavacano
> 234000 Bicolano, Iriga
> 200000 Romblomanon
> 200000 Bantoanon
> 185000 Sorsogon, Waray
> 150000 Kankanaey
> 150000 Blaan, Koronadal
> 147000 Davawenyo
> 140000 Subanen, Central
> 134000 Itawit
> 123000 Cuyonon
> 122000 Bicolano, Northern Catanduanes
> 111000 Ibaloi
> 107000 Yakan
> 100000 Philippine Sign Language
> 100000 Binukid
>
> Germany
> 4910000 Mainfränkisch
> 2000000 Saxon, Upper
> 819000 Swabian
>
> Russian Federation
> 783720 Lezgi
> 696630 Erzya
> 614000 Moksha
> 516490 Dargwa
> 499300 Adyghe
> 460090 Mari, Meadow
> 422550 Kumyk
> 413000 Ingush
> 363000 Yakut
> 264400 Tuva
> 217000 Komi-Zyrian
> 164420 Lak
> 128900 Tabassaran
> 113710 Balkar
>
> Serbia and Kosovo
> 4156090 Albanian, Gheg
> 709570 Romani, Balkan
> 318920 Romani, Sinte
> 172000 Romano-Serbian
>
> South Africa
> 4101000 Sotho, Northern
> 640000 Ndebele
>
> Israel
> 1762320 Yiddish, Eastern
> 352500 Arabic, Judeo-Tunisian
> 258930 Arabic, Judeo-Moroccan
> 110000 Bukharic
> 100130 Arabic, Judeo-Iraqi
>
> United States
> 600000 Hawai’i Creole English
> 250000 Sea Island Creole English
>
> Netherlands
> 592000 Gronings
> 220000 Zeeuws
>
> Canada
> 402900 Plautdietsch
>
> Czech Republic
> 472470 Romani, Carpathian
>
> Taiwan
> 138000 Amis
>
> Chile
> 300039 Mapudungun
>
> United Kingdom
> 202900 Angloromani
>
> Spain
> 102000 Spanish Sign Language
>
> Sweden
> 109600 Finnish, Tornedalen
>
> _______________________________________________
> foundation-l mailing list
> foundation-l at lists.wikimedia.org
> Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/foundation-l
>



More information about the wikimedia-l mailing list