[Wikimediaindia-l] Fwd: [Foundation-l] Languages and numbers

Pranesh Prakash pranesh at cis-india.org
Sat Jun 25 17:01:26 UTC 2011



Bishakha Datta <bishakhadatta at gmail.com> wrote:

>Very interesting; lists missing wikipedias in Indic languages too.
>
>Best
>Bishakha
>
>---------- Forwarded message ----------
>From: Milos Rancic <millosh at gmail.com>
>Date: Sat, Jun 25, 2011 at 10:22 AM
>Subject: [Foundation-l] Languages and numbers
>To: Wikimedia Foundation Mailing List <foundation-l at lists.wikimedia.org>
>
>
>While preparing Missing Wikipedias [1], I've got numbers of speakers and
>languages by area and country with chapter not covered by Wikipedias.
>
>Numbers are preliminary, some of them should be corrected. I didn't
>exclude Han languages, which mostly shouldn't be counted, and similar.
>Note, also, that every language should be analyzed separately. Many
>languages are spoken not just inside of one country.
>
>Please, fix errors and comment.
>
>* * *
>
>Areas. They approximate the usual definitions of areas, but they are
>different because of linguistic corrections.
>
>* Afro-Asiatic Area: Area where Afro-Asiatic languages are dominant.
>North Africa + Middle East + Sudan, Ethiopia, Eritrea and Somalia - Iran.
>* Europe: Europe (including Caucasus) includes Turkey.
>* South Asia: South Asia + Iran. Dominantly Indo-European and Dravidian
>languages.
>* Sub-Saharan Africa: The rest of Africa.
>* Polynesia, Australia and Oceania: Includes Malaysia and Taiwan
>(Taiwanese languages not covered in Wikipedias are dominantly Austronesian.)
>* East Asia: Han China "China (Central)", Korea and Japan.
>* South-East Asia: Includes non-Han south China "China (South)".
>* Latin America: Parts of America where Spanish and Portuguese are
>official languages.
>* Anglo-French America: Parts of America where English, French and Dutch
>are official languages.
>* North Asia: Asian part of former USSR, Mongolia and non-Han northern
>and western China "China (North)".
>
>The first column is number of speakers, the second number of languages,
>the third is area.
>
>399259294 592 South Asia
>353676706 1805 Sub-Saharan Africa
>221855457 253 Afro-Asiatic Area
>138979263 2198 Polynesia, Australia and Oceania
>107363760 37 East Asia
>99260271 447 South-East Asia
>47901185 143 Europe
>30361602 724 Latin America
>8481452 227 Anglo-French America
>3724384 45 North Asia
>
>* * *
>
>Countries with chapters. (Numbers are not fully correct, as they include
>some languages removed in the list below this one.)
>
>If any chapter (or interested group) is interested in full list of
>missing languages, I'll provide it by request before completing the
>work. I suppose that some chapters are interested in languages with less
>than 100K of speakers, as well.
>
>296,097,274 349 India
>71,356,176 681 Indonesia
>46,676,395 157 Philippines
>7,819,010 9 Germany
>7,994,871 76 Russian Federation
>5,386,580 5 Serbia
>4,785,299 6 South Africa
>2,841,300 17 Israel
>1,139,750 4 Ukraine
>1,085,931 125 United States
>832,000 3 Netherlands
>705,967 70 Canada
>472,470 1 Czech Republic
>375,704 17 Taiwan
>313,642 6 Chile
>246,900 3 United Kingdom
>200,500 4 Spain
>191,430 5 Poland
>151,240 7 Sweden
>132,809 12 Argentina
>86,390 155 Australia
>50,000 1 France
>30,000 1 Hungary
>29,980 4 Switzerland
>17,460 5 Finland
>15,000 1 Portugal
>10,500 2 Norway
>5,000 1 Denmark
>4,500 1 Estonia
>
>Languages with more than million or more than 100,000 of speakers
>without Wikipedia and with chapter in the country:
>
>India (more than million)
>38261000 Awadhi
>34700000 Maithili
>17500000 Chhattisgarhi
>13000000 Magahi
>13000000 Haryanvi
>12800000 Deccan
>10400000 Malvi
>9500000 Kanauji
>9000000 Dhundari
>7760000 Bagheli
>6970000 Varhadi-Nagpuri
>6170900 Santali
>6000000 Lambadi
>5622600 Marwari
>5000000 Mewati
>4730000 Hadothi
>4004490 Konkani
>3900000 Merwari
>3800000 Mina
>3633900 Konkani, Goan
>3000000 Shekhawati
>3000000 Godwari
>2920000 Garhwali
>2680000 Indian Sign Language
>2360000 Kumaoni
>2110000 Dogri
>2100000 Bagri
>2094200 Kurux
>2000000 Mewari
>1970000 Sadri
>1950000 Tulu
>1950000 Gondi, Northern
>1930000 Waddar
>1710000 Wagdi
>1700000 Kangri
>1580000 Khandesi
>1560280 Mundari
>1543300 Bodo
>1500000 Ho
>1430000 Nimadi
>1391000 Meitei
>1300000 Bhili
>1200000 Vasavi
>1150000 Bhilali
>1045000 Panjabi, Mirpur
>1000000 Pahari, Mahasu
>
>Indonesia (more than million)
>13600900 Madura
>5530000 Minangkabau
>3930000 Musi
>3502300 Banjar
>3330000 Bali
>2700000 Betawi
>2350000 Malay, Central
>2100000 Sasak
>2000000 Batak Toba
>1880000 Malay, Makassar
>1600000 Makasar
>1200000 Batak Simalungun
>1200000 Batak Dairi
>1100000 Batak Mandailing
>1000000 Malay, Jambi
>
>Philippines (more than 100k)
>5770000 Hiligaynon
>2500000 Bicolano, Central
>1900000 Bicolano, Albay
>1062000 Tausug
>1000000 Maguindanao
>776000 Maranao
>639000 Capiznon
>540000 Bontoc, Central
>500000 Ibanag
>395000 Inakeanon
>378000 Kinaray-a
>350000 Masbatenyo
>345000 Surigaonon
>319000 Sama, Southern
>293000 Chavacano
>234000 Bicolano, Iriga
>200000 Romblomanon
>200000 Bantoanon
>185000 Sorsogon, Waray
>150000 Kankanaey
>150000 Blaan, Koronadal
>147000 Davawenyo
>140000 Subanen, Central
>134000 Itawit
>123000 Cuyonon
>122000 Bicolano, Northern Catanduanes
>111000 Ibaloi
>107000 Yakan
>100000 Philippine Sign Language
>100000 Binukid
>
>Germany
>4910000 Mainfränkisch
>2000000 Saxon, Upper
>819000 Swabian
>
>Russian Federation
>783720 Lezgi
>696630 Erzya
>614000 Moksha
>516490 Dargwa
>499300 Adyghe
>460090 Mari, Meadow
>422550 Kumyk
>413000 Ingush
>363000 Yakut
>264400 Tuva
>217000 Komi-Zyrian
>164420 Lak
>128900 Tabassaran
>113710 Balkar
>
>Serbia and Kosovo
>4156090 Albanian, Gheg
>709570 Romani, Balkan
>318920 Romani, Sinte
>172000 Romano-Serbian
>
>South Africa
>4101000 Sotho, Northern
>640000 Ndebele
>
>Israel
>1762320 Yiddish, Eastern
>352500 Arabic, Judeo-Tunisian
>258930 Arabic, Judeo-Moroccan
>110000 Bukharic
>100130 Arabic, Judeo-Iraqi
>
>United States
>600000 Hawai’i Creole English
>250000 Sea Island Creole English
>
>Netherlands
>592000 Gronings
>220000 Zeeuws
>
>Canada
>402900 Plautdietsch
>
>Czech Republic
>472470 Romani, Carpathian
>
>Taiwan
>138000 Amis
>
>Chile
>300039 Mapudungun
>
>United Kingdom
>202900 Angloromani
>
>Spain
>102000 Spanish Sign Language
>
>Sweden
>109600 Finnish, Tornedalen
>
>_______________________________________________
>foundation-l mailing list
>foundation-l at lists.wikimedia.org
>Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/foundation-l
>
>_______________________________________________
>Wikimediaindia-l mailing list
>Wikimediaindia-l at lists.wikimedia.org
>https://lists.wikimedia.org/mailman/listinfo/wikimediaindia-l


More information about the Wikimediaindia-l mailing list