---------------------------- Original Message ----------------------------
Subject: [Wikitech-l] Help wanted: Wikimedia engineering/product hiring
From: "Erik Moeller" <erik(a)wikimedia.org>
Date: Wed, June 29, 2011 7:24 am
To: "Wikimedia developers" <wikitech-l(a)lists.wikimedia.org>
we have 12 open positions in Wikimedia engineering right now, and a
few more will go up in coming weeks. If you have connections, we'd
appreciate your help hiring for these roles, forwarding this note to
appropriate listservs, tweeting it, etc.
The full listing is at
http://wikimediafoundation.org/wiki/Job_openings , but here are some
key roles that we need help with:
1) Product Managers: We're looking for great people to act as product
owners for three very important initiatives:
- New editor engagement: helping Wikimedia to attract, nurture and
retain new contributors.
- Analytics: supporting the development of systems and tools for
measuring our impact.
- Mobile: helping us reach hundreds of millions of people on mobile
devices and engaging hundreds of thousands of them to contribute in
The product manager role at WMF entails grooming and prioritizing the
product backlog, engaging with the Wikimedia community, commissioning
and organizing research, hands-on testing, but also helping with
across-the-board priorities triage for ongoing product development.
These folks don't have to be product managers by trade, but they need
to be comfortable negotiating compromise while holding the product
vision. They need to treat engineers as equal partners, and be
excellent communicators. Ideally they have strong domain expertise
relevant to their focus area.
We have two of these currently posted:
The mobile one will go up soon, and we'll refine the definition
further. But please use these as reference points for now.
2) Analytics Engineers: We're hiring for two systems engineers to
build out our analytics infrastructure. What exists so far is still
fairly rudimentary, so we need to build scalable logging and tracking
systems for various purposes, e.g.
- geographic breakdown of access and editing activity
- usage data for specific features; A/B testing of features
- search activity, real-time editor retention measures, new activity
visualizations, and more ..
The ideal candidate here likely is someone who's very strong building
out large scale distributed systems, and has experience with NoSQL
technologies, distributed computing, etc.
The relevant JD is here:
3) A strong QA Lead who can help us write and perform test plans with
shoestring and duct tape, i.e. using a combination of test automation,
work with outside vendors, and volunteer-driven testing to strengthen
our product quality. The relevant JD is here:
4) Strong frontend and backend engineers: for features development,
code review, deployment and release management support, and so forth.
Demonstrable open source experience is always a major plus, and while
PHP is learnable, not being predisposed against it helps. :-)
Your outreach and support is always appreciated.
Deputy Director, Wikimedia Foundation
Support Free Knowledge: http://wikimediafoundation.org/wiki/Donate
Wikitech-l mailing list
It has been pointed out that while the Meta page on WikiConference India
2011 has been updated regarding a few appointments for the Working Group,
this information has not been given to the community.
During the first Working Group meeting held on 19 Jun 2011 at Mumbai, three
functional appointments were decided upon so that the Working Group has a
conventional face to present to the world. The three positions decided upon
were Chair, Secretary and Treasurer.
Some names for these appointments were proposed and accepted. The following
were asked to carry out the functions -
* Chair - Pranav Curumsey
* Secretary - yours truly
* Treasurer - Pradeep Mohandas
As such, these are seen as working appointments only and carry no perks or
status different from others in the Working Group aside from being so named.
Thought you would like to know.
What we do need are the good wishes and the constant support of the
community at all times. Please do advice us and help us to come to important
decisions. Discussions with fewer participants are less likely to have good
solutions to issues than discussions with more participation.
Also, the working group needs many more volunteers even now. We need people
to come forth for logistics, scholarship coordination, speaker management
and other so vital though unglamorous tasks. Without the solid foundation
provided by these extremely important teams no program devised by the
Program Team can succeed even if it were the best program in the world.
So sign up today and lend a hand please.
Bishakha Datta <bishakhadatta(a)gmail.com> wrote:
>Very interesting; lists missing wikipedias in Indic languages too.
>---------- Forwarded message ----------
>From: Milos Rancic <millosh(a)gmail.com>
>Date: Sat, Jun 25, 2011 at 10:22 AM
>Subject: [Foundation-l] Languages and numbers
>To: Wikimedia Foundation Mailing List <foundation-l(a)lists.wikimedia.org>
>While preparing Missing Wikipedias , I've got numbers of speakers and
>languages by area and country with chapter not covered by Wikipedias.
>Numbers are preliminary, some of them should be corrected. I didn't
>exclude Han languages, which mostly shouldn't be counted, and similar.
>Note, also, that every language should be analyzed separately. Many
>languages are spoken not just inside of one country.
>Please, fix errors and comment.
>* * *
>Areas. They approximate the usual definitions of areas, but they are
>different because of linguistic corrections.
>* Afro-Asiatic Area: Area where Afro-Asiatic languages are dominant.
>North Africa + Middle East + Sudan, Ethiopia, Eritrea and Somalia - Iran.
>* Europe: Europe (including Caucasus) includes Turkey.
>* South Asia: South Asia + Iran. Dominantly Indo-European and Dravidian
>* Sub-Saharan Africa: The rest of Africa.
>* Polynesia, Australia and Oceania: Includes Malaysia and Taiwan
>(Taiwanese languages not covered in Wikipedias are dominantly Austronesian.)
>* East Asia: Han China "China (Central)", Korea and Japan.
>* South-East Asia: Includes non-Han south China "China (South)".
>* Latin America: Parts of America where Spanish and Portuguese are
>* Anglo-French America: Parts of America where English, French and Dutch
>are official languages.
>* North Asia: Asian part of former USSR, Mongolia and non-Han northern
>and western China "China (North)".
>The first column is number of speakers, the second number of languages,
>the third is area.
>399259294 592 South Asia
>353676706 1805 Sub-Saharan Africa
>221855457 253 Afro-Asiatic Area
>138979263 2198 Polynesia, Australia and Oceania
>107363760 37 East Asia
>99260271 447 South-East Asia
>47901185 143 Europe
>30361602 724 Latin America
>8481452 227 Anglo-French America
>3724384 45 North Asia
>* * *
>Countries with chapters. (Numbers are not fully correct, as they include
>some languages removed in the list below this one.)
>If any chapter (or interested group) is interested in full list of
>missing languages, I'll provide it by request before completing the
>work. I suppose that some chapters are interested in languages with less
>than 100K of speakers, as well.
>296,097,274 349 India
>71,356,176 681 Indonesia
>46,676,395 157 Philippines
>7,819,010 9 Germany
>7,994,871 76 Russian Federation
>5,386,580 5 Serbia
>4,785,299 6 South Africa
>2,841,300 17 Israel
>1,139,750 4 Ukraine
>1,085,931 125 United States
>832,000 3 Netherlands
>705,967 70 Canada
>472,470 1 Czech Republic
>375,704 17 Taiwan
>313,642 6 Chile
>246,900 3 United Kingdom
>200,500 4 Spain
>191,430 5 Poland
>151,240 7 Sweden
>132,809 12 Argentina
>86,390 155 Australia
>50,000 1 France
>30,000 1 Hungary
>29,980 4 Switzerland
>17,460 5 Finland
>15,000 1 Portugal
>10,500 2 Norway
>5,000 1 Denmark
>4,500 1 Estonia
>Languages with more than million or more than 100,000 of speakers
>without Wikipedia and with chapter in the country:
>India (more than million)
>3633900 Konkani, Goan
>2680000 Indian Sign Language
>1950000 Gondi, Northern
>1045000 Panjabi, Mirpur
>1000000 Pahari, Mahasu
>Indonesia (more than million)
>2350000 Malay, Central
>2000000 Batak Toba
>1880000 Malay, Makassar
>1200000 Batak Simalungun
>1200000 Batak Dairi
>1100000 Batak Mandailing
>1000000 Malay, Jambi
>Philippines (more than 100k)
>2500000 Bicolano, Central
>1900000 Bicolano, Albay
>540000 Bontoc, Central
>319000 Sama, Southern
>234000 Bicolano, Iriga
>185000 Sorsogon, Waray
>150000 Blaan, Koronadal
>140000 Subanen, Central
>122000 Bicolano, Northern Catanduanes
>100000 Philippine Sign Language
>2000000 Saxon, Upper
>460090 Mari, Meadow
>Serbia and Kosovo
>4156090 Albanian, Gheg
>709570 Romani, Balkan
>318920 Romani, Sinte
>4101000 Sotho, Northern
>1762320 Yiddish, Eastern
>352500 Arabic, Judeo-Tunisian
>258930 Arabic, Judeo-Moroccan
>100130 Arabic, Judeo-Iraqi
>600000 Hawai’i Creole English
>250000 Sea Island Creole English
>472470 Romani, Carpathian
>102000 Spanish Sign Language
>109600 Finnish, Tornedalen
>foundation-l mailing list
>Wikimediaindia-l mailing list
English Wikipedia is expected to get this gadget enabled for all users to
encourage Users pat on each other to motivate editors. Translating the
initiative to Indic Wikipedia's will encourage new users motivated when
people give these "awards". Tamil Wikipedia already has the older version of
WikiLove in use and might soon be using the newer version. Customizing the
culture is also a key, Not sure if people would love "Beer" here.
With the deadline for submitting LOGO designs for WikiConference lapsing today, we now request everyone to vote for the design they would like to see as the logo of WikiConference India from the eight designs submitted. Deadline for voting is 2.30PM on 29 June 2011 (48 hours from now).
You see all the designs and vote here: http://meta.wikimedia.org/wiki/Talk:WikiConference_India_2011#Logo
Some users have reported facing problems in accessing wikimedia.in on few
networks. These were analysed to be arising from some network routing
changes in the internet service providers. There is no problem with
wikimedia.in site and connectivity to internet.
The problems are expected to be resolved, as the routing tables updates
spread through the internet routers.
We need to decide by tomorrow night (21 Jun 2011) 2200 hrs IST (arbitrary,
for the sake of speeding things up) as to what the conference is to be
Four options have been listed - WikiMela, WikiConference India, WikiUtsav &
Wikimania India. A few people have already voted but we want wider input
just to be sure.
So please add your vote at -
As of now, WikiConference India - the default placeholder we used seems to
be the winner, but we want to be pretty sure, you guys dont want a name with
Indian overtones such as WikiUtsav or WikiMela.
Tomorrow night at 2200 hrs, the supports for/against will be counted for
each & the result announced so we can go in for the logo design. So do go &
put in your vote.
I think this post from the GLAM list is relevant to the India list as well [again not personally volunteering :-)].
> Date: Sat, 25 Jun 2011 16:12:48 +0100
> From: john(a)bodkinprints.co.uk
> To: cultural-partners(a)wikimedia.ch
> Subject: [cultural-partners] Monthly? Summaries
> Now that this list is so busy, which is great, it is a nightmare to find
> anything from a while back. It would be really useful if someone
> compiled a single summary of posts by month or volume, by thread,
> cutting all the repetitions. I'm not volunteering, but I think it would
> make the list more productive.
> Meanwhile, can someone supply the link for Magnus's new bulk upload
> thingy? Thanks if so.
> cultural-partners mailing list
I think that it will be useful that I am subscribed to this list. Here
is my answer to Vickram's questions which Bishakha resent to foundation-l.
-------- Original Message --------
Subject: Re: [Foundation-l] Languages and numbers
Date: Sat, 25 Jun 2011 16:27:24 +0200
From: Milos Rancic <millosh(a)gmail.com>
To: Wikimedia Foundation Mailing List <foundation-l(a)lists.wikimedia.org>
On 06/25/2011 03:11 PM, Bishakha Datta wrote:
> I posted this on the India list (many people are not subscribed to
> foundation-l) - forwarding this question which just popped up.
First of all, although numbers look fascinatingly precise, they are far
from that. When you make a sum of approximations like
~1M+800k+30k+4k+700+20+ the language spoken by three individuals, you
will get fascinating number 1,834,723. So, the numbers are far from
being census-level precision.
All of the numbers are based on Ethnologue data , which varies from
very good to very bad approximations. Ethnologue varies even in
linguistic classification a lot. (Being educated in Serbian linguistics,
I know how bad the description of the South Slavic area is.) BUT, it is
the best source for all languages of the world ever been made, and it
gives good general picture.
> [296,097,274 349 India]
> Does the population number mean that the existing indic language
> wikipedias covers the rest of the population ie over 90 crore? Is
> this information updated from the current census?
By making a quick approximation of number of speakers of some large
official languages of India  and not counting English, I've come to
the number of ~650M and stopped counting (BTW, that includes the number
of 180M of Hindi speakers from 1991; and according to the population
growth in India, there should be at least 250M of Hindi speakers today).
Thus, I think that ~300M more could be gathered by other languages with
Wikipedias and by adjusting existing numbers for population growth. (I
could make more precise calculation if needed, but I would need some
time.) It should be also noted that dates of the entries in Ethnologue
vary a lot and that some of them could be old 20 years or more.
And, again, this should be used as very general guideline, not as a
precise one. This list would be very good in telling that there are much
more speakers of Awadhi than Merwari today. However, it is not good to
be used for comparison of number of speakers between Awadhi and
Maithili. But, anyway, that's not important. We know that we should work
to cover both Awadhi and Maithili.
At the other side, I will, indeed, try to make those numbers more useful
(although I think that the most important usefulness is about pointing
to the large populations without Wikipedias).
> It is fascinating, although I think I may not have understood the
> classifications. Is there only one Indian Sign Language, for
> instance? I was told by a user (in the UK) that several are in use in
> different parts of the country. Still, perhaps the variants do not
> have sufficient numbers of users to qualify for this listing.
> However, the context in which I was told was precisely the severe
> lack of support materials for helping users become self-sufficient
> and good communicators, so the list itself becomes a barrier.
> Unfortunately, I do not know at the moment how to fix the problem.
I've checked the whole database and just one Indian Sign Language has
been listed, which doesn't tell us a lot. Ethnologue entry about Indian
Sign Language  says that it is called "Indo-Pakistani Sign Language"
or "Urban Indian Sign Language". However, according to the fact that
"Deaf schools mainly do not use ISL...", it could mean that dialectical
divergence could be very high (thus, it could look as a number of
different languages), no matter the fact that it's been used in Pakistan
and Bangladesh, as well.
Said so, I have to admit that my knowledge about sign languages is very