Usage statistics link is broken. Correct one:
On Thu, Mar 31, 2016 at 10:42 PM, Eran Rosenthal <eranroz89(a)gmail.com>
TL;DR: Parsoid isn't i18n friendly and uses
English keywords instead of
localized. Is it a bug or feature? Please voice your opinion!
For some funny reasons Parsoid is reading arrays from "right to left",
that is, it uses the LAST alias of the magic words rather than the first
One of the reasons for this is because in English the shorter "thumb" is
preferred compared to the long "thumbnail". However, instead of fixing
MessagesEn.php to define thumb as the first option, parsoid uses the last
This choice result in all other wikis using the English alias (which
appears last in magic words) rather than the localized one - so Parsoid
isn't i18n friendly.
However there are different POVs regarding the correct solution for it:
1. Use English aliases in all projects - these are the most used aliases
[and one of the reasons is people copying code from enwiki or using biased
tools such as Parsoid]
2. Use localized aliases - keep the article content and syntax in the same
language. This is especially important for non-latin languages with
And there is a consensus for English being bad choice for RTL languages as
it cause mixed directional content which should be avoided. So if we go
with 1 choice, RTL languages should be exception.
I believe there is a cultural point of view here, and would like to hear
what do you think (especially non RTL and non English speakers): Do you
prefer mini (German), vignette (French), miniaturadeimagen (Spanish),
мини (Russian) instead of thumb (for example)?
I did some dump-minning to get the usage statistics:
And based on this I wrote a python script to suggest a reordering of the
aliases by usage, so if choice 2 is selected, we can merge and all
languages will use the preferred choice.