Hi Nik,
Thanks for the update, and I look forward to the output.
Some questions.
1) Is this going out independently of the WMF7 update, or is it part of that update and just not mentioned in the detail?
2) Will this be in all English language wikis, including the central wikimedia.org wikis?
3) What is happening with the multilanguage wikis, e.g. commons, wikidata (or are they considered English)?
Thanks. Regards, Billinghurst
On Mon, 2 Jun 2014 12:21:53 -0400, Nikolas Everett neverett@wikimedia.org wrote:
Ambassadors,
Sorry for being silent for so long. I have a (maybe) important update for CirrusSearch. I'm currently in the process of pushing Unicode normalization [0] to many languages [1]. In some languages this will (hopefully!) be great and in others it won't change anything. If this has broken anything please let me know. Reply or file a bug or whatever is easiest for you.
Thanks for reading!
Nik
[0]: NFKC with case folding. See http://unicode.org/reports/tr15/#Norm_Forms for those who want to read more/already know and love Unicode normalization.
[1]: All languages _but_ these: arabic, armenian, basque, brazilian, bulgarian, catalan, chinese, czech, danish, dutch, finnish, french, galician, german, greek, hindi, hungarian, indonesian, italian, norwegian, persian, portuguese, romanian, russian, spanish, swedish, turkish, thai. It's a long story why they aren't getting it, but they will in time if everything goes well....
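
For anyone who wants to see what "NFKC with case folding" does to a search term, here is a minimal sketch using only Python's standard library. It is an approximation for illustration, not the code CirrusSearch runs, and the helper name nfkc_casefold is made up for this example.

```python
import unicodedata


def nfkc_casefold(text: str) -> str:
    """Approximate 'NFKC with case folding' (hypothetical helper, illustration only)."""
    # Compatibility-normalize first: fullwidth letters, ligatures, and similar
    # presentation forms collapse to their plain equivalents.
    normalized = unicodedata.normalize("NFKC", text)
    # Case folding is a more aggressive lowercasing meant for caseless matching.
    folded = normalized.casefold()
    # Folding can leave the string un-normalized again, so normalize once more.
    return unicodedata.normalize("NFKC", folded)


if __name__ == "__main__":
    for term in ["ＷＩＫＩ", "ﬁle", "Straße"]:
        print(repr(term), "->", repr(nfkc_casefold(term)))
```

The idea is that a query and the indexed text are both passed through the same function, so variants that differ only in case or in compatibility forms end up matching each other.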