Thanks Giuseppe !
I've subscribed to T219279 to know when the pages are properly converted,
and when I can remove the hack in my code.
On Mon, Aug 5, 2019 at 7:03 AM Giuseppe Lavagetto <
On Sun, Aug 4, 2019 at 11:34 AM Nicolas Vervelle
Great for the link to Php72ToUpper.php !
I think I understand with it : for example, the first line says 'ƀ' =>
which should mean that this letter shouldn't
be converted to uppercase
That's one of the letter I found that wasn't converted to uppercase and
that was generating a false positive in my code : so it's because
MW code is preventing the conversion :-)
No, that file is a temporary measure during a transition between two
versions of php.
In HHVM and PHP 5.x, calling mb_toupper("ƀ") would give the erroneous
In PHP 7.x, the result is the correct capitalization.
The issue is that the titles of wiki articles get normalized, so under
we would have
ƀar => Ƀar
which would prevent you from being able to reach the page.
Once we're done with the transition and we go through the process of
coverting the (several hundred) pages/users that have the wrong title
normalization, we will remove that table, and obtain the correct
You just need to subscribe https://phabricator.wikimedia.org/T219279
wait for its resolution I think - most unicode horrors are fixed in recent
versions of PHP, including the one you were citing.
Principal Site Reliability Engineer, Wikimedia Foundation
Wikitech-l mailing list