On Nov 28, 2004, at 6:32 AM, Petr Kadlec wrote:
It seems to me that handling of non-ASCII characters
in interwiki
links (or in URLs in general) is a bit problematic. As an example,
take [[en:Václav Havel]]. Since en: does not use UTF-8, the URL is
".../V%E1clav_Havel". If you try to use the interwiki link to cs:
(specified in the source as [[cs:Václav Havel]]), it leads to
http://cs.wikipedia.org/wiki/V%E1clav_Havel, which is _wrong_, because
the cs: Wikipedia uses UTF-8 and the proper link should be
".../V%C3%A1clav_Havel".
It detects the encoding on the incoming link and redirects
transparently. Where's the problem?
And, vice versa, the Czech article contains
an interwiki link (specified again as [[en:Václav Havel]]) leads to
http://en.wikipedia.org/wiki/V%C3%A1clav_Havel, which is, again,
wrong.
It detects the encoding on the incoming link and redirects
transparently. Where's the problem?
I believe that a correct solution (apart from the
long-term solution
of using UTF-8 everywhere) could be:
* Accept UTF-8 in URLs on en: (but how could they be recognized??)
We already do, see above.
-- brion vibber (brion @
pobox.com)