Will the upgrade to MySQL 5 give us (within the limits of Wikipedia per language) culture correct sorting or just the default Unicode collation?
Peter
Peter Jacobi wrote:
Will the upgrade to MySQL 5 give us (within the limits of Wikipedia per language) culture correct sorting or just the default Unicode collation?
MySQL has several language-specific Unicode collations in addition to the generic one:
Czech Danish Estonian Icelandic Latvian Lithuanian Persian Polish Roman Romanian Slovak Slovenian Spanish Swedish Turkish
Note that "Language-specific collations for the utf8 character set are implemented only if the ordering with utf8_unicode_ci does not work well for a language. For example, utf8_unicode_ci works fine for German and French, so there is no need to create special utf8 collations for these two languages."
For the full list and notes see: http://dev.mysql.com/doc/refman/5.0/en/charset-unicode-sets.html
-- brion vibber (brion @ pobox.com)
wikitech-l@lists.wikimedia.org