On Sun, 2002-03-10 at 09:54, Jimmy Wales wrote:
pl.wikipedia.com
How about I set this up on Monday? Do I need a new conversion script?
I'll need to tweak the conversion script to convert the Latin-2 database to UTF-8, and drop in a quick fix to make searching work on non-ASCII words. Shouldn't be hard; I'll drop a note when it's checked in.
The full set of encoding-related changes to the code still needs to be determined and made, but if we start it off using UTF-8, the database should be ready.
(Also, we may need to rebuild the search index later, as that part of the code may still change. But then this is the case if we change the MySQL indexing anyway.)
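For illustration only, the Latin-2 to UTF-8 step could look roughly like the Python sketch below. This is not the actual conversion script, which isn't shown in this message; the function name latin2_to_utf8 and the sample text are made up for the example, and it assumes the stored text really is ISO 8859-2.

```python
def latin2_to_utf8(raw: bytes) -> bytes:
    """Re-encode ISO 8859-2 (Latin-2) bytes as UTF-8.

    Hypothetical helper for illustration; assumes the input is Latin-2.
    """
    # Decode the legacy bytes to Unicode text, then encode that text as UTF-8.
    return raw.decode("iso8859_2").encode("utf-8")


if __name__ == "__main__":
    # Sample Polish text as it would be stored in a Latin-2 database.
    stored = "Zażółć gęślą jaźń".encode("iso8859_2")
    converted = latin2_to_utf8(stored)
    print(converted.decode("utf-8"))
```

Plain ASCII passes through unchanged, so only text containing accented characters is actually altered. Running the conversion a second time on already-converted data would garble it, so a real script would need to make sure each record is converted exactly once.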
I believe that this is best:
- I set up pl2.wikipedia.com as the "beta test" Polish wikipedia, and import the existing data from pl.wikipedia.com using a conversion script.
- We run this for one week, inviting as many Polish speakers as we can find to test it, incorporating bug fixes all week as quickly as they come in.
- The following Monday, I make the conversion "for real".
Sounds good to me. Can we commit to such a schedule for Esperanto as well? We've only put it off so far because we were going to change the database internals, but since we seem to have decided on UTF-8 it's not a problem to convert ASAP and make the changes later.
I'd like to follow up with Spanish and Esperanto as quickly as possible.
FYI, I currently have a beta running the converted Esperanto wiki at http://leukas.dyndns.org/wiki/ . A more officialish server wouldn't hurt though, as this is limited to the reliability of my DSL line.
-- brion vibber (brion @ pobox.com)
wikitech-l@lists.wikimedia.org