Hello Milos,
Tuesday, April 12, 2005, 12:27:27 AM, you wrote:
I am implementing transliteration from Cyrillic to Latin for the Serbian Wikipedia. I am not sure that Serbian or Belarusian text can go through "live transliteration" without problems.
Problems obviously exist, but they are solvable.
Transliteration between the Cyrillic and Latin alphabets is complicated by a number of problems (the Serbian Latin and Cyrillic orthographies have some differences, too): if the Latin alphabet in Belarusian has equal status with Cyrillic (as it almost does in Serbian), you can't forbid writing in Latin. And if you want to transliterate from Latin to Cyrillic, you'll get a lot of English words transliterated into Cyrillic.
One suggestion is to put some edit-time markup around non-convertible Latin words.
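To make the suggestion concrete, here is a rough sketch in Python of Latin-to-Cyrillic conversion for Serbian that leaves marked words alone. The `<lat>...</lat>` marker and the function name `latin_to_cyrillic` are made up for illustration only; nothing like that is agreed on yet, and this is not how any existing converter works.

```python
import re

# Serbian Latin -> Cyrillic. The digraphs lj, nj, dž are single Cyrillic letters,
# so they have to be matched before the single letters.
DIGRAPHS = {"lj": "љ", "nj": "њ", "dž": "џ"}
SINGLES = dict(zip("abvgdđežzijklmnoprstćufhcčš",
                   "абвгдђежзијклмнопрстћуфхцчш"))

# Hypothetical edit-time marker for words that must stay in Latin.
PROTECTED = re.compile(r"<lat>(.*?)</lat>", re.DOTALL)
# A token is a digraph or, failing that, any single character.
TOKEN = re.compile(r"lj|nj|dž|.", re.IGNORECASE | re.DOTALL)


def _convert_token(match):
    tok = match.group(0)
    low = tok.lower()
    cyr = DIGRAPHS.get(low) or SINGLES.get(low)
    if cyr is None:
        return tok  # w, x, y, q, digits, punctuation: left untouched
    return cyr.upper() if tok[0].isupper() else cyr


def latin_to_cyrillic(text):
    parts = []
    pos = 0
    for m in PROTECTED.finditer(text):
        # Convert the text before the marker, keep the marked text verbatim.
        parts.append(TOKEN.sub(_convert_token, text[pos:m.start()]))
        parts.append(m.group(1))
        pos = m.end()
    parts.append(TOKEN.sub(_convert_token, text[pos:]))
    return "".join(parts)


print(latin_to_cyrillic("Njegoš koristi <lat>Windows</lat>"))
print(latin_to_cyrillic("Windows"))  # unmarked English word gets mangled
```

The second call shows the problem from the quoted paragraph: without markup an English word is partly converted. The sketch also ignores the orthographic differences mentioned above, for example words where "nj" is really n followed by j (as in "injekcija"); those would need an exception list or markup too.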
Also, what about referencing in Cyrillic? If you add some Russian bibliography, you'll have Russian text transliterated into Latin.
This is a problem for which no solution has been found yet. I think we will end up with some markup stored just in the database.
So, I started to build some (static) infrastructure (using pywikipediabot) for Serbian. As the situation is similar for Belarusian, we can do it together (or you can use my code when I finish it for the Serbian Wikipedia).
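Just to illustrate what such markup could look like in the other direction, here is a small Python sketch of Cyrillic-to-Latin conversion that skips marked spans such as a Russian bibliography entry. The `<cyr>...</cyr>` marker and the name `cyrillic_to_latin` are assumptions for the example, not an existing syntax or API.

```python
import re

# The 30 letters of the Serbian Cyrillic alphabet and their Latin equivalents.
CYRILLIC = "абвгдђежзијклљмнњопрстћуфхцчџш"
LATIN = "a b v g d đ e ž z i j k l lj m n nj o p r s t ć u f h c č dž š".split()

# Translation table for str.translate; capitals map to title case (Љ -> Lj).
TABLE = {}
for cyr, lat in zip(CYRILLIC, LATIN):
    TABLE[ord(cyr)] = lat
    TABLE[ord(cyr.upper())] = lat.capitalize()

# Hypothetical marker for text that must stay in Cyrillic.
KEEP = re.compile(r"<cyr>(.*?)</cyr>", re.DOTALL)


def cyrillic_to_latin(text):
    # re.split with one capturing group returns the pieces outside the
    # markers at even indexes and the marked contents at odd indexes.
    pieces = KEEP.split(text)
    return "".join(
        piece if i % 2 else piece.translate(TABLE)
        for i, piece in enumerate(pieces)
    )


print(cyrillic_to_latin("Љубљана и <cyr>Пушкин</cyr>"))
```

Whether the markers are typed by editors or only stored in the database, the conversion step itself would look much the same; the open question is who puts the markers there.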
Great, could you show a test site or something? Could you also explain the idea behind your work, especially why you use pywikipedia?