Hi Francis,
I have a suggestion to improve this software: build a corpus from the Tajik RFE website http://www.ozodi.org/
As you can pretty obviously tell, not all vowels are indicated in Farsi, so in some cases there *should* be multiple candidates for transliteration. For example, Farsi "yeh" can be transliterated in a number of different ways.
In these cases, a simple search of the corpus should reveal which alternative is an actual word, or which is most frequent, and select it.
Mark
On 28/05/06, Francis Tyers spectre@ivixor.net wrote:
Are there actually any Tajik native speakers working on the Tajik Wikipedia at the moment?
I'd like to discuss some software I'm making with them...
http://82.133.33.43/~spectre/tajik/tajik.php
I've had a look over at tg. but it seems to be very inactive.
Regards,
Fran
Wikipedia-l mailing list Wikipedia-l@Wikimedia.org http://mail.wikipedia.org/mailman/listinfo/wikipedia-l