But then there is one issue of missing characters in some scripts. For example Tamil has
very few characters. When we transliterate from other language into Tamil, say Kannada, we
will have to put one character instead of another. The original character will be lost.
When we transliterate back to Kannada from Tamil 2 or more characters will become just
one. One classic example is the case of kaanthi and gaandhi. In Tamil, both are written
same.
There were huge discussions on this topic in the Indic Unicode mailing list.
Thanks and regards,
Pavanaja
-----Original Message-----
From: wikimediaindia-l-bounces(a)lists.wikimedia.org
[mailto:wikimediaindia-l-bounces@lists.wikimedia.org] On Behalf Of sankarshan
Sent: 11 March 2013 13:53
To: Wikimedia India Community list
Subject: Re: [Wikimediaindia-l] Getting a new language Wikipedia going...
On Mon, Mar 11, 2013 at 10:38 AM, Pavanaja U B <
<mailto:pavanaja@vishvakannada.com> pavanaja(a)vishvakannada.com> wrote:
This is not a very big task. All Indic scripts are
separated by
decimal 128 in Unicode charts. For ex., take Hindi
“ka”. Add 128 to
its Unicode value to get Bengali “ka”. I had written a
simple such
transliterator long ago, using
VB.NET. I did not write
any web interface to that.
Then I'd say all that remains is pointing an interested party to the source code of
your tool and, asking them to assess writing a web-app.
--
sankarshan mukhopadhyay
< <https://twitter.com/#!/sankarshan>
https://twitter.com/#!/sankarshan>
_______________________________________________
Wikimediaindia-l mailing list
<mailto:Wikimediaindia-l@lists.wikimedia.org> Wikimediaindia-l(a)lists.wikimedia.org
To unsubscribe from the list / change mailing preferences visit
<https://lists.wikimedia.org/mailman/listinfo/wikimediaindia-l>
https://lists.wikimedia.org/mailman/listinfo/wikimediaindia-l