Re: [Wikitech-l] Different alphabets for the same language

12 Apr 2005

      ...
very nice indeed, a lot of languages should benefit of such feature.
However, languages that have latin script in their list show that
there are various exceptions to implement:

html entities (starting with '&' and ending with ';') must not be
converted.

noted.
...

interwiki links should be handled as exceptions too (as the list
of valid interwiki domains is known (eg, the possible xx in [[xx:foo]])
it should be easy to implement
it would be nice also to detect urls and not convert them

The conversion happens at a rather late stage of the wiki parser at
which the input should be largely (x)html. I have attempted to avoid
converting them but may have missed something. Can you provide an
example at the test site when this is not done correctly?
...
then, it would be need to have a way to force conversion even for
things otherwise being exceptions (that is, the opposite of -{ }- );
and the very nice thing would be a way to suggest the appropriate
conversion for a given string (for example, foreign people names could
be written differently in cyrillic and latin,
maybe something like "blablabla ={Latn:Saratxaga|Cyrl:Сарачага}= blabla",
that would be displayed as "blablabla Saratxaga blaba"
or "блаблаблабла Сарачага блаблабла" but not as
"блаблабла Саратхага блаблабла"
maybe the Latn:/Cyrl: could be removed, as the script can be found
from the strings, syntax will then be easier for the editors:
"blablabla ={Saratxaga|Сарачага}= blabla" or, if they write in cyrillic:
"блаблаблабла ={Сарачага|Saratxaga}= блаблабла"
This function is built in and is running at ZH. At the BE test site,
you can use the following syntax (note how close it is to your
suggestion:)
-{be-cyrillics: Foo; be-latin: Bar}-
This will show "Foo" in cyrillics mode, and "Bar" in latin mode.
-- 
zhengzhu

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002

Re: [Wikitech-l] Different alphabets for the same language