On Thu, Mar 07, 2002 at 11:34:53PM -0800, Brion L. VIBBER wrote:
On Thu, 2002-03-07 at 22:07, Tomasz Wegrzanowski wrote:
Just see articles about anything Japanese on English Wikipedia. They contain Japanese names of everything.
Sure, but more often kanji than kana, so special kana markup wouldn't be that big a win. See the thread "International Upgrades"; the vague plan is to standardise the internal character set and present the wikipedias in Unicode to capable browsers. (Please comment!)
Uhm, right. But most non-Japanese people don't know the names of many kanji, so kanji aren't that important. ;) On the other hand, more people than is usually thought know kana, so it might be beneficial for them.
Hmmm. Now I think that some general method would be more useful: &katakana_a;, &kanji_b;, &hebrew_c;, or &cyrillic_d;.
I think it wouldn't need too many changes to the parser. Perl code. Init:
%Entities = ('&katakana_o;' => 'オ', ... );
On HTML output:
s/(&[a-zA-Z0-9_]+;)/$Entities{$1} ? $Entities{$1} : $1/eg;
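For comparison, here is a sketch of the same entity-expansion idea in Python; the table contents and function name are illustrative, not anything from the actual wiki code, and a real table would of course cover whole scripts:

```python
import re

# Illustrative entity table (keys are bare entity names, values are the
# Unicode characters they stand for).
ENTITIES = {
    "katakana_o": "\u30aa",  # オ
    "cyrillic_d": "\u0434",  # д
}

def expand_entities(text):
    # Replace each &name; with its mapped character; unknown entities
    # are left untouched, just like the Perl one-liner above.
    return re.sub(
        r"&([A-Za-z0-9_]+);",
        lambda m: ENTITIES.get(m.group(1), m.group(0)),
        text,
    )
```

So `expand_entities("see &katakana_o; here")` would yield the katakana character inline, while an unrecognized `&foo;` passes through unchanged.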
As a result, we should be able to use the customary input methods or cut-and-paste to put any characters into any of the wikis, which is certainly a lot easier than looking up entities or running text through a UTF-8-to-entities converter (which is what I currently do).
-- brion vibber (brion @ pobox.com)
Hmmm. Wouldn't that need some modifications to browsers?