On Thu, Mar 07, 2002 at 11:34:53PM -0800, Brion L. VIBBER wrote:
On Thu, 2002-03-07 at 22:07, Tomasz Wegrzanowski wrote:
Just see articles about anything Japanese on
English Wikipedia.
They contain Japanese names of everything.
Sure, but more often kanji than kana, so special kana markup wouldn't be
that big a win. See the thread "International Upgrades"; the vague plan
is to standardise the internal character set and present the wikipedias
in Unicode to capable browsers. (Please comment!)
Uhm, right. But most non-Japanese people don't know the names of many kanji,
so kanji aren't that important. ;) On the other hand, more people than is
usually thought know kana, so it might be beneficial for them.
Hmmm. Now I think that some general method would be more useful:
&katakana_a; &kanji_b; &hebrew_c; or &cyrillic_d;
I don't think it would need many changes to the parser.
Perl code:

Init:

    my %Entities = (
        '&katakana_o;' => 'オ',
        ...
    );

On HTML output:

    s/(&[a-zA-Z0-9_]+;)/exists $Entities{$1} ? $Entities{$1} : $1/eg;
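For concreteness, here is a runnable version of that sketch. The entity table is illustrative; `&katakana_o;` is the only entry that appears in this thread, and the helper name `expand_entities` is mine, not anything from the wiki codebase:

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Illustrative entity table; real deployment would fill this in per script.
my %Entities = (
    '&katakana_o;' => 'オ',
);

sub expand_entities {
    my ($text) = @_;
    # Replace known entities; pass unknown ones through untouched,
    # so ordinary HTML entities like &amp; are left alone.
    $text =~ s/(&[a-zA-Z0-9_]+;)/exists $Entities{$1} ? $Entities{$1} : $1/eg;
    return $text;
}

print expand_entities('Sounds like &katakana_o; to me; &amp; stays.'), "\n";
# prints: Sounds like オ to me; &amp; stays.
```

Unknown entities falling through unchanged matters here: the page may legitimately contain entities the table doesn't know about, and eating them would corrupt the output.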
As a result, we should be able to use the customary
input methods or
cut-n-paste to put any characters into any of the wikis, which is
certainly a lot easier than looking up entities or running text through
a UTF-8-to-entities converter (which is what I currently do).
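The converter mentioned here isn't shown in the thread; a minimal stand-in, assuming Perl with the standard Encode module (the function name `utf8_to_entities` is mine), could look like:

```perl
#!/usr/bin/perl
use strict;
use warnings;
use Encode qw(decode);

# Take UTF-8 bytes, return pure ASCII with every non-ASCII character
# turned into a numeric character reference (&#NNNN;).
sub utf8_to_entities {
    my ($bytes) = @_;
    my $text = decode('UTF-8', $bytes);
    $text =~ s/([^\x00-\x7F])/sprintf('&#%d;', ord($1))/eg;
    return $text;
}

# "\xE6\x9D\xB1\xE4\xBA\xAC" is UTF-8 for 東京.
print utf8_to_entities("Tokyo \xE6\x9D\xB1\xE4\xBA\xAC"), "\n";
# prints: Tokyo &#26481;&#20140;
```

Decoding first and substituting per character is what keeps multibyte sequences intact; a byte-level regex would split them.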
-- brion vibber (brion @ pobox.com)
Hmmm. Wouldn't that need some modifications to browsers?