On Sat, Jan 11, 2003 at 12:44:18PM -0800, Toby Bartels wrote:
You can (and we often do, on [[en:]]) using HTML
entities,
such as Č (for "C" with a hacek, TeX's "\v C").
That approach borks things up. Specifically, it screws websearches.
How many people are going to enter, or know to enter, the HTML entity
when they type in a search term? Related, but slightly different,
it screws up collation. With collation you can find things with
diacritics even when you aren't putting the diacritics in yourself,
and sorting order gets done properly.
I think UTF-8 is the way to go. It's been out for years, and is now
widely supported.
Jonathan
--
Geek House Productions, Ltd.
Providing Unix & Internet Contracting and Consulting,
QA Testing, Technical Documentation, Systems Design & Implementation,
General Programming, E-commerce, Web & Mail Services since 1998
Phone: 604-435-1205
Email: djw(a)reactor-core.org
Webpage:
http://reactor-core.org
Address: 2459 E 41st Ave, Vancouver, BC V5R2W2