On Sat, Jan 11, 2003 at 12:44:18PM -0800, Toby Bartels wrote:
> You can (and we often do, on [[en:]]) using HTML entities, such as &#268; (for "C" with a hacek, TeX's "\v C").
That approach borks things up. Specifically, it screws up web searches. How many people are going to enter, or even know to enter, the HTML entity when they type in a search term? A related but slightly different problem is that it breaks collation. With proper collation, you can find text with diacritics even when you don't type the diacritics yourself, and the sort order comes out right.
I think UTF-8 is the way to go. It's been out for years and is now widely supported.
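
To make the search and sorting point concrete, here is a rough Python sketch. It is only an illustration, not how any particular search engine or wiki software works: the sample names, the fold_diacritics() and matches() helpers, and the "strip the accents" rule are all my own assumptions, and real collation is locale-specific and more involved than this.

```python
import html
import unicodedata


def fold_diacritics(text: str) -> str:
    """Crude stand-in for collation: decompose, then drop combining marks."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def matches(query: str, stored: str) -> bool:
    """Case- and diacritic-insensitive substring match (hypothetical helper)."""
    return fold_diacritics(query).casefold() in fold_diacritics(stored).casefold()


# The same name stored two ways: as a numeric HTML entity, and as real text.
entity_form = "&#268;apek"
text_form = html.unescape(entity_form)  # the actual character, storable as UTF-8

# What a user without a Czech keyboard is likely to type.
query = "capek"

entity_hit = matches(query, entity_form)  # searches the literal "&#268;" string
text_hit = matches(query, text_form)      # searches the decoded character

# Sorting: raw code-point order versus order after folding diacritics.
names = ["Dvořák", "Čapek", "Bartók"]
by_code_point = sorted(names)
by_folded_key = sorted(names, key=fold_diacritics)

print(entity_hit, text_hit)
print(by_code_point)
print(by_folded_key)
```

With the entity stored in the text, the plain query has nothing to match against. With the actual character stored, software at least has the chance to fold or collate it sensibly.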
Jonathan