On Mon, Nov 17, 2003 at 11:29:23PM +0100, Luc Van Oostenryck wrote:
I don't know how much browser are non-UTF-8 compliant, but I'm sure there will problems. My point of view is more like the following: why should we need UTF-8 on fr when 99.9999% of text are fairly happy with ISO8859-1 (the charset that most european use and french in particular) and named or numeric HTML entities are OK for the rest.
Why should we need ISO 8859-N with named or numeric HTML entities when 100% of text works with UTF-8 ? The number is much smaller than 99.9999%. It's probably much smaller than 99% too.
I worked with articles about Japanese language when Polish Wikipedia used ISO 8859-2 and it wasn't fun. I wasted lot of time trying to locate which &-entity corresponds to which character to fix a typo or make some other change. In the end I installed local mirror Wikipedia with UTF-8, and make a bunch of Perlscripts that converted from ISO 8859-2 + &codes; to readable representation.