Gerard Meijssen wrote:
some time ago to move to UTF-8. At the time it
was not such
a good idea
as UTF-8 does take more room.
More room? UTF-8 does not use more memory, if that's what you
mean. HTML
entities (like Ӓ) use 5 up to 7 bytes, while a character
in UTF-8
uses at most 4 bytes.
It wasnt the pb, to convert nl.wikipedia, we had to uncompress the old table, which once
uncompressed take much more space.
Shaihulud