Alexander Prudnikov wrote:
- How is the UTF-8 encoding encoded and decoded into other encodings?
Where in the sources can I find it? And what additional libraries or software (except php, apache etc.) should I have in order to encode/decode UTF-8?
I am not aware that MediaWiki currently does any encoding or decoding at all. All our wikis use either ISO-8859-1 or UTF-8, and each of them uses this one encoding for input, database storage, output and everything. The only conversion we ever need to do is when switching a wiki to a different encoding, but this is a one-time thing per wiki, and if you start with UTF-8 right from the start, you never need to bother with this.
- How can I define (in the code) what encoding my Wiki uses? I mean what
variable contain information about the encoding?
$wgInputEncoding and $wgOutputEncoding. I don't know what happens if you specify different encodings for the two; just use UTF-8 for both of them and you'll be fine.
Timwi