[Mediawiki-l] Web page source - "strange" characters
lmhelp2
lea.massiot at ign.fr
Wed Mar 24 14:40:33 UTC 2010
----------------------------------------------------------------------
Hi Alexis,
Thank you, I hadn't realized...
and "Platonides"'s post explains why...!
----------------------------------------------------------------------
Hi Platonides,
Thanks a lot for your explanations and examples!
Line 1: "E t o i l é <space>"
Line 2: 0x45 0x74 0x6f 0x69 0x6c 0xe9 0x20
Line 3: 0x45 0x74 0x6f 0x69 0x6c 0xc3 0xa9 0x20
Do we say:
----- "Line 2" is the "iso-8859-1" representation of "Line 1"?
----- "Line 3" is the "utf-8" representation of "Line 1"?
Question: shouldn't we have 7 * 2 "codepoints" instead of 8?
Maybe you omitted them, didn't you?
----- "Line 1" is made of characters?
----- "Line 2" and "Line 3" are made of codepoints?
Let's consider:
Line 1: "E t o i l
é <space>"
Line 4: 0x00 0x45 0x00 0x74 0x00 0x6f 0x00 0x69 0x00 0x6c 0x00 0xe9
0x00 0x20
Line 5: 0x45 0x00 0x74 0x00 0x6f 0x00 0x69 0x00 0x6c 0x00 0xe9 0x00
0x20 0x00
----- Is "Line 4" the "utf-16 BE" representation of "Line 1"?
----- Is "Line 5" the "utf-16 LE" representation of "Line 1"?
Can you tell me where to find the various tables which
allow one to find a given representation ("iso-8859-1",
"utf-8", "utf-16 BE", "utf-16 LE") for a given "character"?
I mean, how did you know that:
- 0xe9 is the "iso-8859-1" representation of é?
- 0xc3 0xa9 is the "utf-8" representation of é?
- 0x00 0xe9 is the "utf-16 BE" representation of é?
- 0xe9 0x00 is the "utf-16 LE" representation of é?
(Apart from the fact that you are a super-pro :) of course).
Please tell me if I misunderstood something and correct me if I
didn't use the proper terminology :) .
----------------------------------------------------------------------
Hi Nakohdo,
I was trying to do it with Vim...
Thanks a lot for your help :) :) :) .
All the best,
--
Lmhelp
--
View this message in context: http://old.nabble.com/Web-page-source---%22strange%22-characters-tp27999218p28015920.html
Sent from the WikiMedia General mailing list archive at Nabble.com.
More information about the MediaWiki-l
mailing list