----------------------------------------------------------------------
Hi Alexis,
Thank you, I hadn't realized...
and "Platonides"'s post explains why...!
----------------------------------------------------------------------
Hi Platonides,
Thanks a lot for your explanations and examples!
Line 1: "E t o i l é <space>"
Line 2: 0x45 0x74 0x6f 0x69 0x6c 0xe9 0x20
Line 3: 0x45 0x74 0x6f 0x69 0x6c 0xc3 0xa9 0x20
Do we say:
----- "Line 2" is the "iso-8859-1" representation of "Line
1"?
----- "Line 3" is the "utf-8" representation of "Line 1"?
Question: shouldn't we have 7 * 2 "codepoints" instead of 8?
Maybe you omitted them, didn't you?
----- "Line 1" is made of characters?
----- "Line 2" and "Line 3" are made of codepoints?
Let's consider:
Line 1: "E t o i l
é <space>"
Line 4: 0x00 0x45 0x00 0x74 0x00 0x6f 0x00 0x69 0x00 0x6c 0x00 0xe9
0x00 0x20
Line 5: 0x45 0x00 0x74 0x00 0x6f 0x00 0x69 0x00 0x6c 0x00 0xe9 0x00
0x20 0x00
----- Is "Line 4" the "utf-16 BE" representation of "Line
1"?
----- Is "Line 5" the "utf-16 LE" representation of "Line
1"?
Can you tell me where to find the various tables which
allow one to find a given representation ("iso-8859-1",
"utf-8", "utf-16 BE", "utf-16 LE") for a given
"character"?
I mean, how did you know that:
- 0xe9 is the "iso-8859-1" representation of é?
- 0xc3 0xa9 is the "utf-8" representation of é?
- 0x00 0xe9 is the "utf-16 BE" representation of é?
- 0xe9 0x00 is the "utf-16 LE" representation of é?
(Apart from the fact that you are a super-pro :) of course).
Please tell me if I misunderstood something and correct me if I
didn't use the proper terminology :) .
----------------------------------------------------------------------
Hi Nakohdo,
I was trying to do it with Vim...
Thanks a lot for your help :) :) :) .
All the best,
--
Lmhelp
--
View this message in context:
http://old.nabble.com/Web-page-source---%22strange%22-characters-tp27999218…
Sent from the WikiMedia General mailing list archive at
Nabble.com.