[Mediawiki-l] Web page source - "strange" characters

lmhelp2 lea.massiot at ign.fr
Wed Mar 24 14:40:33 UTC 2010


----------------------------------------------------------------------
Hi Alexis,

Thank you, I hadn't realized...
and "Platonides"'s post explains why...!

----------------------------------------------------------------------
Hi Platonides,

Thanks a lot for your explanations and examples!

Line 1: "E       t       o      i        l       é             <space>" 
Line 2:  0x45  0x74  0x6f  0x69  0x6c  0xe9         0x20
Line 3:  0x45  0x74  0x6f  0x69  0x6c  0xc3 0xa9  0x20

Do we say:

----- "Line 2" is the "iso-8859-1" representation of "Line 1"?

----- "Line 3" is the "utf-8" representation of "Line 1"?
Question: shouldn't we have 7 * 2 "codepoints" instead of 8?
Maybe you omitted them, didn't you?

----- "Line 1" is made of characters?

----- "Line 2" and "Line 3" are made of codepoints?

Let's consider:

Line 1: "E              t              o              i              l              
é              <space>" 
Line 4:  0x00 0x45  0x00 0x74  0x00 0x6f  0x00 0x69  0x00 0x6c  0x00 0xe9 
0x00 0x20
Line 5:  0x45 0x00  0x74 0x00  0x6f 0x00  0x69 0x00  0x6c 0x00  0xe9 0x00 
0x20 0x00

----- Is "Line 4" the "utf-16 BE" representation of "Line 1"?
----- Is "Line 5" the "utf-16 LE" representation of "Line 1"?

Can you tell me where to find the various tables which 
allow one to find a given representation ("iso-8859-1", 
"utf-8", "utf-16 BE", "utf-16 LE") for a given "character"?

I mean, how did you know that:
- 0xe9 is the "iso-8859-1" representation of é?
- 0xc3 0xa9 is the "utf-8" representation of é?
- 0x00 0xe9 is the "utf-16 BE" representation of é?
- 0xe9 0x00 is the "utf-16 LE" representation of é?
(Apart from the fact that you are a super-pro :) of course).

Please tell me if I misunderstood something and correct me if I
didn't use the proper terminology :) .

----------------------------------------------------------------------
Hi Nakohdo,

I was trying to do it with Vim...

Thanks a lot for your help :) :) :) .
All the best,
--
Lmhelp
-- 
View this message in context: http://old.nabble.com/Web-page-source---%22strange%22-characters-tp27999218p28015920.html
Sent from the WikiMedia General mailing list archive at Nabble.com.




More information about the MediaWiki-l mailing list