Re: [Wikitech-l] Encoding (Localisation updates from https://translatewiki.net.)

25 Mar 2014

On Mar 25, 2014 7:17 PM, "Brian Wolff" <bawolff(a)gmail.com> wrote:
...

 On Mar 25, 2014 3:11 PM, "Jon Robson" <jdlrobson(a)gmail.com> wrote:
 >
 > Now that MobileFrontend is using JSON for languages, I jumped on this to
 > create a script to make language addition easier - basically a command
 > line interface called `make message` that edits the JSONs to add an
 > English message and QQQ code and maintains alphabetical ordering [1].
 >
 > Recently this was used and some updates came from translatewiki.net.
 >
 > I ran my `make message` script and noticed it made some changes to
 > those from the translation updater bot [2].
 > I was wondering - what would be the correct way to store these messages?
 > Do I need to update my script or should the translation updater bot be
 > doing things differently?
 >
 > "아라" or "\uc544\ub77c"
 > "\u003Ccode\u003E" or "<code>" ?
 >
 > Thanks in advance for your opinions!
 >
 > [1] https://gerrit.wikimedia.org/r/#/c/119637/
 > [2] https://gist.github.com/jdlrobson/9767604
 >
 > 
I recently looked at JSON encoding for a different project. The conclusion
there, too, was pretty much to never use Unicode escapes except when demanded
by the spec.
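
To make the two storage forms concrete, here is a minimal sketch in Python. Python, the message key and the sample text are assumptions for illustration only; nothing here is taken from the `make message` script or from the translation updater bot. It serialises the same message once with literal characters and once with escapes, including one character outside the BMP, and checks that both decode to the same data.

```python
import json

# Hypothetical message key and text, chosen only for illustration.
messages = {"example-key": "아라 <code> 😀"}

# ensure_ascii=False writes non-ASCII characters literally, so the
# file stays readable for humans.
readable = json.dumps(messages, ensure_ascii=False)

# ensure_ascii=True (Python's default) escapes every non-ASCII character;
# a character outside the BMP becomes a surrogate pair of \u escapes.
escaped = json.dumps(messages, ensure_ascii=True)

print(readable)  # {"example-key": "아라 <code> 😀"}
print(escaped)   # {"example-key": "\uc544\ub77c <code> \ud83d\ude00"}

# Both forms decode to the same data, so a conforming parser sees no
# difference; the choice only affects how the file reads in an editor.
assert json.loads(readable) == json.loads(escaped) == messages
```

An escaped "\u003Ccode\u003E" likewise decodes to "<code>", so for a conforming parser the two spellings in the original question are equivalent.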

As is often the case, non-BMP characters may be painful. A surrogate pair of
Unicode escapes may be used to describe such a code point in JSON. Whether you
prefer to believe that the JSON implementation of your consumer is less likely
to choke on literal astral plane characters or on escaped surrogate pairs is
up for debate (I have seen both go wrong).
_______________________________________________
Wikitech-l mailing list
Wikitech-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l
 Let's not escape Unicode characters unnecessarily - humans read those files
 too.

 JSON files are allowed to have most characters as unescaped UTF-8.

 -bawolff
