Alex, this was addressed with a new deploy yesterday; the content-type now explicitly sets the charset.

From: Alex Druk
Sent: Thursday, February 4, 2016 04:48
To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics.
Reply To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics.
Subject: Re: [Analytics] Pagecounts dumps page title UTF-8 escaping

Hi all,

I have a similar question: why do the MediaWiki API and the new pageviews API send different Content-Type response headers?
The MediaWiki API sends 'content-type: text/html; charset=UTF-8'
and the new pageviews API sends only 'content-type: application/json', without explicitly setting UTF-8.
For example, in my Chrome browser I see "Goiânia_accident" displayed correctly in MediaWiki responses, but with the "â" garbled in pageviews API responses.
Was this done intentionally, or is it just a bug?
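
To illustrate the difference, here is a minimal Python sketch of what a client might do with the two headers. It assumes the client falls back to windows-1252 when no charset parameter is present; that fallback is my guess at the browser's behaviour, not something confirmed for either API, and charset_of is a hypothetical helper name.

```python
from email.message import Message


def charset_of(content_type: str, default: str = "windows-1252") -> str:
    """Return the charset parameter of a Content-Type value.

    Falls back to `default` when the header carries no charset.
    The windows-1252 default is an assumption about the client.
    """
    msg = Message()
    msg["Content-Type"] = content_type
    return msg.get_content_charset() or default


title = "Goiânia_accident"
body = title.encode("utf-8")  # the bytes as they travel over the wire

# Header with an explicit charset: the bytes are decoded as UTF-8.
with_charset = body.decode(charset_of("text/html; charset=UTF-8"))

# Header without a charset: the assumed fallback mis-decodes the two
# UTF-8 bytes of "â" as two separate windows-1252 characters.
without_charset = body.decode(charset_of("application/json"))

print(with_charset)     # Goiânia_accident
print(without_charset)  # GoiÃ¢nia_accident
```

The bytes are the same in both cases; only the decoding chosen by the client differs, which is why adding the charset to the header would be enough to fix the display.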

Thanks in advance!
Alex


On Thu, Feb 4, 2016 at 8:22 AM, Federico Leva (Nemo) <nemowiki@gmail.com> wrote:
Bo Han, 04/02/2016 00:40:
Is the logic for the escaping available somewhere?

MediaWiki API does https://phabricator.wikimedia.org/T29849
For the new pageviews API I got this reply on Unicode normalisation: https://phabricator.wikimedia.org/T44259#1351880

(Phabricator is down right now; wait a couple hours or check web.archive.org.)

Nemo


_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics



--
Thank you.

Alex Druk, PhD
wikipediatrends.com
alex.druk@gmail.com
(775) 237-8550 Google voice