The consumer that writes Event Logging events into the SQL database was
down yesterday for about 9 hours. We restarted it and it consumed the data
it missed, and inserted it into the database. The incident report is here:
https://wikitech.wikimedia.org/wiki/Incident_documentation/20151015-EventLo…
We don't yet know if any data was lost, I'm going to run some queries now
on a few schemas and I'll update the incident report.
If you had reports run on October 14th, between 06:00 UTC and 21:00 UTC,
you should re-run them.
The current site stats.grok.se is frequently down and very unstable. Are there any plans to set in motion an ongoing stable version that isn't maintained solely by a single individual.
The information in it is highly relevant to predictive analytics and of considerable use.
Doug Stone
Team:
As of today incoming request data includes an extra bit of information on
the X-analytics header.
If an incoming request to any wikipedia project had no cookies whatsoever
it will be tagged with nocookie=1. A requests without any cookies could
correspond to a fresh browser session, a user browsing with cookies
disabled or, most likely, a bot request as most bots will not accept
cookies. We *might* be able to use this setting as a cheap proxy to
quantify bot traffic.
Documentation about this change can be found here:
https://wikitech.wikimedia.org/wiki/X-Analytics
Thanks,
Nuria
Hi my name is SeminoleNation and I have been wondering if Wikipedia plans
on implementing a stable way of looking at article traffic statistics. The
current http://stats.grok.se/link is very unstable and frequently crashes
and will lose information of multiple days at a time. Thank you
Hello,
I work for a consulting firm called Strategy&. We have been engaged by Facebook on behalf of Internet.org to conduct a study on assessing the state of connectivity globally. One key area of focus is the availability of relevant online content. We are using a the availability of encyclopedic knowledge in one's primary language as a proxy for relevant content. We define this as 100K+ Wikipedia articles in one's primary language. We have a few questions related to this analysis prior to publishing it:
* We are currently using the article count by language based on Wikimedia's foundation public link: Source: http://meta.wikimedia.org/wiki/List_of_Wikipedias. Is this a reliable source for article count - does it include stubs?
* Is it possible to get historic data for article count. It would be great to monitor the evolution of the metric we have defined over time?
* What are the biggest drivers you've seen for step change in the number of articles (e.g., number of active admins, machine translation, etc.)
* We had to map Wikipedia language codes to ISO 639-3 language codes in Ethnologue (source we are using for primary language data). The 2 language code for a wikipedia language in the "List of Wikipedias" sometimes matches but not always the ISO 639-1 code. Is there an easy way to do the mapping?
Many Thanks,
Rawia
[Description: Strategy& Logo]
Formerly Booz & Company
Rawia Abdel Samad
Direct: +9611985655 | Mobile: +97455153807
Email: Rawia.AbdelSamad(a)strategyand.pwc.com<mailto:Rawia.AbdelSamad@strategyand.pwc.com>
www.strategyand.com
I believe its possible to create a template that would make the page be excluded in searches. I would need to check on that though. If not, then I would suggest such a capability would be a useful improvement and a phabricator request should be opened for it.
But I would argue against deleting most things without some discussion.
Sent from my T-Mobile 4G LTE device
------ Original message------From: LegoktmDate: Thu, Oct 15, 2015 12:59 PMTo: analytics@lists.wikimedia.org;Subject:Re: [Analytics] Canonical location for metrics documentation
On 10/14/2015 06:34 AM, Dan Andreescu wrote:> We have a documentation cleanup day coming up soon, and we've just got> delete permissions so we can actually clean.Please don't delete old content, mark it as {{historical}} or{{outdated}} and archive it instead.-- Legoktm_______________________________________________Analytics mailing listAnalytics@lists.wikimedia.orghttps://lists.wikimedia.org/mailman/listinfo/analytics