People from Gerrit's “Analytics” group currently hold
* Push (including Force Push)
* Push Merge Commit
* Forge Author Identity
* Forge Committer Identity
permissions on “analytics/*” projects in Gerrit. But those permissions
have gotten, and continue to get, in the way one way or another.
Do we need those permissions for our repos?
If no one objects, I'll start removing them on 2014-04-28.
---- quelltextlich e.U. ---- Christian Aistleitner ----
Companies' registry: 360296y in Linz
Gruendbergstrasze 65a        Email: christian(a)quelltextlich.at
4040 Linz, Austria           Phone: +43 732 / 26 95 63
                             Fax: +43 732 / 26 95 63
As you've probably heard, last week we deployed ulsfo in production,
reducing latency for Oceania, East/Southeast Asia, and the US/Canada
Pacific/west-coast states. My estimate of the user base affected by
this is 360 million users (as in, Internet users, not Wikipedia users).
I was wondering if you have an easy way to measure and plot the impact
in page load time, perhaps using Navigation Timing data?
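For what it's worth, a rough sketch of how this could be pulled from the log
DB with SQLAlchemy is below. The table and column names (the schema revision
suffix, event_responseStart, a per-country column) are assumptions for
illustration, not necessarily the actual NavigationTiming schema:

    # Sketch only: table/column names are assumptions, not the real schema.
    import sqlalchemy

    db = sqlalchemy.create_engine(
        'mysql://research:PASSWORD@analytics-store.eqiad.wmnet/log')

    query = """
        SELECT LEFT(timestamp, 8) AS day,
               AVG(event_responseStart) AS mean_response_start
        FROM NavigationTiming_XXXXX   -- hypothetical schema revision
        WHERE event_originCountry IN ('AU', 'NZ', 'JP', 'KR')  -- hypothetical column
        GROUP BY day
        ORDER BY day
    """
    for day, mean_rt in db.execute(query):
        print(day, mean_rt)

Plotting mean (or better, median) response start per day around the rollout
dates should make any latency drop visible.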
The operations team has spent a considerable amount of time and money to
deploy ulsfo and I believe it'd be useful for us and the organization at
large to be able to quantify this effort.
The exact dates of the rollout by country/region codes can be found in
operations/dns' git history:
(the commits should be self-explanatory, but I'd be happy to clarify if
anything is unclear).
Thanks for the interesting information!
"List of total drama characters" is shown as having approximately 25k revisions. But "List of total drama characters" is a redirect with a single edit, and the end page of the redirect "Total drama" has only 3,270 revisions according go xtools. What happened here?
I’m very excited to share some updates from ops on analytics-store.eqiad.wmnet, aka “the one box to rule them all”.
This box (which you access with the “research” SQL credentials) gives you:
1) read access to replicas of all production DBs consolidated on a single machine
2) read access to all EventLogging data via the log DB
3) read/write access to a shared staging DB that can be used as scratch space for temporary tables (similar to the staging DB on s1-analytics). If you create tables on staging, please prefix them with your shell user id (e.g. dartar_foo).
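For example, a minimal sketch of the staging workflow with SQLAlchemy
(credentials and column definitions are placeholders):

    import sqlalchemy

    db = sqlalchemy.create_engine(
        'mysql://research:PASSWORD@analytics-store.eqiad.wmnet/staging')

    # Scratch table prefixed with the owner's shell user id, per the
    # convention above.
    db.execute("""
        CREATE TABLE IF NOT EXISTS dartar_foo (
            wiki VARBINARY(32),
            n    BIGINT
        )
    """)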
This is some of the best news I've gotten from ops since I joined the WMF, and it will make my work way easier – thanks Sean and anybody else who helped make this happen.
Ops is also working on a solution to consolidate all credentials for analytics databases in a single place, via the creation of a “researcher” user group. I'll send a note to the list when this is completed.
(analytics-store.eqiad.wmnet is a CNAME for dbstore1002.eqiad.wmnet.)
I'd like to hear from stakeholders about purging old data from the
eventlogging database. Yes, no, why [not], etc.
I understand from Ori that there is a 90 day retention policy, and that
purging has been discussed previously but not addressed for various
reasons. Certainly there are many rows with timestamps older than 90 days
still in the db, apparently largely untouched by queries.
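For concreteness, enforcing a 90-day window on one table could look roughly
like the sketch below, run with a write-capable account. The table name is
hypothetical; EventLogging tables keep MediaWiki-format timestamps, and
deleting in batches avoids one huge transaction:

    import datetime
    import sqlalchemy

    db = sqlalchemy.create_engine(
        'mysql://SOME_RW_USER:PASSWORD@db1047.eqiad.wmnet/log')

    # Cutoff in MediaWiki timestamp format (YYYYMMDDHHMMSS).
    cutoff = (datetime.datetime.utcnow()
              - datetime.timedelta(days=90)).strftime('%Y%m%d%H%M%S')

    # Delete in bounded batches so replication and other clients keep up.
    while True:
        result = db.execute(
            "DELETE FROM SomeSchema_1234567 WHERE timestamp < %s LIMIT 10000",
            (cutoff,))
        if result.rowcount < 10000:
            break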
Perhaps we're in a better position now to do this properly, given that the
data now lives in multiple places: log files, database, Hadoop...
Can we please purge stuff? :-)
DBA @ WMF
At about 2014-03-18 00:04 UTC, db1047 stopped accepting incoming
connections. At some point during the subsequent hour, MariaDB had either
crashed or been manually restarted. Sean noticed that the database was
choking on some queries from the researchers and notified the wmfresearch
list.
While the database server was down or rejecting connections,
the EventLogging writer that writes to db1047 was repeatedly failing to
connect to it:
sqlalchemy.exc.OperationalError: (OperationalError) (2003, "Can't connect
to MySQL server on 'db1047.eqiad.wmnet' (111)")
The Upstart job for EventLogging is configured to re-spawn the writer, up
to a certain threshold of failures. Because the writer repeatedly failed to
connect, it hit the threshold, and was not re-spawned.
This triggered an Icinga alert:
[00:04:24] <icinga-wm> PROBLEM - Check status of defined EventLogging jobs
on vanadium is CRITICAL: CRITICAL: Stopped EventLogging jobs:
This alert was not responded to. I finally got pinged by Tillman, who
noticed the blog visitor stats report was blank, and by Gilles, who noticed
image loading performance data was missing.
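One way to make the writer more tolerant of this failure mode would be a
bounded retry with backoff inside the writer itself, before Upstart's respawn
limit ever comes into play. A sketch, not the current writer code:

    import time
    import sqlalchemy
    from sqlalchemy.exc import OperationalError

    def connect_with_backoff(url, attempts=8, base_delay=1):
        """Return an engine, sleeping progressively longer between
        failed connection attempts instead of dying immediately."""
        for attempt in range(attempts):
            try:
                engine = sqlalchemy.create_engine(url)
                engine.connect().close()  # force a real connection attempt
                return engine
            except OperationalError:
                time.sleep(base_delay * 2 ** attempt)
        raise RuntimeError('giving up after %d attempts' % attempts)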
We have to fix this. The level of maintenance that EventLogging gets is not
proportional to its usage across the organization. Analytics, I really need
you to step up your involvement.
It was not long ago that EventLogging was running reliably for months at a
time. What has changed is not system load, but the owner seat becoming
vacant, leading to a gradual deterioration of the quality of monitoring and
maintenance.
Sean proposed moving the EventLogging database to m2, so that it runs on
separate hardware from the research databases. I think he's right. I filed <
https://rt.wikimedia.org/Ticket/Display.html?id=7081> to request the move.
There is some rot in the Ganglia and Graphite monitoring code for
EventLogging. I don't think it would take much to fix. Could the Analytics
team take this on?
The Puppet code is well-documented. <
https://wikitech.wikimedia.org/wiki/EventLogging> could use some updating,
but it is mostly current.
Finally, I think EventLogging Icinga alerts should have a higher profile,
and possibly page someone. Issues can usually be debugged using the
eventloggingctl tool on Vanadium and by inspecting the log files.
Before someone emails me about this... :-)
s1-analytics-slave eventlogging replication is starting to lag again
(enwiki replication is ok).
I noticed that new eventlogging tables are using InnoDB instead of TokuDB
on that slave. The issue is being fixed and we should be back up to speed
within the day.
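For anyone who wants to see which tables are affected, information_schema
makes them easy to spot (a sketch, using the research credentials):

    import sqlalchemy

    db = sqlalchemy.create_engine(
        'mysql://research:PASSWORD@s1-analytics-slave/information_schema')

    # List eventlogging tables on this slave that are not on TokuDB.
    query = """
        SELECT table_name, engine
        FROM tables
        WHERE table_schema = 'log'
          AND engine <> 'TokuDB'
    """
    for name, storage_engine in db.execute(query):
        print(name, storage_engine)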