On 16 June 2015 at 20:50, Stas Malyshev smalyshev@wikimedia.org wrote:
Hi!
In a recent meeting, Oliver expressed concerns about us having services running in labs which are treated sort of like they were in production. Examples include WDQS (already) and maps (potentially).
What specifically are the concerns?
As I understand it, the requirements to run a service in production are much higher than in labs, so we can either run it in labs or not run it at all, at least for the time it takes to complete all the work required to close the gap.
See below.
issues to explain them properly, so hopefully someone else will step in and do so. I believe one big area of concern is analytics.
What about analytics? Is it about analyzing WDQS? I'd be glad to help if I can, though I'm not sure what needs to be done there.
The problem, which we've gone back and forth on for a while on Phabricator, is that labs has absolutely zero built-in infrastructure for analytics.
If things are in production they go through the frontend varnishes, which are hooked up to HDFS, and all is fine. We have the request logs. If things are in labs...nothing. There is no access to HDFS, there is no consistent varnish setup that pipes things there, and analytics engineering has pretty much no plans to set up that sort of infrastructure.
-- Stas Malyshev smalyshev@wikimedia.org
Wikimedia-search mailing list Wikimedia-search@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikimedia-search