On 16 June 2015 at 20:50, Stas Malyshev <smalyshev(a)wikimedia.org> wrote:
Hi!
In a recent meeting, Oliver expressed concerns about us having services
running in labs which are treated sort of like they were in production.
Examples include WDQS (already) and maps (potentially).
What specifically are the concerns?
As I understand it, the requirements to run a service in production are
much higher than in labs, so we can either run it on labs or not run it
at all, at least for the time it takes to complete all the work required
to fill the delta.
See below.
issues to
explain them properly, so hopefully someone else will step in
and do so. I believe one big area of concern is analytics.
What about analytics? Is it about analyzing WDQS? I'd be glad to help if
I can, though I'm not sure what needs to be done there.
The problem, which we've gone back and forth about for a while on
Phabricator, is that labs has absolutely zero built-in infrastructure
for analytics.
If things are in production they go through the frontend varnishes,
which are hooked up to HDFS, and all is fine. We have the request
logs. If things are in labs...nothing. There is no access to HDFS,
there is no consistent varnish setup that pipes things there, and
analytics engineering has pretty much no plans to set up that sort of
infrastructure.
--
Stas Malyshev
smalyshev(a)wikimedia.org
_______________________________________________
Wikimedia-search mailing list
Wikimedia-search(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikimedia-search
--
Oliver Keyes
Research Analyst
Wikimedia Foundation