I have no evidence that this is the case. A scan for the user table using the same fields/keys as the ones I used on the private slaves takes less than a second on tool labs.

On Nov 21, 2013, at 10:00 AM, Steven Walling <swalling@wikimedia.org> wrote:


On Thu, Nov 21, 2013 at 9:57 AM, Dario Taraborelli <dtaraborelli@wikimedia.org> wrote:
I tried uploading a cohort from a recent A/B test (1,780 unique user_id’s). The async validation took about 5 minutes to complete.

If I create a temporary table with the data in my CSV and run a join with the user table against a slave, the query to validate that these users exist takes about 400ms if I use user_id (primary key in enwiki.user) and about 3s using user_name (unique in enwiki.user). 

What’s the reason why it takes so long to validate a cohort in the application?

My understanding is that this is due to Labs being slow compared to stat1? 


--
Steven Walling,
Product Manager
_______________________________________________
Wikimetrics mailing list
Wikimetrics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikimetrics