Ok, so I deployed the new cohort upload: https://metrics.wmflabs.org/cohorts/upload. Hopefully the explanation and examples make a bit more sense. And I think it's much cleaner this way. It's also faster :)
As always, let me know if you have any trouble.
On Fri, Nov 22, 2013 at 8:16 PM, Jessie Wild jwild@wikimedia.org wrote:
Good catch, Dan!
adding in the upfront option of indicating names or ids seems like a good/needed addition. I would recommend also including the definition of each as well - I think some of our users might get a little confused about the delineation just from the names (i.e., not realize that user_names are associated with user_ids).
On Fri, Nov 22, 2013 at 3:28 PM, Dan Andreescu dandreescu@wikimedia.orgwrote:
So, for now, until I figure out how to fix this, it will always prefer
user_names before user_ids.
I think this is an argument for making users specifying whether it's names or ids up front, and not allowing mixtures. Assuming it might be a mixture and looking for names first is almost certain to produce inaccurate results at some point. We have ids precisely to avoid collisions with names, allowing for renaming users, and other cases.
Yep, I just learned this the hard way and made a fool of myself in front of a bunch of people I admire. So, I'd be glad if I'm the only one that this happens to. If nobody objects, I'm going to allow the user to select whether their cohort contains user_ids OR user_names, and strictly prohibit mixtures.
Wikimetrics mailing list Wikimetrics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikimetrics
--
*Jessie WildGrantmaking Learning & Evaluation * *Wikimedia Foundation*
Imagine a world in which every single human being can freely share in the sum of all knowledge. Help us make it a reality! Donate to Wikimedia https://donate.wikimedia.org/
Wikimetrics mailing list Wikimetrics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikimetrics