So we've just had invalid IPs... for how long? And how long ago was this reported without being fixed?
Awesome work surfacing the bug. But the fact that this has not been fixed so far and, moreover, that nobody on Analytics Engineering let consumers know (unless there's a thread I've missed somewhere) is deeply concerning. We have schemas and analyses that rely on this field. As a customer, I would like to know what the schedule is for fixing this bug.
On 2 January 2016 at 20:47, Tilman Bayer tbayer@wikimedia.org wrote:
Per https://phabricator.wikimedia.org/T119144, you are probably out of luck, as it seems there is basically no current EventLogging table with valid IPs (IP hashes) ...
Disregarding that, you could take a look at MobileWebSectionUsage or MobileWebUIClickTracking.
On Sat, Jan 2, 2016 at 10:00 AM, Oliver Keyes okeyes@wikimedia.org wrote:
Hey y'all
I'm working on a piece of research (largely recreational) on the old problem of fingerprinting users with minimal information - namely the combination of a user agent and an IP address. Basically I'm looking to put together a piece of work showing:
- How sub-standard it is;
- How fast it decays;
- How the sub-standardness varies by (platform|location)
This would be pretty doable with internal data; basically I'd need a schema with IP, user agent and a per-user UUID that's got a decent (>=24 hours) expiry time. My question: does anyone know of a table with recent data that meets these requirements? And, if not, anyone with EventLogging experience interested in working on the problem with me?
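For illustration only, here is a minimal sketch of the kind of measurement described above, assuming events are available as (IP hash, user agent, UUID) tuples. The function name `fingerprint_accuracy` and the sample rows are hypothetical and do not come from any existing EventLogging schema. It reports the share of (IP, user agent) fingerprints that map to exactly one UUID, which is one possible way to quantify how sub-standard the fingerprint is.

```python
from collections import defaultdict


def fingerprint_accuracy(events):
    """Fraction of (ip_hash, user_agent) fingerprints that map to exactly one UUID.

    `events` is an iterable of (ip_hash, user_agent, uuid) tuples.
    This tuple layout is an assumption, not a real EventLogging schema.
    """
    uuids_by_fingerprint = defaultdict(set)
    for ip_hash, user_agent, uuid in events:
        uuids_by_fingerprint[(ip_hash, user_agent)].add(uuid)

    if not uuids_by_fingerprint:
        return 0.0

    # A fingerprint is "unambiguous" when only one UUID was ever seen behind it.
    unambiguous = sum(1 for uuids in uuids_by_fingerprint.values() if len(uuids) == 1)
    return unambiguous / len(uuids_by_fingerprint)


# Made-up sample: two users share one fingerprint, one user has their own.
sample = [
    ("a1", "Firefox/43", "u1"),
    ("a1", "Firefox/43", "u2"),
    ("b2", "Chrome/47", "u3"),
    ("b2", "Chrome/47", "u3"),
]
print(fingerprint_accuracy(sample))
```

Running the same function over events bucketed by time since a UUID's first appearance, or grouped by platform or location, would be one way to approach the decay and variation questions.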
-- Oliver Keyes Count Logula Wikimedia Foundation
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
-- Tilman Bayer Senior Analyst Wikimedia Foundation IRC (Freenode): HaeB