Hey y'all
I'm working on a piece of research (largely recreational) on the old
problem of fingerprinting users with minimal information - namely the
combination of a user agent and an IP address. Basically I'm looking
to put together a piece of work showing:
1. How sub-standard it is;
2. How fast it decays;
3. How the sub-standardness varies by (platform|location)
This would be pretty doable with internal data; basically I'd need a
schema with IP, user agent and a per-user UUID that's got a decent
(>=24 hours) expiry time. My question: does anyone know of a table
with recent data that meets these requirements? And, if not, anyone
with EventLogging experience interested in working on the problem with
me?
--
Oliver Keyes
Count Logula
Wikimedia Foundation
_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics