On Sun, Nov 23, 2008 at 5:29 PM, Marco Schuster marco@harddisk.is-a-geek.org wrote:
Hi all,
as Domas' magic stuff(tm) currently gathers article traffic data very efficient: Could this system be expanded to get a list of user agents used to browse Wikipedia, sorted by their count? I think this would be a very cool way to have accurate statistics about browser usage, not only on Wikipedia.
Can you suggest a good user-agent scrubber? Many user-agents strings have various degrees of private/semi-private data stuffed into them.
I've looked at publishing user-agent stats for Wikimedia site before, but realized that I don't have enough knowledge to safely canonicize them without throwing out a ton of information. (I.e. I could break down IE vs Firefox vs Opera vs Safari; but if you want to know about less common user agents I'm not quite sure what information can be safely released)