Re user agents, 2010 was a long time ago from an ops perspective.
The need for a user agent is no doubt as strong as in the 2010 edict from Domas, but I would like clarification from ops regarding 'gold standard' before we build something which may no longer be needed by them.
On Fri, Jul 11, 2014 at 11:32 PM, Amir Ladsgroup ladsgroup@gmail.com wrote:
On Fri, Jul 11, 2014 at 1:23 PM, Antoine Musso hashar+wmf@free.fr wrote:
On 10/07/2014 21:29, Amir Ladsgroup wrote:
As ISO 8859 supports the % character, this sounds like a reasonable solution to me. We just need to use the urllib2 library to encode it.
That is a bit hard to read though :-D The whole purpose is for site operators to quickly find out who is behind the bot and work with them to fix it / stop hammering the site. A human-readable user-agent with a detailed point of contact for the bot operator will dramatically speed up looking up the contact.
I disagree: decoding websites can decode a username in just a second, and note that only a very small proportion of bot usernames need to be encoded (i.e. those where the encoded version is not the same as the real one).
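For illustration, a minimal sketch of the percent-encoding approach discussed above. It assumes Python 3, where the quoting functions live in urllib.parse rather than urllib2; the build_user_agent helper, the "ExampleBot/1.0" product token, and the sample usernames are hypothetical and not part of Pywikipedia.

```python
from urllib.parse import quote, unquote


def build_user_agent(username, tool="ExampleBot/1.0"):
    """Return a user-agent string with the username percent-encoded.

    Hypothetical helper: the layout of the string is an assumption,
    not the format Pywikipedia actually sends.
    """
    # safe="" percent-encodes every reserved character, so the result is
    # pure ASCII and fits in an ISO 8859-1 header value.
    encoded = quote(username, safe="")
    return "{} (User:{})".format(tool, encoded)


# An ASCII-only username is unchanged by encoding.
plain = build_user_agent("ExampleBot")

# A non-ASCII username is encoded as UTF-8 bytes written as %XX escapes.
fancy_name = "Bot-Ünïcode"
fancy = build_user_agent(fancy_name)

# A site operator can recover the original name with unquote().
recovered = unquote(quote(fancy_name, safe=""))
```

This shows both sides of the disagreement: most usernames pass through unchanged, while the rest stay recoverable with a single unquote() call but are harder to read at a glance.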
Best
-- Antoine "hashar" Musso
Pywikipedia-l mailing list Pywikipedia-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/pywikipedia-l
-- Amir