We have some bot information from wikistats here. I don't think it's particularly actionable for what you are doing, but it might be interesting directionally.

-Toby


On Wed, Jul 10, 2013 at 3:09 PM, Jeremy Baron <jeremy@tuxmachine.com> wrote:
On Wed, Jul 10, 2013 at 9:54 PM, Noneof MicrosoftsBusiness
<phonenumberofthebeast@hotmail.com> wrote:
> We've been working on tracking down the top 25 articles for each week, but
> as you can see
>
> http://en.wikipedia.org/wiki/Wikipedia:5000
>
> it requires determining which rankings are due to actual human views and
> which are due to bots, and recently, the bots have been having a field day.
> I've been asked by the creator of the list to ask you for help and/or advice
> on how to use analytics to separate human from non-human views. Please let
> me know if there's anything that can be done.

I think at this point that would either require a change to the format
of the domas (anonymized) stats or an NDA and maybe some other
approvals. (or kraken! but rumor is that's not yet ready for the
general public)

-Jeremy

_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics