Is there a way to retrieve a canonical list of bots on enwiki or elsewhere?
I'm interested in omitting automated revisions (sorry Stuart!) for the
purposes of building co-authorship networks.
Grabbing everything under 'Category:All Wikipedia bots' excludes some major
ones like SmackBot, Cydebot, VIAFbot, Full-date unlinking bot, etc. because
these bots have changed names but the redirect is not categorized, the
account has been removed/deprecated, or a user appears to have removed the
relevant bot categories from the page.
Can anyone advise me on how to kill all the bots in my data without having
to resort to manual cleaning or hacky regex?
--
Brian C. Keegan, Ph.D.
Post-Doctoral Research Fellow, Lazer Lab
College of Social Sciences and Humanities, Northeastern University
Fellow, Institute for Quantitative Social Sciences, Harvard University
Affiliate, Berkman Center for Internet & Society, Harvard Law School
b.keegan(a)neu.edu
www.brianckeegan.com
M: 617.803.6971
O: 617.373.7200
Skype: bckeegan