Hallo!
I'm needing a list of all active Wikipedia bots in all language versions in order to judge how important the edits are which I read from the table `revision' from databases like `dewiki_p'. I want to discriminate bot-edits from other minor edits by checking if the corresponding user name is that of a bot.
I could, of course, extract the names of all bots from HTML by parsing the page about `All Wikipedia bots' http://en.wikipedia.org/wiki/Category:All_Wikipedia_bots
But very likely there's a much more convenient way, like an sql table containing the bots.
(The list of bots running on the Wikimedia Toolserver http://en.wikipedia.org/wiki/Category:Wikipedia_bots_running_on_the_Wikimedi... contains to few bots for my purposes.)
Best Philipp
All bots on a given wiki have a row in the user_groups table with ug_group set to 'bot'. So, you can use that to get the list of all bot accounts on a wiki.
Petr Onderka [[User:Svick]]
On Tue, Jan 31, 2012 at 10:12, Philipp Zedler philipp@neue-musik.com wrote:
Hallo!
I'm needing a list of all active Wikipedia bots in all language versions in order to judge how important the edits are which I read from the table `revision' from databases like `dewiki_p'. I want to discriminate bot-edits from other minor edits by checking if the corresponding user name is that of a bot.
I could, of course, extract the names of all bots from HTML by parsing the page about `All Wikipedia bots' http://en.wikipedia.org/wiki/Category:All_Wikipedia_bots
But very likely there's a much more convenient way, like an sql table containing the bots.
(The list of bots running on the Wikimedia Toolserver http://en.wikipedia.org/wiki/Category:Wikipedia_bots_running_on_the_Wikimedi... contains to few bots for my purposes.)
Best Philipp
Toolserver-l mailing list (Toolserver-l@lists.wikimedia.org) https://lists.wikimedia.org/mailman/listinfo/toolserver-l Posting guidelines for this list: https://wiki.toolserver.org/view/Mailing_list_etiquette
Not all bots are in the bot group though, IIRC ClueBot isn't, and neither are a few others.
Simon On Jan 31, 2012 9:31 AM, "Petr Onderka" gsvick@gmail.com wrote:
All bots on a given wiki have a row in the user_groups table with ug_group set to 'bot'. So, you can use that to get the list of all bot accounts on a wiki.
Petr Onderka [[User:Svick]]
On Tue, Jan 31, 2012 at 10:12, Philipp Zedler philipp@neue-musik.com wrote:
Hallo!
I'm needing a list of all active Wikipedia bots in all language versions
in
order to judge how important the edits are which I read from the table `revision' from databases like `dewiki_p'. I want to discriminate
bot-edits
from other minor edits by checking if the corresponding user name is
that of
a bot.
I could, of course, extract the names of all bots from HTML by parsing
the
page about `All Wikipedia bots' http://en.wikipedia.org/wiki/Category:All_Wikipedia_bots
But very likely there's a much more convenient way, like an sql table containing the bots.
(The list of bots running on the Wikimedia Toolserver
http://en.wikipedia.org/wiki/Category:Wikipedia_bots_running_on_the_Wikimedi...
contains to few bots for my purposes.)
Best Philipp
Toolserver-l mailing list (Toolserver-l@lists.wikimedia.org) https://lists.wikimedia.org/mailman/listinfo/toolserver-l Posting guidelines for this list: https://wiki.toolserver.org/view/Mailing_list_etiquette
Toolserver-l mailing list (Toolserver-l@lists.wikimedia.org) https://lists.wikimedia.org/mailman/listinfo/toolserver-l Posting guidelines for this list: https://wiki.toolserver.org/view/Mailing_list_etiquette
ClueBots certainly are in the group:
mysql> select user_name from user_groups join user on user_id = ug_user where ug_group = 'bot' and user_name like 'Clue%'; +-------------+ | user_name | +-------------+ | ClueBot | | ClueBot II | | ClueBot III | | ClueBot IV | | ClueBot NG | | ClueBot VI | +-------------+
Also, according to WP:BOTPOL (which applies only to enwiki):
Historically, being flagged as a bot account was distinct from the approval process; not all approved bots had that property. This stemmed from the fact that all bot edits were hidden from recent changes, and that was not universally desirable. Now that bot edits can be allowed to show up on recent changes, this is no longer necessary.
So, if there actually are some bots that don't have the flag, a bureaucrat could probably give it to the bot.
Petr Onderka [[User:Svick]]
On Tue, Jan 31, 2012 at 11:07, Simon Walker stwalkerster@googlemail.com wrote:
Not all bots are in the bot group though, IIRC ClueBot isn't, and neither are a few others.
Simon
On Jan 31, 2012 9:31 AM, "Petr Onderka" gsvick@gmail.com wrote:
All bots on a given wiki have a row in the user_groups table with ug_group set to 'bot'. So, you can use that to get the list of all bot accounts on a wiki.
Petr Onderka [[User:Svick]]
On Tue, Jan 31, 2012 at 10:12, Philipp Zedler philipp@neue-musik.com wrote:
Hallo!
I'm needing a list of all active Wikipedia bots in all language versions in order to judge how important the edits are which I read from the table `revision' from databases like `dewiki_p'. I want to discriminate bot-edits from other minor edits by checking if the corresponding user name is that of a bot.
I could, of course, extract the names of all bots from HTML by parsing the page about `All Wikipedia bots' http://en.wikipedia.org/wiki/Category:All_Wikipedia_bots
But very likely there's a much more convenient way, like an sql table containing the bots.
(The list of bots running on the Wikimedia Toolserver
http://en.wikipedia.org/wiki/Category:Wikipedia_bots_running_on_the_Wikimedi... contains to few bots for my purposes.)
Best Philipp
Toolserver-l mailing list (Toolserver-l@lists.wikimedia.org) https://lists.wikimedia.org/mailman/listinfo/toolserver-l Posting guidelines for this list: https://wiki.toolserver.org/view/Mailing_list_etiquette
Toolserver-l mailing list (Toolserver-l@lists.wikimedia.org) https://lists.wikimedia.org/mailman/listinfo/toolserver-l Posting guidelines for this list: https://wiki.toolserver.org/view/Mailing_list_etiquette
Toolserver-l mailing list (Toolserver-l@lists.wikimedia.org) https://lists.wikimedia.org/mailman/listinfo/toolserver-l Posting guidelines for this list: https://wiki.toolserver.org/view/Mailing_list_etiquette
Hallo Petr and Simon (and everyone else),
thank you very much for your replies!
All bots on a given wiki have a row in the user_groups table with ug_group set to 'bot'. So, you can use that to get the list of all bot accounts on a wiki.
I'm now using this solution by Petr in my program and it works very well. If a few bots among a few thousand are missing that should not matter too much.
Philipp
toolserver-l@lists.wikimedia.org