[Labs-l] Someone using a Python framework is relentlessly hammering Harvard sites, resulting in an IP range ban.
Tim Landscheidt
tim at tim-landscheidt.de
Sun Dec 4 15:50:07 UTC 2016
"Maximilian Doerr" <maximilian.doerr at gmail.com> wrote:
> […]
> Consequently, InternetArchiveBot cannot ascertain if the site is alive or
> not because of this ban, and while I am working on a solution for these
> cases, it's really best if the bot is able to make these decisions on
> its own rather than deferring to a whitelist of sorts.
Ceterum censeo: Bots checking external web links should be
merged, and ideally made a production-grade thingy by WMF.
(MediaWiki extension for reuse by other sites?) In the
past, interwiki bots were en vogue which meant that for each
wiki there were a number of them doing the exact same thing,
and now this pattern has shifted to link checking bots.
To apply [[Cunningham's Law]]: From the looks of it,
InternetArchiveBot seems to be the most useful bot checking
external web links, and I think it is most productive to aim
efforts in that direction.
Tim