On 18/07/06, Dan Davis <hokie99cpe+wiki(a)gmail.com> wrote:
On 7/18/06, Andy Roberts <aroberts(a)gmail.com>
wrote:
In the end I modified Rotem's suggestion. I
tested it using
http://ioctl.org/jan/test/regexp.htm and found that it matched with
the number "1" anywhere in the top level domain, not just at the start
hence:
|\.1[a-z\.]*\.org|
seems to cover it.
Oops... forgot about that part... $wgSpamRegex="/\.1.*\.org/";
Are you worried about sites with multiple numbers at the beginning
11site.org? Or, are they all a single 1 followed by text?
Just single 1's for now.
Here's another pattern though -
I suffer roughly weekly from a bot which doesn't add any links, it
just edits several existing pages and adds a line consisting of about
a dozen random digits eg:
300142760257
But different every time.
That kind of behaviour, combined with changing IP nos and delays of a
few minutes between edits seems to be theoretically impossible to
defend against, as well as pointless.
I'm loathe to force login to edit, because the number of genuine
contributions does drop a little when I resort to that.
--
Andy Roberts