On Fri, 18 Mar 2005 15:20:34 -0500, Evan Prodromou evan@bad.dynu.ca wrote:
It's probably worth noting that there's already an excellent shared regexp list here:
For the record, Wikimedia also has a blacklist (which only matches inside URLs) at http://meta.wikimedia.org/wiki/Spam_blacklist
Perhaps this too could be combined with others using an auto-update + whitelist, as described at CommunityWiki (an ingenious system, I must say). Although, I note that cw's list is quite large, and there seems to be some legitimate concern at over-general entries (e.g. is *everything* under .uk.net *really* going to be spam?), so it might need heavy weeding before going "live" for something as large as {the Wikimedia group of sites}