Hi Daniel,
A lot of your ideas are covered by
http://en.wikipedia.org/wiki/Wikipedia:STiki. Andrew has done a lot of
great research; if you haven't read his papers yet, they might be a
good intro to the kinds of machine-learning approaches that have been
used.
That being said, I would love to have a system that is constantly
learning from the edits that are flagged as spam, and that we can
query with new edits from AbuseFilter to get a score for how likely it
is that a new edit is spam. If you get around to working on your
system, it would be great to work out some way to interface.
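To make the idea concrete, here is a minimal sketch of that kind of service: an incrementally trained naive Bayes classifier that learns from edits as they are flagged and returns a spam probability for new edit text. All names here (SpamScorer, learn, score) are hypothetical illustrations, not an existing API, and a real deployment would use richer features than word counts.

```python
import math
import re
from collections import defaultdict


class SpamScorer:
    """Incrementally learns from flagged edits and scores new ones."""

    def __init__(self):
        self.counts = {"spam": defaultdict(int), "ham": defaultdict(int)}
        self.totals = {"spam": 0, "ham": 0}
        self.docs = {"spam": 0, "ham": 0}

    @staticmethod
    def _tokens(text):
        return re.findall(r"[a-z0-9']+", text.lower())

    def learn(self, text, label):
        """Feed in an edit each time a filter flags it spam or ham."""
        self.docs[label] += 1
        for tok in self._tokens(text):
            self.counts[label][tok] += 1
            self.totals[label] += 1

    def score(self, text):
        """Return P(spam | text) via naive Bayes with Laplace smoothing."""
        vocab = set(self.counts["spam"]) | set(self.counts["ham"])
        logp = {}
        for label in ("spam", "ham"):
            prior = (self.docs[label] + 1) / (sum(self.docs.values()) + 2)
            lp = math.log(prior)
            for tok in self._tokens(text):
                lp += math.log((self.counts[label][tok] + 1) /
                               (self.totals[label] + len(vocab) + 1))
            logp[label] = lp
        # Normalize the two log-probabilities into a spam probability.
        m = max(logp.values())
        ps = math.exp(logp["spam"] - m)
        ph = math.exp(logp["ham"] - m)
        return ps / (ps + ph)


# Example: train on two flagged edits, then score a new one.
scorer = SpamScorer()
scorer.learn("buy cheap pills discount pills online", "spam")
scorer.learn("fixed a typo in the infobox template", "ham")
print(scorer.score("cheap pills online now"))  # high value, close to 1
```

An AbuseFilter hook would just call score() on each incoming edit and feed confirmed verdicts back through learn(), so the model keeps adapting as spammers change wording.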
On Thu, Aug 16, 2012 at 11:16 AM, Daniel Friesen
<lists(a)nadir-seen-fire.com> wrote:
I've had a good idea for an anti-spam system for
a while.
Blocks, captchas, local filters: all the tricks we've been using end up
not working well enough to easily deal with the spam on a lot of wikis.
I know this because I've been continually dealing with the spam on a small
dead wiki. Simple AntiSpam, AntiBot, Captchas, TorBlock, Abuse Filter...
Time after time I expand my filters more and more. But inevitably a few days
later spam not covered by my filters comes through and I have to do it
again.
I ended up having to deal with it again today, and then started writing out
the details I've had in mind for a while on a machine-learning based
anti-spam system:
https://www.mediawiki.org/wiki/User:Dantman/Anti-spam_system
Of course, while I have the whole idea for the UI, the backend, how to
handle the service, etc., I haven't done the actual machine-learning part
before.
Naturally, just like Gareth, OAuth, and other things, this is just
another one of my ideas that I don't have the time and resources to do,
and wish I had the financial backing to work on.
--
~Daniel Friesen (Dantman, Nadir-Seen-Fire) [
http://daniel.friesen.name]
_______________________________________________
Wikitech-l mailing list
Wikitech-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l