On 2013-04-12 7:33 PM, "Platonides" Platonides@gmail.com wrote:
On 09/04/13 18:20, Quim Gil wrote:
Hi Anubhav,
I have done a first reality check with Chris Steipp, who oversees the area of security and also spam prevention. Your idea is interesting and it seems to be feasible. This is a very good first step!
It would require adding a hook to MediaWiki core, but this could be a small, acceptable change.
I agree. Adding a hook is no problem.
Well a hook is obviously no problem, im not sure why a new one would be needed. Surely if the abuse filter has all the hooks it needs, so would this.
Qgill wrote:
It might have a performance penalty in a site like English Wikipedia with
plenty of concurrent edits, but for starters it could be potentially useful to the 99% of MediaWiki instances that have a significantly smaller number of daily edits and especially a very small number of editors and tools able / happy to deal with spam.
Hmm. I was playing with nlp-ish automated newpage patrol recently. One thing that crossed my mind was if it becomes too expensive, one could run the classifier in the job queue (and hence on a dedicated server(s) ) and tag changes shortly after the fact.
Last of all I would suggest you also read up on other people who have done machine learning approaches to vandalism detection. In particular user:cluebot_NG - http://en.wikipedia.org/wiki/User:Cluebot_NG . There is also a list of academic papers on the subject at http://en.wikipedia.org/w/index.php?title=User:Emijrp/Anti-vandalism_bot_cen... said, an extension like you are proposing does not have to be as good as the rather complex state of the art in order to be useful. Any effective system would probably be quite useful).
-bawolff