I'm hoping for a tool to help clean up a spammed wiki. (We are merging with Ekopedia, and en.ekopedia.org has more spam than genuine pages.) Has anyone created such a script?
If not, then the way I imagine it is: * Find pages with exactly one editor and at least one link * Display title, a chunk of text and the link(s) * Offer to delete the page.
A few years I asked about finding pages with one editor and received some good suggestions (but ended up using AbuseFilter instead).
Bináris suggested an editor list and requiring that len(editorlist)==1 http://osdir.com/ml/python-pywikipediabot-general/2012-03/msg00015.html
Merlijn suggested a database query - I don't know where to start with that. http://osdir.com/ml/python-pywikipediabot-general/2012-03/msg00031.html
I have learnt since very basic python since then...
Thanks
One more idea to concern: spambots often follow certain name patterns. If anon edit is allowed, anons creating new pages or adding links may be suspicious. If not, then you may look for some spambots and recognize their naming patterns which may be matched by regular expressions.
2013/10/29 Chris Watkins chriswaterguy@appropedia.org
I'm hoping for a tool to help clean up a spammed wiki. (We are merging with Ekopedia, and en.ekopedia.org has more spam than genuine pages.) Has anyone created such a script?
If not, then the way I imagine it is:
- Find pages with exactly one editor and at least one link
- Display title, a chunk of text and the link(s)
- Offer to delete the page.
A few years I asked about finding pages with one editor and received some good suggestions (but ended up using AbuseFilter instead).
Bináris suggested an editor list and requiring that len(editorlist)==1 http://osdir.com/ml/python-pywikipediabot-general/2012-03/msg00015.html
Merlijn suggested a database query - I don't know where to start with that. http://osdir.com/ml/python-pywikipediabot-general/2012-03/msg00031.html
I have learnt since very basic python since then...
Thanks
Pywikipedia-l mailing list Pywikipedia-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/pywikipedia-l