On 29/12/2008, Brian <Brian.Mingus@colorado.edu> wrote:
Using logistic regression we achieve 83% precision at 77% recall with our model. Compared to the rule-based methods that are currently applied in Wikipedia, our approach increases the F-Measure performance by 49% while being faster at the same time.
In my experience and reasonably expert knowledge of spam fighting, these are not very good statistics. If they had achieved over 99%, I would have been impressed, and if they had done that with even fewer false positives, I would have been thoroughly impressed.
And I don't consider it either-or. We should fight spammers of all kinds with all techniques that work.
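
For reference, here is a rough sketch of what the quoted figures amount to as a single F-measure. It assumes the balanced F1 (beta = 1), which the quoted passage does not actually specify, and the function name `f_measure` is only illustrative, not anything taken from the paper.

```python
def f_measure(precision: float, recall: float, beta: float = 1.0) -> float:
    """Return the F-beta score, the weighted harmonic mean of precision and recall.

    beta = 1.0 gives the balanced F1 score (an assumption here; the quoted
    passage does not say which F-measure variant was used).
    """
    if precision == 0.0 and recall == 0.0:
        return 0.0  # avoid division by zero when nothing was classified correctly
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Figures from the quoted passage: 83% precision at 77% recall.
quoted_f1 = f_measure(0.83, 0.77)
print(f"F1 = {quoted_f1:.3f}")

# Share of flagged edits that would be false positives at 83% precision.
print(f"false positives among flagged items = {1 - 0.83:.0%}")
```

Under that assumption the quoted model sits at an F1 of roughly 0.80, with about one in six flagged items being a false positive.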